Overview of Quality Assessment Findings
Below is an overview of our quality assessment for each public summary. We scored each section of the summary based on our developed methodology, and then calculated the overall score and grade. Each summary is scored for two dimensions: Transparency (T) on how the information is provided, and Usefulness (U) for whether it is sufficient for stakeholders' needs. For convenience, all scores are reflected as a percentage (out of hundred) and the grades are expressed on a scale from A+ (highest) to F (lowest). Sections in public summaries that were not filled in because they were not applicable are marked as "N/A". Public summaries not being provided are marked using "!". The development of the framework and the evaluation steps are described on our methodology page along with a FAQ.
| Section→ | Grade | Overall | General Information | Public Data Sources | Private Data Sources | Scraped/Crawled Data | User Data | Synthetic & Other Data | Data Processing | Document | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model↓ | T | U | T | U | T | U | T | U | T | U | T | U | T | U | T | U | T | U | T | U |
| Apertus | A | A+ | 92 | 97 | 74 | 93 | 100 | 100 | 100 | N/A | 100 | N/A | 100 | N/A | 100 | N/A | 100 | 100 | 87 | 84 |
| Bria 3.2 | B+ | A | 86 | 94 | 69 | 92 | 100 | N/A | 100 | 100 | 100 | N/A | 100 | N/A | 100 | N/A | 100 | 100 | 70 | 68 |
| SmolLM3-3B | B+ | B+ | 82 | 86 | 73 | 90 | 96 | 100 | 100 | N/A | 100 | N/A | 100 | N/A | 60 | 0 | 92 | 100 | 68 | 58 |
| Bielik v3 11B Instruct | B+ | C+ | 88 | 71 | 87 | 93 | 86 | 100 | 100 | N/A | 83 | 51 | 100 | N/A | 100 | 100 | 98 | 71 | 88 | 84 |
| Phi-4 | D | F | 33 | 24 | 70 | 100 | 3 | 0 | 53 | N/A | 0 | N/A | 0 | N/A | 8 | 0 | 52 | 0 | 87 | 81 |
| Claude Sonnet 4.5 | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! |
| Gemini 2.5 Flash Image | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! |
| GPT-5 | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! |
| GPT-OSS | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! |
| Sora 2 | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! | ! |
Source and Analysis
-
Apertus by Swiss AI Initiative
Based on public summary dated 2025-09-01 and evaluated on 2026-01-12. See detailed analysis. -
Bielik v3 11B Instruct by SpeakLeash
Based on public summary dated 2026-01-01 and evaluated on 2026-01-12. See detailed analysis. -
Bria 3.2 by Bria AI
Based on public summary dated 2026-01-06 and evaluated on 2026-01-12. See detailed analysis. -
Claude Sonnet 4.5 by Anthropic
no public summary found -
Gemini 2.5 Flash Image by Google
no public summary found -
GPT-5 by OpenAI
no public summary found -
GPT-OSS by OpenAI
no public summary found -
Phi-4 by Microsoft
Based on public summary dated 2025-11-24 and evaluated on 2026-01-12. See detailed analysis. -
SmolLM3-3B by HuggingFace
Based on public summary dated 2025-07-25 and evaluated on 2026-01-12. See detailed analysis. -
Sora 2 by OpenAI
no public summary found