Overview of Quality Assessment Findings

Below is an overview of our quality assessment for each public summary. We scored each section of the summary based on our developed methodology, and then calculated the overall score and grade. Each summary is scored for two dimensions: Transparency (T) on how the information is provided, and Usefulness (U) for whether it is sufficient for stakeholders' needs. For convenience, all scores are reflected as a percentage (out of hundred) and the grades are expressed on a scale from A+ (highest) to F (lowest). Sections in public summaries that were not filled in because they were not applicable are marked as "N/A". Public summaries not being provided are marked using "!". The development of the framework and the evaluation steps are described on our methodology page along with a FAQ.

Section→ Grade Overall General Information Public Data Sources Private Data Sources Scraped/Crawled Data User Data Synthetic & Other Data Data Processing Document
Model↓ T U T U T U T U T U T U T U T U T U T U
Apertus A   A+ 92 97 74 93 100 100 100 N/A 100 N/A 100 N/A 100 N/A 100 100 87 84
Bria 3.2 B+ A   86 94 69 92 100 N/A 100 100 100 N/A 100 N/A 100 N/A 100 100 70 68
SmolLM3-3B B+ B+ 82 86 73 90 96 100 100 N/A 100 N/A 100 N/A 60 0 92 100 68 58
Bielik v3 11B Instruct B+ C+ 88 71 87 93 86 100 100 N/A 83 51 100 N/A 100 100 98 71 88 84
Phi-4 D   F   33 24 70 100 3 0 53 N/A 0 N/A 0 N/A 8 0 52 0 87 81
Claude Sonnet 4.5 !!!!!!!!!!!!!!!!!!!!
Gemini 2.5 Flash Image !!!!!!!!!!!!!!!!!!!!
GPT-5 !!!!!!!!!!!!!!!!!!!!
GPT-OSS !!!!!!!!!!!!!!!!!!!!
Sora 2 !!!!!!!!!!!!!!!!!!!!

Source and Analysis

  • Apertus by Swiss AI Initiative
    Based on public summary dated 2025-09-01 and evaluated on 2026-01-12. See detailed analysis.
  • Bielik v3 11B Instruct by SpeakLeash
    Based on public summary dated 2026-01-01 and evaluated on 2026-01-12. See detailed analysis.
  • Bria 3.2 by Bria AI
    Based on public summary dated 2026-01-06 and evaluated on 2026-01-12. See detailed analysis.
  • Claude Sonnet 4.5 by Anthropic
    no public summary found
  • Gemini 2.5 Flash Image by Google
    no public summary found
  • GPT-5 by OpenAI
    no public summary found
  • GPT-OSS by OpenAI
    no public summary found
  • Phi-4 by Microsoft
    Based on public summary dated 2025-11-24 and evaluated on 2026-01-12. See detailed analysis.
  • SmolLM3-3B by HuggingFace
    Based on public summary dated 2025-07-25 and evaluated on 2026-01-12. See detailed analysis.
  • Sora 2 by OpenAI
    no public summary found