Overview of Quality Assessment Findings

Below is an overview of our quality assessment for each public summary. We scored each section of the summary based on our developed methodology, and then calculated the overall score and grade. Each summary is scored for two dimensions: Transparency (T) on how the information is provided, and Usefulness (U) for whether it is sufficient for stakeholders' needs. For convenience, all scores are reflected as a percentage (out of hundred) and the grades are expressed on a scale from A+ (highest) to F (lowest). Sections in public summaries that were not filled in because they were not applicable are marked as "N/A". Public summaries not being provided are marked using "!". The development of the framework and the evaluation steps are described on our methodology page along with a FAQ.

Section→	Grade		Overall		General Information		Public Data Sources		Private Data Sources		Scraped/Crawled Data		User Data		Synthetic & Other Data		Data Processing		Document
Model↓	T	U	T	U	T	U	T	U	T	U	T	U	T	U	T	U	T	U	T	U
Apertus	A	A+	92	97	74	93	100	100	100	N/A	100	N/A	100	N/A	100	N/A	100	100	87	84
Bria 3.2	B+	A	86	94	69	92	100	N/A	100	100	100	N/A	100	N/A	100	N/A	100	100	70	68
SmolLM3-3B	B+	B+	82	86	73	90	96	100	100	N/A	100	N/A	100	N/A	60	0	92	100	68	58
Bielik v3 11B Instruct	B+	C+	88	71	87	93	86	100	100	N/A	83	51	100	N/A	100	100	98	71	88	84
Phi-4	D	F	33	24	70	100	3	0	53	N/A	0	N/A	0	N/A	8	0	52	0	87	81
Claude Sonnet 4.5	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!
Gemini 2.5 Flash Image	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!
GPT-5	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!
GPT-OSS	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!
Sora 2	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!	!

Source and Analysis

Apertus by Swiss AI Initiative
Based on public summary dated 2025-09-01 and evaluated on 2026-01-12. See detailed analysis.
Bielik v3 11B Instruct by SpeakLeash
Based on public summary dated 2026-01-01 and evaluated on 2026-01-12. See detailed analysis.
Bria 3.2 by Bria AI
Based on public summary dated 2026-01-06 and evaluated on 2026-01-12. See detailed analysis.
Claude Sonnet 4.5 by Anthropic
no public summary found
Gemini 2.5 Flash Image by Google
no public summary found
GPT-5 by OpenAI
no public summary found
GPT-OSS by OpenAI
no public summary found
Phi-4 by Microsoft
Based on public summary dated 2025-11-24 and evaluated on 2026-01-12. See detailed analysis.
SmolLM3-3B by HuggingFace
Based on public summary dated 2025-07-25 and evaluated on 2026-01-12. See detailed analysis.
Sora 2 by OpenAI
no public summary found