PRISM-Audio
3 Benchmarks
Audio & Speech Understanding
Evaluation of speech recognition, medical audio reasoning, and spectrogram-based analysis across frontier multimodal models — from dialect-aware ASR to clinical soundscape understanding.
Model Rankings
Weighted Avg · higher = better
Scatter Plot
Full Model Comparison
| PROVIDER | MODEL | WEIGHTED AVG | MCQ LONG FORM | MCQ SOUND COUGH | MCQ SOUND HEART | MCQ SOUND LUNG | MCQ SPEECH | MCQ SPEECH+SOUND | MULTI-TURN | OE SPEECH |
|---|---|---|---|---|---|---|---|---|---|---|
| gemini-2.5-pro | 68.10 | 72.30 | 71.20 | 65.40 | 70.80 | 82.10 | 69.30 | 48.20 | 61.40 | |
| gemini-2.5-flash | 60.50 | 65.10 | 63.40 | 58.20 | 62.70 | 74.30 | 61.80 | 42.10 | 54.20 | |
| alibaba | qwen-2.5-omni-7b | 42.80 | 45.20 | 44.10 | 39.80 | 43.20 | 52.40 | 43.70 | 31.20 | 38.10 |
| gemma-3n-8b | 42.10 | 44.80 | 43.20 | 38.90 | 42.70 | 51.30 | 42.80 | 30.40 | 37.40 | |
| desta | desta25-audio | 41.00 | 43.70 | 42.10 | 37.80 | 41.50 | 50.20 | 41.70 | 29.80 | 36.80 |
| baichuan | baichuan-omni | 38.60 | 41.20 | 39.80 | 35.40 | 39.10 | 47.30 | 39.20 | 28.10 | 34.70 |
| microsoft | phi-4-mm | 37.30 | 39.80 | 38.40 | 34.10 | 37.80 | 45.70 | 37.90 | 27.20 | 33.40 |
| moonshot | kimi-audio | 36.40 | 38.90 | 37.50 | 33.20 | 36.90 | 44.80 | 37.10 | 26.40 | 32.70 |
| openai | gpt-4o-audio | 35.70 | 38.10 | 36.80 | 32.40 | 36.20 | 43.90 | 36.30 | 25.80 | 31.90 |
| community | audio-reasoner | 32.80 | 35.20 | 33.90 | 29.70 | 33.30 | 40.40 | 33.40 | 23.70 | 29.30 |
| community | audio-flamingo-3 | 24.10 | 25.90 | 24.90 | 21.80 | 24.50 | 29.70 | 24.60 | 17.40 | 21.60 |
| community | gama | 23.20 | 24.90 | 23.90 | 21.00 | 23.50 | 28.50 | 23.70 | 16.80 | 20.80 |
| community | r1-aqa | 20.80 | 22.40 | 21.50 | 18.80 | 21.10 | 25.60 | 21.30 | 15.10 | 18.70 |
Security
Disciplined security and privacy practices aligned with global standards to protect sensitive data, intellectual property, and model assets throughout the AI lifecycle.
Centific applies rigorous security, access control, and auditability standards to safeguard enterprise data, human workflows, and AI systems at scale.
Blog
Customer Stories
Proven results
with leading AI teams.
See how organizations use Centific’s data and expert services to build, deploy, and scale production-ready AI.
Connect with Centific
Updates from the frontier of AI data.
Receive updates on platform improvements, new workflows, evaluation capabilities, data quality enhancements, and best practices for enterprise AI teams.
Data
Infrastructure
engineered for Trust.
Confidently scale every part of your AI development lifecycle with secure, compliant, production-ready operations.
Connect data, models, and people — in one enterprise-ready platform.
Seamlessly connect your existing systems, infrastructure, and workflows — all in one unified platform.
Centific Premier Hackathon 2.0
This is your moment.
Seamlessly connect your existing systems, infrastructure, and workflows — all in one unified platform.
Connect data, models, and people — in one enterprise-ready platform.






