PRISM evaluation suite · v2.0 · Jun 2026
The PRISM suite provides rigorous assessments of frontier models across Internationalization, Audio, Vision, Agentic & RL, Physical AI, Healthcare, and AI Safety.

PRISM-Agentic & RL
3 benchmarks
Frontier evaluation of LLM agents on real enterprise workflows: multi-step business analyst pipelines for requirements generation, and adversarial security agents for phishing triage that are red-teamed by attacker LLMs in closed-loop generation pipelines.