A new benchmark for testing LLMs for deterministic outputs screenshot

What is A new benchmark for testing LLMs for deterministic outputs?

A multi-source LLM benchmark across text, image, and audio that measures JSON value accuracy per field, not just schema compliance. 20+ models, 7 metrics, full leaderboard.