Benchmark Runner
Run vision LLMs on 930 tasks and measure their performance
Recent Runs
No benchmark runs yet. Select a model and task above to get started.