Task Catalog¶
This page is the searchable public catalog of benchmark tasks.
Public Catalog¶
The current view is driven directly from the generated task_catalog.json file. It includes only the public test subset and supports lightweight search and domain filtering.
Data Source¶
The catalog is intended to be generated from task_catalog.json, which itself is built from structured task metadata in the main benchmark repository.