Common Unified Benchmark Environments (CUBE) — a protocol standard that eliminates the integration tax of agentic benchmarks by providing a universal interface between benchmarks and evaluation frameworks.
arXiv Paper — Read the paper detailing the vision for this project.
About CUBE — Learn more about the project, the problem it solves, and who it’s for.
Get Started
- GitHub README — Installation, quick start, and contributing
- DeepWiki — Full API reference and guides
