Registry
Browse the datasets available in the Harbor registry.
uvx harbor datasets listuvx harbor run -d terminal-bench@2.089 tasks
uvx harbor run -d swebench-verified@1.0500 tasks
uvx harbor run -d researchcodebench@1.0212 tasks
uvx harbor run -d gso@1.0102 tasks
uvx harbor run -d cooperbench@1.0652 tasks
uvx harbor run -d legacy-bench@1.010 tasks
uvx harbor run -d scale-ai/swe-atlas-tw@1.090 tasks
uvx harbor run -d scale-ai/swe-atlas-qna@1.0124 tasks
uvx harbor run -d swebench_multilingual@1.0300 tasks
uvx harbor run -d spreadsheetbench-verified@1.0400 tasks
uvx harbor run -d pixiu@parity435 tasks
uvx harbor run -d rexbench@1.02 tasks
uvx harbor run -d ml-dev-bench@1.033 tasks
uvx harbor run -d featurebench@1.0200 tasks
uvx harbor run -d featurebench-lite-modal@1.030 tasks
uvx harbor run -d featurebench-modal@1.0200 tasks
uvx harbor run -d featurebench-lite@1.030 tasks
uvx harbor run -d ade-bench@1.048 tasks
uvx harbor run -d medagentbench@1.0300 tasks
uvx harbor run -d bigcodebench-hard-complete@1.0.0145 tasks
uvx harbor run -d deveval@1.063 tasks
uvx harbor run -d quixbugs@1.080 tasks
uvx harbor run -d qcircuitbench@1.028 tasks
uvx harbor run -d bfcl_parity@1.0123 tasks
uvx harbor run -d bfcl@1.03641 tasks
uvx harbor run -d labbench@1.0181 tasks
uvx harbor run -d satbench@1.02100 tasks
uvx harbor run -d financeagent@public50 tasks
uvx harbor run -d bird-bench@parity150 tasks
uvx harbor run -d kumo@1.05300 tasks
uvx harbor run -d kumo@hard250 tasks
uvx harbor run -d kumo@easy5050 tasks
uvx harbor run -d kumo@parity212 tasks
uvx harbor run -d gaia@1.0165 tasks
uvx harbor run -d simpleqa@1.04326 tasks
uvx harbor run -d termigen-environments@1.03566 tasks
uvx harbor run -d openthoughts-tblite@2.0100 tasks
uvx harbor run -d dabstep@1.0450 tasks
uvx harbor run -d code-contests@1.09644 tasks
uvx harbor run -d binary-audit@1.046 tasks
uvx harbor run -d otel-bench@1.026 tasks
uvx harbor run -d seta-env@1.01376 tasks
uvx harbor run -d vmax-tasks@1.01043 tasks
uvx harbor run -d mmmlu@parity150 tasks
uvx harbor run -d swe-gen-js@1.01000 tasks
uvx harbor run -d reasoning-gym-easy@parity288 tasks
uvx harbor run -d reasoning-gym-hard@parity288 tasks
uvx harbor run -d terminal-bench-sample@2.010 tasks
uvx harbor run -d swe-lancer-diamond@manager265 tasks
uvx harbor run -d swe-lancer-diamond@ic198 tasks
uvx harbor run -d swe-lancer-diamond@all463 tasks
uvx harbor run -d lawbench@1.01000 tasks
uvx harbor run -d crustbench@1.0100 tasks
uvx harbor run -d bixbench-cli@1.5205 tasks
uvx harbor run -d spider2-dbt@1.064 tasks
uvx harbor run -d algotune@1.0154 tasks
uvx harbor run -d ineqmath@1.0100 tasks
uvx harbor run -d ds-1000@head1000 tasks
uvx harbor run -d hello-world@1.01 tasks
uvx harbor run -d bixbench@1.5205 tasks
uvx harbor run -d strongreject@parity150 tasks
uvx harbor run -d arc_agi_2@1.0167 tasks
uvx harbor run -d humanevalfix@1.0164 tasks
uvx harbor run -d mmau@1.01000 tasks
uvx harbor run -d swtbench-verified@1.0433 tasks
uvx harbor run -d mlgym-bench@1.012 tasks
uvx harbor run -d gpqa-diamond@1.0198 tasks
uvx harbor run -d replicationbench@1.090 tasks
uvx harbor run -d aider-polyglot@1.0225 tasks
uvx harbor run -d terminal-bench-pro@1.0200 tasks
uvx harbor run -d swesmith@1.0100 tasks
uvx harbor run -d swebenchpro@1.0731 tasks
uvx harbor run -d sldbench@1.08 tasks
uvx harbor run -d compilebench@1.015 tasks
uvx harbor run -d autocodebench@lite200200 tasks
uvx harbor run -d usaco@2.0304 tasks
uvx harbor run -d aime@1.060 tasks
uvx harbor run -d codepde@1.05 tasks
uvx harbor run -d evoeval@1.0100 tasks
uvx harbor run -d livecodebench@6.0100 tasks