scale-ai/swe-atlas-qna
v1.0SWE-Atlas Codebase QnA benchmark that evaluates AI agents' ability to comprehend and query existing codebases.
uvx harbor run -d scale-ai/swe-atlas-qna@1.0Tasks (124)
task-6905333b74f22949d97baa14
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa14b0611b0
task-6905333b74f22949d97baa15
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa15b0611b0
task-6905333b74f22949d97baa16
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa16b0611b0
task-6905333b74f22949d97baa17
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa17b0611b0
task-6905333b74f22949d97baa19
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa19b0611b0
task-6905333b74f22949d97baa1a
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1ab0611b0
task-6905333b74f22949d97baa1b
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1bb0611b0
task-6905333b74f22949d97baa1c
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1cb0611b0
task-6905333b74f22949d97baa1d
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1db0611b0
task-6905333b74f22949d97baa1e
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1eb0611b0
task-6905333b74f22949d97baa1f
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1fb0611b0
task-6905333b74f22949d97baa20
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa20b0611b0
task-6905333b74f22949d97baa21
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa21b0611b0
task-6905333b74f22949d97baa22
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa22b0611b0
task-6905333b74f22949d97baa23
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa23b0611b0
task-6905333b74f22949d97baa24
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa24b0611b0
task-6905333b74f22949d97baa25
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa25b0611b0
task-6905333b74f22949d97baa26
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa26b0611b0
task-6905333b74f22949d97baa27
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa27b0611b0
task-6905333b74f22949d97baa28
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa28b0611b0
task-6905333b74f22949d97baa2a
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2ab0611b0
task-6905333b74f22949d97baa2b
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2bb0611b0
task-6905333b74f22949d97baa2c
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2cb0611b0
task-6905333b74f22949d97baa2d
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2db0611b0