scale-ai/swe-atlas-qna

v1.0

SWE-Atlas Codebase QnA benchmark that evaluates AI agents' ability to comprehend and query existing codebases.

uvx harbor run -d scale-ai/swe-atlas-qna@1.0

Tasks (124)

task-6905333b74f22949d97baa14
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa14
b0611b0
task-6905333b74f22949d97baa15
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa15
b0611b0
task-6905333b74f22949d97baa16
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa16
b0611b0
task-6905333b74f22949d97baa17
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa17
b0611b0
task-6905333b74f22949d97baa19
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa19
b0611b0
task-6905333b74f22949d97baa1a
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1a
b0611b0
task-6905333b74f22949d97baa1b
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1b
b0611b0
task-6905333b74f22949d97baa1c
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1c
b0611b0
task-6905333b74f22949d97baa1d
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1d
b0611b0
task-6905333b74f22949d97baa1e
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1e
b0611b0
task-6905333b74f22949d97baa1f
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa1f
b0611b0
task-6905333b74f22949d97baa20
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa20
b0611b0
task-6905333b74f22949d97baa21
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa21
b0611b0
task-6905333b74f22949d97baa22
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa22
b0611b0
task-6905333b74f22949d97baa23
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa23
b0611b0
task-6905333b74f22949d97baa24
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa24
b0611b0
task-6905333b74f22949d97baa25
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa25
b0611b0
task-6905333b74f22949d97baa26
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa26
b0611b0
task-6905333b74f22949d97baa27
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa27
b0611b0
task-6905333b74f22949d97baa28
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa28
b0611b0
task-6905333b74f22949d97baa2a
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2a
b0611b0
task-6905333b74f22949d97baa2b
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2b
b0611b0
task-6905333b74f22949d97baa2c
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2c
b0611b0
task-6905333b74f22949d97baa2d
uvx harbor run -d scale-ai/swe-atlas-qna@1.0 -t task-6905333b74f22949d97baa2d
b0611b0