feat: multi-gpu inference, trajectory analyzer #2

andre15silva · 2025-12-20T17:07:24Z

Most of this PR is the trajectory analyzer, but it also includes the dependency changes that made it possible to run inference outside of apptainer and on the full 8-GPU node without any vllm issues coming up.

@BjarniHaukur you should be able to just run uv sync and then run it. See benchmarks/swe_bench/run_harness_eval.sh for the batch script.

I created a pyproject from scratch since I was facing some version conflicts, but I haven't added the training dependencies back to it yet. Let me know if adding them to the new setup is not enough and we can debug it. After that we can do a run to see whether the parallelism also works for training.

YourName and others added 11 commits December 2, 2025 13:52

update inference scripts

f2506b9

add support for running envs in apptainer

c7ea79e

fix swebench harness script

87ffa85

add matplotlib

17742e0

update nano agent, use setup function

5ccf313

update nano agent script

af0d377

clean repo state

be91855

trajectory analyzer

ec8b06e

update dependencies to include vllm

8d710fe

keep old pyproject for reference

fa17d7d

remove unused scripts

f8e745a

andre15silva requested a review from BjarniHaukur December 20, 2025 17:07
