@athitten athitten commented Feb 7, 2026

Adds `inference_max_seq_len` to the Ray mbridge deployment path. This option was previously exposed only in the PyTriton path. It needs to be settable when deploying for eval benchmarks like HumanEval, which use a large `max_tokens` value.

Signed-off-by: Abhishree <abhishreetm@gmail.com>
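The wiring is roughly the following (a minimal sketch with hypothetical names; the actual NeMo deploy script's flag spelling, defaults, and deployment call signature may differ): a CLI flag is parsed and forwarded into the Ray deployment call, mirroring what the PyTriton path already does.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical deploy-script parser; the flag name mirrors the
    # option already exposed on the PyTriton path.
    parser = argparse.ArgumentParser(description="Deploy a model via Ray (sketch)")
    parser.add_argument(
        "--inference_max_seq_len",
        type=int,
        default=4096,  # assumed default, not taken from the PR
        help="Max total sequence length (prompt + generated tokens) the "
             "deployed model accepts; raise it for benchmarks like HumanEval "
             "that request a large max_tokens.",
    )
    return parser


def deploy_kwargs(args: argparse.Namespace) -> dict:
    # Forward the parsed flag into the (hypothetical) Ray mbridge
    # deployment call's keyword arguments.
    return {"inference_max_seq_len": args.inference_max_seq_len}
```

Without such a flag, the Ray path falls back to whatever sequence-length limit is hard-wired in the deployment defaults, which is what caused failures on large-`max_tokens` benchmarks.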
copy-pr-bot bot commented Feb 7, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@athitten athitten added r0.4.0 Cherry-pick PR to r0.4.0 release branch and removed deploy LLM scripts labels Feb 7, 2026

athitten commented Feb 7, 2026

/ok to test 8668e30

@oyilmaz-nvidia oyilmaz-nvidia merged commit 0997912 into main Feb 9, 2026
26 checks passed
@oyilmaz-nvidia oyilmaz-nvidia deleted the athitten/inf_max_seqlen_ray branch February 9, 2026 21:08
ko3n1g pushed a commit that referenced this pull request Feb 9, 2026
Signed-off-by: Abhishree <abhishreetm@gmail.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>