Skip to content

Misc v0.7.0 Fixes#843

Merged
mergify[bot] merged 3 commits into
vllm-project:mainfrom
sjmonson:fix/misc
Jun 25, 2026
Merged

Misc v0.7.0 Fixes#843
mergify[bot] merged 3 commits into
vllm-project:mainfrom
sjmonson:fix/misc

Conversation

@sjmonson

@sjmonson sjmonson commented Jun 24, 2026

Copy link
Copy Markdown
Collaborator

Summary

Fixes a couple smaller issues.

Details

1. Bump max_concurrency

See more details in the commit description, but the primary reason to bump it now is because the replay profile is also subjected to it.

2. Drop the max-request modifier from replay

This code never made sense to have and was just a holdover from when the profile was responsible for trace timing. There is an automatic constraint that ends the benchmark when the dataset is exhausted that serves the same purpose.

Test Plan

For replays a longer dataset can be provided and limited to a smaller number of samples:

guidellm run \
    --backend kind=openai_http,target=https://localhost:8000 \
    --profile kind=replay,time_scale=0.01 \
    --data-loader kind=pytorch,samples=30 \
    --data kind=trace_synthetic,path=./data.jsonl

  • "I certify that all code in this PR is my own, except as noted below."

Use of AI

  • Includes code generated or substantially modified by an AI agent
  • Includes tests generated or substantially modified by an AI agent

NOTE: the Generated-by or Assisted-by trailers should be used in git commit messages when code or tests were generated or substantially modified by an AI agent, as described in the project's DEVELOPING.md file.


git log

commit 78b4014
Author: Samuel Monson smonson@redhat.com
Date: Wed Jun 24 13:52:57 2026 -0400

Bump max_concurrency

The max_concurrency limit applies to all non-concurrency workloads. The
current lmit is way too low and was kept that way because it is also the
default for sweep. This chnage splits the two.

Signed-off-by: Samuel Monson <smonson@redhat.com>

commit 850e8f3
Author: Samuel Monson smonson@redhat.com
Date: Thu Jun 25 15:07:50 2026 -0400

Remove timestamps injection from replay profile

Signed-off-by: Samuel Monson <smonson@redhat.com>

commit 522b457
Author: Samuel Monson smonson@redhat.com
Date: Thu Jun 25 15:23:31 2026 -0400

Drop unneeded tests

Signed-off-by: Samuel Monson <smonson@redhat.com>

Signed-off-by: Samuel Monson smonson@redhat.com

The max_concurrency limit applies to all non-concurrency workloads. The
current lmit is way too low and was kept that way because it is also the
default for sweep. This chnage splits the two.

Signed-off-by: Samuel Monson <smonson@redhat.com>
dbutenhof
dbutenhof previously approved these changes Jun 24, 2026
@jaredoconnell

Copy link
Copy Markdown
Collaborator

Ping me when this is ready for review. I don't want it to auto-merge.

sjmonson added 2 commits June 25, 2026 15:18
Signed-off-by: Samuel Monson <smonson@redhat.com>
Signed-off-by: Samuel Monson <smonson@redhat.com>
@sjmonson sjmonson marked this pull request as ready for review June 25, 2026 19:32
@mergify

mergify Bot commented Jun 25, 2026

Copy link
Copy Markdown
Contributor

Queued — the merge queue status continues in this comment ↓.

@dbutenhof dbutenhof left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And it stops when it should? Yea!

@mergify

mergify Bot commented Jun 25, 2026

Copy link
Copy Markdown
Contributor

Merge Queue Status

This pull request spent 3 minutes 41 seconds in the queue, including 3 minutes 22 seconds running CI.

Required conditions to merge
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = quality (3.10) / type-checks
    • check-neutral = quality (3.10) / type-checks
    • check-skipped = quality (3.10) / type-checks
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = quality (3.10) / precommit-checks
    • check-neutral = quality (3.10) / precommit-checks
    • check-skipped = quality (3.10) / precommit-checks
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = quality (3.10) / quality-checks
    • check-neutral = quality (3.10) / quality-checks
    • check-skipped = quality (3.10) / quality-checks
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = tests (3.10) / e2e-tests
    • check-neutral = tests (3.10) / e2e-tests
    • check-skipped = tests (3.10) / e2e-tests
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = tests (3.10) / integration-tests
    • check-neutral = tests (3.10) / integration-tests
    • check-skipped = tests (3.10) / integration-tests
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = tests (3.10) / unit-tests
    • check-neutral = tests (3.10) / unit-tests
    • check-skipped = tests (3.10) / unit-tests
  • any of [🛡 GitHub repository ruleset rule Merge Requirements]:
    • check-success = update-description
    • check-neutral = update-description
    • check-skipped = update-description

mergify Bot added a commit that referenced this pull request Jun 25, 2026
@mergify mergify Bot added the queued label Jun 25, 2026
@mergify mergify Bot merged commit d44a004 into vllm-project:main Jun 25, 2026
13 of 14 checks passed
@mergify mergify Bot removed the queued label Jun 25, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants