[Benchmark] Allow oversample request in benchmark dataset #15170

JenZhao · 2025-03-19T22:47:18Z

Enable oversample requests in HuggingFaceDataset, VisionArenaDataset and ShareGPTDataset.

This is mostly needed for small datasets like visionarena.

This branch

Dataset	Backend	Successful requests	Benchmark duration (s)	Total input tokens
hf-vision-arena	openai-chat	10000	444.00	627745
hf	openai-chat	10000	490.06	114223
sharegpt	vllm	10000	262.65	2205227

Main branch

Dataset	Backend	Successful requests	Benchmark duration (s)	Total input tokens
hf-vision-arena	openai-chat	500	36.29	33418
hf	openai-chat	10000	413.91	114223
sharegpt	vllm	10000	297.65	2205227

vllm serve Qwen/Qwen2-VL-7B-Instruct     --swap-space 16     --disable-log-requests

python3 vllm/benchmarks/benchmark_serving.py --model Qwen/Qwen2-VL-7B-Instruct --backend openai-chat --endpoint /v1/chat/completions --dataset-name hf --dataset-path lmarena-ai/vision-arena-bench-v0.1 --hf-split train --num-prompts 10000 --percentile-metrics ttft,tpot,e2el

python3 vllm/benchmarks/benchmark_serving.py --model Qwen/Qwen2-VL-7B-Instruct --backend openai-chat --endpoint /v1/chat/completions --dataset-name hf --dataset-path lmms-lab/LLaVA-OneVision-Data --hf-split train --hf-subset "chart2text(cauldron)" --num-prompts 10000 --percentile-metrics ttft,tpot,e2el

python3 vllm/benchmarks/benchmark_serving.py --backend vllm --model Qwen/Qwen2-VL-7B-Instruct --dataset-name sharegpt --dataset-path /home/jovyan/data/vllm_benchmark_datasets/ShareGPT_V3_unfiltered_cleaned_split.json --num-prompts 10000

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

github-actions · 2025-03-19T22:47:27Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

ywang96

Overall LGTM - left a small naming nit

benchmarks/benchmark_dataset.py

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

JenZhao added 8 commits March 19, 2025 20:45

update dataset sampling

1440b84

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

Merge branch 'vllm-project:main' into fix_sample

74dcb95

update

6023013

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

update

99d8f36

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

update

291dfa2

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

update

9382a50

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

update

88d0823

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

update

abddb11

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

JenZhao changed the title ~~[Feature] Allow oversample request in benchmark dataset~~ [Benchmark] Allow oversample request in benchmark dataset Mar 19, 2025

update readme with more examples

276b8a0

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

JenZhao marked this pull request as ready for review March 19, 2025 23:15

ywang96 reviewed Mar 19, 2025

View reviewed changes

benchmarks/benchmark_dataset.py Outdated Show resolved Hide resolved

update naming

963054d

Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

ywang96 approved these changes Mar 20, 2025

View reviewed changes

ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 20, 2025

DarkLight1337 merged commit b88be22 into vllm-project:main Mar 20, 2025
21 of 23 checks passed

erictang000 pushed a commit to erictang000/vllm that referenced this pull request Mar 25, 2025

[Benchmark] Allow oversample request in benchmark dataset (vllm-proje…

97bab14

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

gmarinho2 pushed a commit to gmarinho2/vllm that referenced this pull request Apr 1, 2025

[Benchmark] Allow oversample request in benchmark dataset (vllm-proje…

4876126

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[Benchmark] Allow oversample request in benchmark dataset (vllm-proje…

99e48a3

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com> Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>

nishith-fujitsu pushed a commit to nishith-fujitsu/vllm that referenced this pull request Apr 9, 2025

[Benchmark] Allow oversample request in benchmark dataset (vllm-proje…

9a557c8

…ct#15170) Signed-off-by: Jennifer Zhao <ai.jenniferzhao@gmail.com>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Benchmark] Allow oversample request in benchmark dataset #15170

[Benchmark] Allow oversample request in benchmark dataset #15170

JenZhao commented Mar 19, 2025 •

edited by github-actions bot

Loading

github-actions bot commented Mar 19, 2025

ywang96 left a comment

[Benchmark] Allow oversample request in benchmark dataset #15170

[Benchmark] Allow oversample request in benchmark dataset #15170

Conversation

JenZhao commented Mar 19, 2025 • edited by github-actions bot Loading

github-actions bot commented Mar 19, 2025

ywang96 left a comment

Choose a reason for hiding this comment

JenZhao commented Mar 19, 2025 •

edited by github-actions bot

Loading