Add unique prefix - increasing counter #217

MML-coder · 2025-07-01T20:54:41Z

The SyntheticTextItemsGenerator was generating prompts that could trigger vLLM's automatic prefix caching, leading to hitting the prefix cache up to 80% in some cases during the performance benchmarking.

Implemented unique prefix injection to guarantee 0% prefix cache hit rate while maintaining realistic prompt characteristics.

Test:
Performing some tests on the H200 target accelerator to confirm the fix.

nm-red-hat-upstream-automation-bot · 2025-07-01T20:55:34Z

📦 Build Artifacts Available
The build artifacts (.whl and .tar.gz) have been successfully generated and are available for download: https://github.com/neuralmagic/guidellm/actions/runs/16009951836/artifacts/3444362586.
They will be retained for up to 30 days.

Initial commit to add unique prefix - increasing number

c2c72d7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add unique prefix - increasing counter #217

Add unique prefix - increasing counter #217

Uh oh!

MML-coder commented Jul 1, 2025

Uh oh!

nm-red-hat-upstream-automation-bot bot commented Jul 1, 2025

Uh oh!

Uh oh!

Add unique prefix - increasing counter #217

Are you sure you want to change the base?

Add unique prefix - increasing counter #217

Uh oh!

Conversation

MML-coder commented Jul 1, 2025

Uh oh!

nm-red-hat-upstream-automation-bot bot commented Jul 1, 2025

Uh oh!

Uh oh!