Running time with prefix is not faster than running without prefix #1219

GuyEyalGong · 2025-01-09T14:20:41Z

Hi
We are trying to utilize the prefix prompt (tokens after the SOT) to reduce running time in cases we already transcribed the first part of the audio segment (running in sliding windows over long audio)

However, when comparing running times, we do not see an improvement.

Our assumption: For a given audio with the first X seconds transcribed, putting them in the prompt will result in fewer decoding steps in ctranslate. Are we wrong?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running time with prefix is not faster than running without prefix #1219

Running time with prefix is not faster than running without prefix #1219

GuyEyalGong commented Jan 9, 2025

Running time with prefix is not faster than running without prefix #1219

Running time with prefix is not faster than running without prefix #1219

Comments

GuyEyalGong commented Jan 9, 2025