Running time with prefix is not faster than running without prefix #1219

Open
GuyEyalGong opened this issue Jan 9, 2025 · 0 comments

Hi,
We are trying to use the prefix prompt (the tokens placed after the SOT token) to reduce running time when the first part of an audio segment has already been transcribed (we run sliding windows over long audio).

However, when comparing running times, we do not see an improvement.

Our assumption: for a given audio segment whose first X seconds are already transcribed, putting that transcript in the prefix prompt should result in fewer decoding steps in CTranslate2. Are we wrong?
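
For reference, here is a minimal sketch of how the two cases could be timed against each other. It is not our exact benchmark: the helper names `time_full_decode` and `compare` are made up for illustration, and the commented usage assumes a faster-whisper-style API in which `transcribe` returns a lazy segment iterator, so the iterator has to be drained for decoding to actually run inside the timed region.

```python
import time
from statistics import median
from typing import Callable, Dict, Iterable, List


def time_full_decode(run: Callable[[], Iterable], repeats: int = 5) -> List[float]:
    """Time `run` end to end, draining the iterable it returns.

    Draining matters when segments are produced lazily: without it,
    only the setup cost would be measured, not the decoding itself.
    """
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        for _segment in run():
            pass  # consume every segment so the full decode is timed
        timings.append(time.perf_counter() - start)
    return timings


def compare(baseline: Callable[[], Iterable],
            with_prefix: Callable[[], Iterable],
            repeats: int = 5) -> Dict[str, float]:
    """Return median wall-clock times for both variants and their ratio."""
    base = median(time_full_decode(baseline, repeats))
    pref = median(time_full_decode(with_prefix, repeats))
    speedup = base / pref if pref > 0 else float("inf")
    return {"baseline_s": base, "prefix_s": pref, "speedup": speedup}


# Hypothetical usage (assumes faster-whisper is installed; the file name
# and prefix text are placeholders):
#
#   from faster_whisper import WhisperModel
#   model = WhisperModel("small")
#   result = compare(
#       lambda: model.transcribe("window.wav")[0],
#       lambda: model.transcribe("window.wav", prefix="already transcribed text")[0],
#   )
#   print(result)
```

A speedup close to 1.0 from this kind of comparison is what we mean by "no improvement". Note that the encoder pass over the audio window is the same in both cases, so any gain could only come from the decoder side.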
