Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"wave_length": 7680,what this hp mean? #28

Open
switchzts opened this issue Oct 21, 2018 · 1 comment
Open

"wave_length": 7680,what this hp mean? #28

switchzts opened this issue Oct 21, 2018 · 1 comment

Comments

@switchzts
Copy link

switchzts commented Oct 21, 2018

Does it represent the minimum length of each speech? Why do you want to send each segment of speech into training? Is it the reason for memory resources?If my audio is about 10-15s, does it cause my model to generate meaningless audio?

@bfs18
Copy link
Owner

bfs18 commented Oct 22, 2018

--Does it represent the minimum length of each speech?
--No, all input waves are cropped to length 7680. In input queue, a wave segment of length 7680 is randomly cropped from a longer input wave.

--Why do you want to send each segment of speech into training? Is it the reason for memory resources?
--Yes, longer wave segments consume much more gpu memory.

--If my audio is about 10-15s, does it cause my model to generate meaningless audio?
--No, the model trained on waves of length 7680 generalizes well on longer sequences.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants