Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How about the audio quality? #4

Open
OnceJune opened this issue Apr 22, 2022 · 6 comments
Open

How about the audio quality? #4

OnceJune opened this issue Apr 22, 2022 · 6 comments

Comments

@OnceJune
Copy link

Hi, thanks to the implement, the inference speed is impressive. How about the audio quality? And have you tried v2 config? Thanks in advance.

@rishikksh20
Copy link
Owner

Quality is better than v1 of hifigan with less training

@SolomidHero
Copy link

Hi, I trained this model several times with different scheduling and didn't get appropriate audios by inference scripts.
Can you share with some training hyperparameters if this is the case?
Also, what data it uses (what kHz, spectrum, etc)?
I also wonder what is the difference of stft and mel with that in tacatron2?

Thank you!

@SolomidHero
Copy link

@rishikksh20 ?

@rishikksh20
Copy link
Owner

@SolomidHero I will check, but I think audio would be good I have train this model in 4 dataset including LJSpeech and it perform good not as good as mentioned in paper but still decent enough.

@rishikksh20
Copy link
Owner

We tested it on multiple datasets and it working better than hifigan in speed as well as quality please follow same pre-processing and hyperparameter mentioned in the repo.

@a897456
Copy link

a897456 commented Feb 29, 2024

We tested it on multiple datasets and it working better than hifigan in speed as well as quality please follow same pre-processing and hyperparameter mentioned in the repo.

However, I found that the attenuation coefficient b1, b2 in the paper is different from that in the ".json" file, and the number of test sets, verification sets and training sets, as well as in the paper and in the code are inconsistent , so I don't know which version should be followed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants