Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

.lrc filetypes for download #893

Open
mknepper opened this issue May 5, 2023 · 2 comments
Open

.lrc filetypes for download #893

mknepper opened this issue May 5, 2023 · 2 comments

Comments

@mknepper
Copy link

mknepper commented May 5, 2023

The ability to download lyrics from the database in .lrc would be pretty neat. It'd make things easier when using stuff like Audacious or other music programs that scrape lyrics.

@snshn
Copy link
Member

snshn commented May 7, 2023

It could be problematic, since this database was initially meant to be used by humans, that is why the plaintext format with a little bit of metadata at the end was chosen, and I still like that format myself.

How hard do you think it would be to have a piece of software automatically time and match lines from plaintext files with when it's in the song, and attaching the timestamp to it? That's the only easy way to generate .lrc out of these files that comes to my mind.

@C0rn3j
Copy link
Contributor

C0rn3j commented Jan 13, 2025

Hard, but not infeasible.


VLC is doing almost the same, including the transcription, with models from whisper. That's aimed at shows/movies/videos though

https://www.omgubuntu.co.uk/2025/01/vlc-demo-ai-real-time-subtitles-for-any-video

Songs however have a lot of continuous audio and a lot of noise, reverb, layered sounds needing an arbiter to decide what to keep, etc.

So whatever VLC will end up releasing may not be the best tool for the job, time will tell when they release.

You can try making raw whisper do it and fixing the rest I suppose - openai/whisper#98

There's also Gentle which seems to be also within the LLM approach - https://github.com/itsvaibhav01/Gentle

Subaligner seems actually maintained - https://github.com/baxtree/subaligner


There seem to be (mostly dead) old-time non-LLM projects that target to do this exact thing, like https://pypi.org/project/py3-aeneas/, but I have not had the time to check them out yet.

Would require the song source + plaintext lyrics, so the end result would best be stored in the repo, we can talk formats when we figure out how to achieve it in the first place.

It does look like an imperfect solution for music, unfortunately - https://news.ycombinator.com/item?id=13908579


Karaoke-level lyrics look like this in LRC (I am sure the good japanese karaoke ones look even crazier since I've seen people do every damn syllable in .ASS subtitles):

[00:53.60]On a dark [00:54.85]desert highway[00:56.26] so that does not look the best for human readability, but hey, the website can simply show the parsed plaintext of this.


It is a question of whether we want to settle on LRC though, as something like ASS(I did not pick the acronym) might be more appropriate as it has styling abilities and definitely a good karaoke support.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants