This repository has been archived by the owner on Oct 10, 2022. It is now read-only.
Dataset conversion to MP3
Pre-release
Pre-release
Key changes:
- Converted the majority of the dataset to MP3;
- Added download script, md5 hashes into download script;
- Fixed license;
- Added items to FAQ and common issues;
THE MAJORITY OF WAV LINKS WILL BE DELETED SOON.
Coming soon:
- Download via torrent;
- Large (1,500 hours) YouTube dataset;
- ... and more)
Dataset composition
Dataset | Utterances | Hours | GB | Av s/chars | Comment | Annotation | Quality/noise |
---|---|---|---|---|---|---|---|
public_youtube1500 (*) | 1,500 | * Coming soon | |||||
audiobook_2 | 1,149,404 | 1,511 | 166 | 4.7s / 56 | Books | Alignment (*) | 95% / crisp |
public_youtube700 | 759,483 | 701 | 75 | 3.3s / 43 | Youtube videos | Subtitles | 95% / ~crisp |
tts_russian_addresses | 1,741,838 | 754 | 81 | 1.6s / 20 | Russian addresses | TTS 4 voices | 100% / crisp |
asr_public_phone_calls_2 | 603,797 | 601 | 66 | 3.6s / 37 | Phone calls | ASR | 70% / noisy |
asr_public_phone_calls_1 | 233,868 | 211 | 23 | 3.3s / 29 | Phone calls | ASR | 70% / noisy |
asr_public_stories_2 | 78,186 | 78 | 9 | 3.5s / 43 | Books | ASR | 80% / crisp |
asr_public_stories_1 | 46,142 | 38 | 4 | 3.0s / 30 | Books | ASR | 80% / crisp |
public_series_1 | 20,243 | 17 | 2 | 3.1s / 38 | Youtube videos | Subtitles | 95% / ~crisp |
ru_RU | 5,826 | 17 | 2 | 11s / 12 | Public dataset | Alignment | 99% / crisp |
voxforge_ru | 8,344 | 17 | 2 | 7.5s / 77 | Public dataset | Reading | 100% / crisp |
russian_single | 3,357 | 9 | 1 | 9.3s / 102 | Public dataset | Alignment | 99% / crisp |
public_lecture_1 | 6,803 | 6 | 1 | 3.4s / 47 | Lectures | Subtitles | 95% / crisp |
Total | 4,657,291 | 3,961 | 431 |
Links
Meta data file.
Dataset | GB, wav | GB, mp3 | Wav | Mp3 | Source | Manifest |
---|---|---|---|---|---|---|
audiobook_2 | 166 | 21.0 | down | part1 | Sources from the Internet + alignment | link |
asr_public_phone_calls_2 | 66 | 7.5 | down | part1 | Sources from the Internet + ASR | link |
asr_public_stories_2 | 9 (7.5) | NA | part1 | NA | Sources from the Internet + alignment | link |
tts_russian_addresses_rhvoice_4voices | 80.9 | 9.9 | down | part1 | TTS | link |
public_youtube700 | 75.0 | 9.6 | down | part1 | YouTube videos | link |
asr_public_phone_calls_1 | 22.7 | 2.6 | down | part1 | Sources from the Internet + ASR | link |
asr_public_stories_1 | 4.1 | 0.5 | down | part1 | Public stories | link |
public_series_1 | 1.9 | 0.2 | down | part1 | Public series | link |
ru_RU | 1.9 | 0.2 | down | part1 | Caito.de dataset | link |
voxforge_ru | 1.9 | 0.2 | down | part1 | Voxforge dataset | link |
russian_single | 0.9 | 0.1 | down | part1 | Russian single speaker dataset | link |
public_lecture_1 | 0.7 | 0.1 | down | part1 | Sources from the Internet | link |
Total | 431 | 52 |