Skip to content
This repository has been archived by the owner on Oct 10, 2022. It is now read-only.

Dataset conversion to MP3

Pre-release
Pre-release
Compare
Choose a tag to compare
@snakers4 snakers4 released this 10 May 15:00
· 57 commits to master since this release

Key changes:

  • Converted the majority of the dataset to MP3;
  • Added download script, md5 hashes into download script;
  • Fixed license;
  • Added items to FAQ and common issues;

THE MAJORITY OF WAV LINKS WILL BE DELETED SOON.

Coming soon:

  • Download via torrent;
  • Large (1,500 hours) YouTube dataset;
  • ... and more)

Dataset composition

Dataset Utterances Hours GB Av s/chars Comment Annotation Quality/noise
public_youtube1500 (*) 1,500 * Coming soon
audiobook_2 1,149,404 1,511 166 4.7s / 56 Books Alignment (*) 95% / crisp
public_youtube700 759,483 701 75 3.3s / 43 Youtube videos Subtitles 95% / ~crisp
tts_russian_addresses 1,741,838 754 81 1.6s / 20 Russian addresses TTS 4 voices 100% / crisp
asr_public_phone_calls_2 603,797 601 66 3.6s / 37 Phone calls ASR 70% / noisy
asr_public_phone_calls_1 233,868 211 23 3.3s / 29 Phone calls ASR 70% / noisy
asr_public_stories_2 78,186 78 9 3.5s / 43 Books ASR 80% / crisp
asr_public_stories_1 46,142 38 4 3.0s / 30 Books ASR 80% / crisp
public_series_1 20,243 17 2 3.1s / 38 Youtube videos Subtitles 95% / ~crisp
ru_RU 5,826 17 2 11s / 12 Public dataset Alignment 99% / crisp
voxforge_ru 8,344 17 2 7.5s / 77 Public dataset Reading 100% / crisp
russian_single 3,357 9 1 9.3s / 102 Public dataset Alignment 99% / crisp
public_lecture_1 6,803 6 1 3.4s / 47 Lectures Subtitles 95% / crisp
Total 4,657,291 3,961 431

Links

Meta data file.

Dataset GB, wav GB, mp3 Wav Mp3 Source Manifest
audiobook_2 166 21.0 down part1 Sources from the Internet + alignment link
asr_public_phone_calls_2 66 7.5 down part1 Sources from the Internet + ASR link
asr_public_stories_2 9 (7.5) NA part1 NA Sources from the Internet + alignment link
tts_russian_addresses_rhvoice_4voices 80.9 9.9 down part1 TTS link
public_youtube700 75.0 9.6 down part1 YouTube videos link
asr_public_phone_calls_1 22.7 2.6 down part1 Sources from the Internet + ASR link
asr_public_stories_1 4.1 0.5 down part1 Public stories link
public_series_1 1.9 0.2 down part1 Public series link
ru_RU 1.9 0.2 down part1 Caito.de dataset link
voxforge_ru 1.9 0.2 down part1 Voxforge dataset link
russian_single 0.9 0.1 down part1 Russian single speaker dataset link
public_lecture_1 0.7 0.1 down part1 Sources from the Internet link
Total 431 52