This repository has been archived by the owner on Oct 10, 2022. It is now read-only.

Readability
snakers4 committed May 5, 2020
1 parent de459af commit 36fcdb4
Showing 1 changed file with 24 additions and 24 deletions.
48 changes: 24 additions & 24 deletions README.md
@@ -185,30 +185,30 @@ If you are using Windows, you may use **Linux subsystem** to run these commands.

## **Links**

| Dataset | GB, wav | GB, archive | Archive | Source | Manifest |
|---------------------------------------|---------|-------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------|
| Train | | | | | |
| radio_v4 | 1059 | 176 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_v4_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Radio | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_v4_manifest.csv) |
| public_speech | 257 | 47.4 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_speech_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Sources from the Internet + alignment | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_speech_manifest.csv) |
| radio_v4_add | 15.7 | 2.8 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_v4_add_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Radio | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_v4_add_manifest.csv) |
| 5% of radio_v4 + public_speech | - | 11.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_pspeech_sample_manifest.tar.gz) | - | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_pspeech_sample_manifest.csv) |
| audiobook_2 | 162 | 25.8 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/private_buriy_audiobooks_2.tar.gz) | Sources from the Internet + alignment | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/private_buriy_audiobooks_2.csv) |
| radio_2 | 154 | 24.6 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_2.tar.gz) | Radio | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_2.csv) |
| public_youtube1120 | 237 | 19.0 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube1120.tar.gz) | YouTube videos | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube1120.csv) |
| asr_public_phone_calls_2 | 66 | 9.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_phone_calls_2.tar.gz) | Sources from the Internet + ASR | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_phone_calls_2.csv) |
| public_youtube1120_hq | 31 | 4.9 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube1120_hq.tar.gz) | YouTube videos | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube1120_hq.csv) |
| asr_public_stories_2 | 9 | 1.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_stories_2.tar.gz) | Sources from the Internet + alignment | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_stories_2.csv) |
| tts_russian_addresses_rhvoice_4voices | 80.9 | 12.9 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/tts_russian_addresses_rhvoice_4voices.tar.gz) | TTS | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/tts_russian_addresses_rhvoice_4voices.csv) |
| public_youtube700 | 75.0 | 12.2 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube700.tar.gz) | YouTube videos | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube700.csv) |
| asr_public_phone_calls_1 | 22.7 | 3.2 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_phone_calls_1.tar.gz) | Sources from the Internet + ASR | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_phone_calls_1.csv) |
| asr_public_stories_1 | 4.1 | 0.7 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_stories_1.tar.gz) | Public stories | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_stories_1.csv) |
| public_series_1 | 1.9 | 0.3 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_series_1.tar.gz) | Public series | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_series_1.csv) |
| public_lecture_1 | 0.7 | 0.1 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_lecture_1.tar.gz) | Sources from the Internet + manual | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_lecture_1.csv) |
| Val | | | | | |
| asr_calls_2_val | 2 | 0.8 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_calls_2_val.tar.gz) | Sources from the Internet | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_calls_2_val.csv) |
| buriy_audiobooks_2_val | 1 | 0.5 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/buriy_audiobooks_2_val.tar.gz) | Books + manual | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/buriy_audiobooks_2_val.csv) |
| public_youtube700_val | 2 | 0.13 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube700_val.tar.gz) | YouTube videos + manual | [manifest file](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube700_val.csv) |
| Total | 2,186 | 354 | | | |
| Dataset | GB, wav | GB, archive | Archive | Source | Manifest |
|---------------------------------------|---------|-------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------|---------------------------------------------------------------------------------------------------------------------------------------------|
| Train | | | | | |
| radio_v4 | 1059 | 176 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_v4_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Radio | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_v4_manifest.csv) |
| public_speech | 257 | 47.4 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_speech_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Internet + alignment | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_speech_manifest.csv) |
| radio_v4_add | 15.7 | 2.8 | [opus](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_v4_add_manifest.tar.gz), [txt](https://forms.gle/nosMaNgj8MWKm99d9) | Radio | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_v4_add_manifest.csv) |
| 5% of radio_v4 + public_speech | - | 11.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_pspeech_sample_manifest.tar.gz) | - | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_pspeech_sample_manifest.csv) |
| audiobook_2 | 162 | 25.8 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/private_buriy_audiobooks_2.tar.gz) | Internet + alignment | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/private_buriy_audiobooks_2.csv) |
| radio_2 | 154 | 24.6 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/radio_2.tar.gz) | Radio | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/radio_2.csv) |
| public_youtube1120 | 237 | 19.0 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube1120.tar.gz) | YouTube videos | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube1120.csv) |
| asr_public_phone_calls_2 | 66 | 9.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_phone_calls_2.tar.gz) | Internet + ASR | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_phone_calls_2.csv) |
| public_youtube1120_hq | 31 | 4.9 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube1120_hq.tar.gz) | YouTube videos | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube1120_hq.csv) |
| asr_public_stories_2 | 9 | 1.4 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_stories_2.tar.gz) | Internet + alignment | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_stories_2.csv) |
| tts_russian_addresses_rhvoice_4voices | 80.9 | 12.9 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/tts_russian_addresses_rhvoice_4voices.tar.gz) | TTS | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/tts_russian_addresses_rhvoice_4voices.csv) |
| public_youtube700 | 75.0 | 12.2 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube700.tar.gz) | YouTube videos | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube700.csv) |
| asr_public_phone_calls_1 | 22.7 | 3.2 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_phone_calls_1.tar.gz) | Internet + ASR | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_phone_calls_1.csv) |
| asr_public_stories_1 | 4.1 | 0.7 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_public_stories_1.tar.gz) | Public stories | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_public_stories_1.csv) |
| public_series_1 | 1.9 | 0.3 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_series_1.tar.gz) | Public series | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_series_1.csv) |
| public_lecture_1 | 0.7 | 0.1 | [opus+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_lecture_1.tar.gz) | Internet + manual | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_lecture_1.csv) |
| Val | | | | | |
| asr_calls_2_val | 2 | 0.8 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/asr_calls_2_val.tar.gz) | Internet | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/asr_calls_2_val.csv) |
| buriy_audiobooks_2_val | 1 | 0.5 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/buriy_audiobooks_2_val.tar.gz) | Books + manual | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/buriy_audiobooks_2_val.csv) |
| public_youtube700_val | 2 | 0.13 | [wav+txt](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/archives/public_youtube700_val.tar.gz) | YouTube videos + manual | [manifest](https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus/manifests/public_youtube700_val.csv) |
| Total | 2,186 | 354 | | | |
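
The archive and manifest links in the table appear to share a common base URL and differ only in a file stem. The sketch below rebuilds both links from a stem; the helper name `dataset_urls` is hypothetical and not part of the repository. Note that the stem is not always the dataset name shown in the first column: `radio_v4` uses `radio_v4_manifest`, and `audiobook_2` uses `private_buriy_audiobooks_2`.

```python
# Base URL copied from the links in the table above.
BASE = "https://azureopendatastorage.blob.core.windows.net/openstt/ru_open_stt_opus"


def dataset_urls(stem: str) -> dict:
    """Return archive and manifest URLs for a file stem taken from the table.

    Hypothetical helper: it assumes the archives/<stem>.tar.gz and
    manifests/<stem>.csv layout seen in the table rows.
    """
    return {
        "archive": f"{BASE}/archives/{stem}.tar.gz",
        "manifest": f"{BASE}/manifests/{stem}.csv",
    }


if __name__ == "__main__":
    urls = dataset_urls("radio_2")
    print(urls["archive"])
    print(urls["manifest"])
```

The function only formats strings; it does not check that the files still exist at those addresses.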


## **Download instructions**
