Skip to content
Change the repository type filter

All

    Repositories list

    • Amphion

      Public
      Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
      Jupyter Notebook
      MIT License
      663000Updated Dec 10, 2024Dec 10, 2024
    • Foundational model for human-like, expressive TTS
      Python
      Apache License 2.0
      676000Updated Nov 17, 2024Nov 17, 2024
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      MIT License
      1.3k000Updated Nov 7, 2024Nov 7, 2024
    • AI powered speech denoising and enhancement
      Python
      MIT License
      183000Updated Nov 5, 2024Nov 5, 2024
    • VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
      Python
      MIT License
      13000Updated Oct 5, 2024Oct 5, 2024
    • BigVGAN

      Public
      Official PyTorch implementation of BigVGAN (ICLR 2023)
      Python
      MIT License
      122000Updated Sep 5, 2024Sep 5, 2024
    • for preparing LJSpeech
      Python
      24000Updated Aug 19, 2023Aug 19, 2023