Skip to content
@FunAudioLLM

FunAudioLLM

Popular repositories Loading

  1. CosyVoice CosyVoice Public

    Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

    Python 18.7k 2.1k

  2. SenseVoice SenseVoice Public

    Multilingual Voice Understanding Model

    Python 7.3k 675

  3. FunMusic FunMusic Public

    A fundamental toolkit designed for music, song, and audio generation

    Python 1.3k 131

  4. ThinkSound ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python 1.1k 65

  5. Fun-ASR Fun-ASR Public

    Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

    Python 597 40

  6. Fun-Audio-Chat Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python 501 50

Repositories

Showing 10 of 12 repositories
  • Fun-ASR Public

    Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

    FunAudioLLM/Fun-ASR’s past year of commit activity
    Python 597 Apache-2.0 40 34 0 Updated Dec 31, 2025
  • CosyVoice Public

    Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

    FunAudioLLM/CosyVoice’s past year of commit activity
    Python 18,663 Apache-2.0 2,077 855 15 Updated Dec 31, 2025
  • FunAudioLLM/FunAudioLLM.github.io’s past year of commit activity
    HTML 57 MIT 10 0 1 Updated Dec 31, 2025
  • SenseVoice Public

    Multilingual Voice Understanding Model

    FunAudioLLM/SenseVoice’s past year of commit activity
    Python 7,269 675 163 3 Updated Dec 30, 2025
  • Fun-Audio-Chat Public

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    FunAudioLLM/Fun-Audio-Chat’s past year of commit activity
    Python 501 Apache-2.0 50 3 1 Updated Dec 25, 2025
  • FunResearch Public

    This repository is maintained by the Speech Team at Alibaba’s Tongyi Lab, serving as an open-source platform for our cutting-edge research in speech, audio, NLP technologies. We believe in accelerating scientific progress through transparent collaboration, and invite the global research community to explore, reproduce, and build upon our work.

    FunAudioLLM/FunResearch’s past year of commit activity
    Python 13 Apache-2.0 1 0 0 Updated Dec 20, 2025
  • ThinkSound Public

    [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    FunAudioLLM/ThinkSound’s past year of commit activity
    Python 1,122 65 30 1 Updated Nov 25, 2025
  • CV3-Eval Public
    FunAudioLLM/CV3-Eval’s past year of commit activity
    Python 165 Apache-2.0 14 6 0 Updated Aug 25, 2025
  • MME-Emotion Public

    Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models”

    FunAudioLLM/MME-Emotion’s past year of commit activity
    Python 17 MIT 2 1 0 Updated Aug 19, 2025
  • OmniAudio Public
    FunAudioLLM/OmniAudio’s past year of commit activity
    Python 7 3 0 0 Updated May 21, 2025