Models

28

Full-text search

Active filters: audio-language-model

nvidia/audio-flamingo-next-hf

Audio-Text-to-Text • 8B • Updated 8 days ago • 2.58k • 37

tencent/Unified_Audio_Schema

Audio-Text-to-Text • 8B • Updated 7 days ago • 24 • 5

mispeech/midashenglm-7b-0804-fp32

Audio-Text-to-Text • 8B • Updated Mar 17 • 51.5k • 80

nvidia/audio-flamingo-next-think-hf

Audio-Text-to-Text • 8B • Updated 8 days ago • 2.49k • 5

nvidia/audio-flamingo-next-captioner-hf

Audio-Text-to-Text • 8B • Updated 8 days ago • 873 • 8

moonshotai/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated May 29, 2025 • 42.6k • 393

maitrix-org/Voila-base

Audio-to-Audio • 8B • Updated May 6, 2025 • 9 • 13

maitrix-org/Voila-chat

Audio-to-Audio • Updated May 6, 2025 • 60 • 55

moonshotai/Kimi-Audio-7B

Text-to-Speech • 10B • Updated May 29, 2025 • 161 • 78

rsxdalv/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated May 23, 2025 • 7

zh794390558/Kimi-Audio-7B

Text-to-Speech • 10B • Updated Jun 9, 2025 • 5

mispeech/midashenglm-7b-0804-4bit-bnb

Audio-Text-to-Text • 8B • Updated Oct 20, 2025 • 30 • 1

mispeech/midashenglm-7b-0804-bf16

Audio-Text-to-Text • 8B • Updated Mar 17 • 120

mispeech/midashenglm-7b-0804-fp8

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 14

mispeech/midashenglm-7b-0804-w4a16-gptq

Audio-Text-to-Text • 3B • Updated Oct 31, 2025 • 3

mispeech/midashenglm-7b-1021-bf16

Audio-Text-to-Text • 8B • Updated Mar 17 • 257 • 2

mispeech/midashenglm-7b-1021-fp8

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 26 • 1

mispeech/midashenglm-7b-1021-fp32

Audio-Text-to-Text • 8B • Updated Oct 31, 2025 • 105

mispeech/midashenglm-7b-1021-w4a16-gptq

Audio-Text-to-Text • 3B • Updated Oct 31, 2025 • 43 • 1

FunAudioLLM/Fun-Audio-Chat-8B

Any-to-Any • 9B • Updated Dec 24, 2025 • 3.06k • 183

Mayank022/Audio-Language-Model

Audio-Text-to-Text • Updated Feb 26

tunglinwood/Kimi-Audio-7B-Instruct

Text-to-Speech • 10B • Updated Feb 12 • 6

cslys1999/Eureka-Audio-Instruct

Audio-Text-to-Text • 3B • Updated Feb 26 • 251 • 6

teamvizuara/Vocal-LLM

Audio-Text-to-Text • Updated Feb 26

mispeech/midashenglm-0.6b-fp32

Audio-Text-to-Text • 0.7B • Updated 20 days ago • 261 • 2

mlx-community/kimi-audio-7b

Text-to-Speech • 10B • Updated 19 days ago • 689

mispeech/midashenglm-0.6b-gguf

Audio-Text-to-Text • 0.6B • Updated 7 days ago • 393

mispeech/midashenglm-7b-1021-gguf

Audio-Text-to-Text • 8B • Updated 7 days ago • 204