Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 1 day ago • 192
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published 8 days ago • 22
Qwen2.5 Collection Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 46 items • Updated 15 days ago • 560
CLAP: Contrastive Language-Audio Pretraining Collection CLAP is to audio what CLIP is to image. • 5 items • Updated Oct 31, 2023 • 10
snowpark Collection JAX ports of popular models (audio, vision, LLM, diffusion, etc.) • 1 item • Updated 13 days ago