Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 5 days ago • 45
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 8 days ago • 49
Style-Friendly SNR Sampler for Style-Driven Generation Paper • 2411.14793 • Published Nov 22, 2024 • 36
Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling Paper • 2411.18664 • Published Nov 27, 2024 • 24
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 279
VidPanos: Generative Panoramic Videos from Casual Panning Videos Paper • 2410.13832 • Published Oct 17, 2024 • 12
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 8 days ago • 97
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs Paper • 2311.04901 • Published Nov 8, 2023 • 7
High-Resolution Image Synthesis with Latent Diffusion Models Paper • 2112.10752 • Published Dec 20, 2021 • 12