Submitted by akhaliq 40 DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data · 9 authors 6
Submitted by akhaliq 19 LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models · 5 authors 11
Submitted by akhaliq 16 DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis · 8 authors
Submitted by akhaliq 15 Improved Distribution Matching Distillation for Fast Image Synthesis · 7 authors
Submitted by akhaliq 14 Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation · 7 authors 1
Submitted by akhaliq 14 AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability · 7 authors
Submitted by akhaliq 12 RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance · 12 authors
Submitted by akhaliq 12 CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers · 6 authors 1
Submitted by akhaliq 10 NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections · 7 authors
Submitted by akhaliq 10 Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling · 8 authors
Submitted by akhaliq 9 Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras · 12 authors