- MLP Can Be A Good Transformer Learner
  Paper • 2404.05657 • Published • 1
- Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective
  Paper • 2404.07200 • Published • 1
- An inclusive review on deep learning techniques and their scope in handwriting recognition
  Paper • 2404.08011 • Published • 1
- Long-form music generation with latent diffusion
  Paper • 2404.10301 • Published • 25

Collections including paper arxiv:2404.12385
- I am a Strange Dataset: Metalinguistic Tests for Language Models
  Paper • 2401.05300 • Published • 3
- Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
  Paper • 2404.08801 • Published • 66
- SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification
  Paper • 2305.09781 • Published • 4
- MeshLRM: Large Reconstruction Model for High-Quality Mesh
  Paper • 2404.12385 • Published • 27

- Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
  Paper • 2404.02514 • Published • 10
- MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
  Paper • 2404.08252 • Published • 6
- Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
  Paper • 2404.09833 • Published • 30
- MeshLRM: Large Reconstruction Model for High-Quality Mesh
  Paper • 2404.12385 • Published • 27

- 3D Congealing: 3D-Aware Image Alignment in the Wild
  Paper • 2404.02125 • Published • 9
- SpatialTracker: Tracking Any 2D Pixels in 3D Space
  Paper • 2404.04319 • Published • 24
- Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
  Paper • 2404.09956 • Published • 12
- MeshLRM: Large Reconstruction Model for High-Quality Mesh
  Paper • 2404.12385 • Published • 27

- Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
  Paper • 2403.10395 • Published • 8
- CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
  Paper • 2403.05034 • Published • 21
- FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
  Paper • 2404.00987 • Published • 22
- Advances in 3D Generation: A Survey
  Paper • 2401.17807 • Published • 19

- Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
  Paper • 2203.00585 • Published • 2
- Emerging Properties in Self-Supervised Vision Transformers
  Paper • 2104.14294 • Published • 3
- DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
  Paper • 2404.06903 • Published • 19
- Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
  Paper • 2404.07973 • Published • 31

- ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
  Paper • 2403.01807 • Published • 8
- TripoSR: Fast 3D Object Reconstruction from a Single Image
  Paper • 2403.02151 • Published • 13
- OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
  Paper • 2403.01779 • Published • 29
- MagicClay: Sculpting Meshes With Generative Neural Fields
  Paper • 2403.02460 • Published • 8

- PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
  Paper • 2402.08714 • Published • 12
- Data Engineering for Scaling Language Models to 128K Context
  Paper • 2402.10171 • Published • 24
- RLVF: Learning from Verbal Feedback without Overgeneralization
  Paper • 2402.10893 • Published • 11
- Coercing LLMs to do and reveal (almost) anything
  Paper • 2402.14020 • Published • 13