Collections
Collections including paper arxiv:2404.11925
- EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
  Paper • 2402.04252 • Published • 26
- MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
  Paper • 2404.03413 • Published • 28
- openai/clip-vit-large-patch14-336
  Zero-Shot Image Classification • Updated • 5.52M • 228
- openai/clip-vit-base-patch32
  Zero-Shot Image Classification • Updated • 14.8M • 634

- StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control
  Paper • 2403.09055 • Published • 26
- ReNoise: Real Image Inversion Through Iterative Noising
  Paper • 2403.14602 • Published • 21
- EdgeFusion: On-Device Text-to-Image Generation
  Paper • 2404.11925 • Published • 22