Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published 9 days ago • 27
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published 13 days ago • 33
MSTS: A Multimodal Safety Test Suite for Vision-Language Models Paper • 2501.10057 • Published 30 days ago • 8
laion/CLIP-ViT-B-32-laion2B-s34B-b79K Zero-Shot Image Classification • Updated 25 days ago • 2.62M • 106
laion/CLIP-convnext_large_d_320.laion2B-s29B-b131K-ft-soup Zero-Shot Image Classification • Updated 25 days ago • 422k • 18
laion/CLIP-ViT-H-14-laion2B-s32B-b79K Zero-Shot Image Classification • Updated 25 days ago • 1.04M • 356
laion/CLIP-ViT-g-14-laion2B-s34B-b88K Zero-Shot Image Classification • Updated 25 days ago • 30.7k • 24
laion/CLIP-ViT-bigG-14-laion2B-39B-b160k Zero-Shot Image Classification • Updated 25 days ago • 1.84M • 250