-
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Paper • 2403.09622 • Published • 17 -
TableGPT2: A Large Multimodal Model with Tabular Data Integration
Paper • 2411.02059 • Published • 5 -
POINTS1.5: Building a Vision-Language Model towards Real World Applications
Paper • 2412.08443 • Published • 38
Ming
nodejs
AI & ML interests
None yet
Recent Activity
liked
a model
about 2 hours ago
Alpha-VLLM/Lumina-Image-2.0
liked
a model
1 day ago
coqui/XTTS-v2
liked
a model
1 day ago
Zyphra/Zonos-v0.1-hybrid
Organizations
None yet
Collections
1
spaces
1
models
None public yet
datasets
None public yet