arxiv:2601.10061
Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
authored
a paper
2 days ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
upvoted
a
paper
2 days ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
updated
a dataset
4 days ago
Ryann829/SconeEval