Submitted by akhaliq 6 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model · 6 authors
Submitted by akhaliq 5 Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers · 3 authors 1
Submitted by akhaliq 4 InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning · 9 authors
Submitted by akhaliq 3 EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention · 6 authors 1
Submitted by akhaliq 2 Chain-of-Dictionary Prompting Elicits Translation in Large Language Models · 6 authors
Submitted by akhaliq 1 Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting · 7 authors
Submitted by akhaliq 1 Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction · 7 authors
Submitted by akhaliq 1 LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM · 3 authors