The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 29 days ago • 184
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5? Paper • 2311.07587 • Published Nov 8, 2023 • 5
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data Paper • 2311.06753 • Published Nov 12, 2023 • 8
Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer Paper • 2311.06720 • Published Nov 12, 2023 • 9
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 Paper • 2311.07361 • Published Nov 13, 2023 • 14
LayoutPrompter: Awaken the Design Ability of Large Language Models Paper • 2311.06495 • Published Nov 11, 2023 • 12
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation Paper • 2311.07562 • Published Nov 13, 2023 • 14
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks Paper • 2311.07463 • Published Nov 13, 2023 • 15
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models Paper • 2311.07575 • Published Nov 13, 2023 • 15
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models Paper • 2311.06783 • Published Nov 12, 2023 • 28
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Paper • 2311.07574 • Published Nov 13, 2023 • 16
PolyMaX: General Dense Prediction with Mask Transformer Paper • 2311.05770 • Published Nov 9, 2023 • 11
ADaPT: As-Needed Decomposition and Planning with Language Models Paper • 2311.05772 • Published Nov 8, 2023 • 15
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities Paper • 2311.05698 • Published Nov 9, 2023 • 14
Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems Paper • 2311.05884 • Published Nov 10, 2023 • 11
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores Paper • 2311.05908 • Published Nov 10, 2023 • 16
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization Paper • 2311.06243 • Published Nov 10, 2023 • 22