ChoiRI's picture

28 9

ChoiRI

ChoiRI

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

upvoted a paper 25 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper 25 days ago

4K4D: Real-Time 4D View Synthesis at 4K Resolution

View all activity

Organizations

None yet

ChoiRI's activity

upvoted a paper 3 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 3 days ago • 62

upvoted 19 papers 25 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 29 days ago • 184

4K4D: Real-Time 4D View Synthesis at 4K Resolution

Paper • 2310.11448 • Published Oct 17, 2023 • 39

Ranking LLM-Generated Loop Invariants for Program Verification

Paper • 2310.09342 • Published Oct 13, 2023 • 4

When can transformers reason with abstract symbols?

Paper • 2310.09753 • Published Oct 15, 2023 • 4

Improving Large Language Model Fine-tuning for Solving Math Problems

Paper • 2310.10047 • Published Oct 16, 2023 • 7

Microscaling Data Formats for Deep Learning

Paper • 2310.10537 • Published Oct 16, 2023 • 8

Farzi Data: Autoregressive Data Distillation

Paper • 2310.09983 • Published Oct 15, 2023 • 10

Interactive Task Planning with Language Models

Paper • 2310.10645 • Published Oct 16, 2023 • 12

Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Paper • 2310.09520 • Published Oct 14, 2023 • 12

Video Language Planning

Paper • 2310.10625 • Published Oct 16, 2023 • 11

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

Paper • 2310.09478 • Published Oct 14, 2023 • 21

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Paper • 2310.10638 • Published Oct 16, 2023 • 30

Llemma: An Open Language Model For Mathematics

Paper • 2310.10631 • Published Oct 16, 2023 • 53

Toward Joint Language Modeling for Speech Units and Text

Paper • 2310.08715 • Published Oct 12, 2023 • 9

CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Paper • 2310.08992 • Published Oct 13, 2023 • 13

A Zero-Shot Language Agent for Computer Control with Structured Reflection

Paper • 2310.08740 • Published Oct 12, 2023 • 16

Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams

Paper • 2310.08678 • Published Oct 12, 2023 • 14

The Consensus Game: Language Model Generation via Equilibrium Search

Paper • 2310.09139 • Published Oct 13, 2023 • 14

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Paper • 2310.08659 • Published Oct 12, 2023 • 27