uu's picture

19 8

uu

JayZc

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper 11 days ago

Optimizing Large Language Model Training Using FP4 Quantization

upvoted a paper 11 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

View all activity

Organizations

None yet

JayZc's activity

upvoted a paper about 15 hours ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 3 days ago • 104

upvoted 3 papers 11 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 18 days ago • 34

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 18 days ago • 105

Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts

Paper • 2501.14334 • Published 23 days ago • 17

liked a dataset 11 days ago

axxkaya/UVT-Explanatory-based-Vision-Tasks

Viewer • Updated 4 days ago • 284k • 71 • 6

upvoted 2 papers 11 days ago

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 17 days ago • 23

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published 22 days ago • 29

liked a dataset 11 days ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 103k • 737

upvoted a paper 11 days ago

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published 18 days ago • 25

upvoted 11 papers 19 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 37

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 45

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 73

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 62

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published 24 days ago • 20

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 25 days ago • 319

Control LLM: Controlled Evolution for Intelligence Retention in LLM

Paper • 2501.10979 • Published 28 days ago • 6

GSTAR: Gaussian Surface Tracking and Reconstruction

Paper • 2501.10283 • Published 29 days ago • 5

Hallucinations Can Improve Large Language Models in Drug Discovery

Paper • 2501.13824 • Published 23 days ago • 9

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation

Paper • 2403.14614 • Published Mar 21, 2024 • 3

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published 23 days ago • 30