5 5 12

Yingfa Chen

chen-yingfa

https://chen-yingfa.github.io

AI & ML interests

Long-context modeling, continual learning, architectures

Recent Activity

authored a paper about 5 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

upvoted a paper about 6 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

commented on a paper about 6 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

View all activity

Organizations

None yet

chen-yingfa's activity

authored a paper about 5 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Paper • 2503.09579 • Published about 19 hours ago • 2

upvoted a paper about 6 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Paper • 2503.09579 • Published about 19 hours ago • 2

commented a paper about 6 hours ago

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Paper • 2503.09579 • Published about 19 hours ago • 2 •

updated a dataset 3 months ago

chen-yingfa/CFDBench-raw

Viewer • Updated Dec 12, 2024 • 5.13B • 804

upvoted a paper 4 months ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15, 2024 • 13

authored a paper 4 months ago

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

authored a paper 5 months ago

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Paper • 2410.07145 • Published Oct 9, 2024 • 2

upvoted a paper 5 months ago

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Paper • 2410.07145 • Published Oct 9, 2024 • 2

commented a paper 5 months ago

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Paper • 2410.07145 • Published Oct 9, 2024 • 2 •

updated a dataset 6 months ago

chen-yingfa/CFDBench

Updated Sep 4, 2024 • 107 • 2

updated a dataset 7 months ago

chen-yingfa/CHUBS

Viewer • Updated Aug 20, 2024 • 2.22k • 115

upvoted an article 7 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 60

authored 4 papers 7 months ago

CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics

Paper • 2310.05963 • Published Sep 13, 2023

upvoted a paper 9 months ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22, 2024 • 14

authored a paper 9 months ago

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Paper • 2406.15718 • Published Jun 22, 2024 • 14

New activity in fla-hub/rwkv6-7B-finch 9 months ago

Can you add some details about this model

#1 opened 9 months ago by

chen-yingfa

New activity in xiaol/RWKV-v5-12B-one-state-chat-16k 10 months ago

Please can you provide a example of use the model weights?

#1 opened about 1 year ago by

Ulov888