1 13 5

Heejun Lee

gmlwns5176

gmlwns2000

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

upvoted a paper about 21 hours ago

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

upvoted a paper 2 days ago

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

View all activity

Organizations

None yet

gmlwns5176's activity

upvoted 2 papers about 21 hours ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 5 days ago • 128

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published 1 day ago • 19

upvoted 2 papers 2 days ago

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Paper • 2502.12464 • Published 3 days ago • 26

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 4 days ago • 48

upvoted a paper 4 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 7 days ago • 74

upvoted a paper 5 days ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 10 days ago • 42

commented a paper 5 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 8 days ago • 138 •

upvoted a paper 6 days ago

Typhoon T1: An Open Thai Reasoning Model

Paper • 2502.09042 • Published 8 days ago • 15

commented a paper 7 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 8 days ago • 138 •

authored a paper 7 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 8 days ago • 138

liked a model 7 days ago

hbseong/HarmAug-Guard

Text Classification • Updated Oct 14, 2024 • 471 • 38

upvoted a paper 7 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 8 days ago • 138

authored a paper 10 days ago

HiP Attention: Sparse Sub-Quadratic Attention with Hierarchical Attention Pruning

Paper • 2406.09827 • Published Jun 14, 2024 • 2

upvoted a paper 11 days ago

HiP Attention: Sparse Sub-Quadratic Attention with Hierarchical Attention Pruning

Paper • 2406.09827 • Published Jun 14, 2024 • 2

upvoted an article 28 days ago

Article

Mastering Long Contexts in LLMs with KVPress

and 1 other •

29 days ago

• 63

upvoted a paper about 1 month ago

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10 • 67

upvoted a paper 3 months ago

VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding

Paper • 2412.02186 • Published Dec 3, 2024 • 22

liked a model 3 months ago

hmlee/exaone_pruned

Updated Nov 29, 2024 • 1

liked a dataset 3 months ago

xinrongzhang2022/InfiniteBench

Preview • Updated Oct 8, 2024 • 6.53k • 27

upvoted a paper 11 months ago

SEA: Sparse Linear Attention with Estimated Attention Mask

Paper • 2310.01777 • Published Oct 3, 2023 • 1