Article: KV Caching Explained: Optimizing Transformer Inference Efficiency • By not-lain • 13 days ago
Collection: Open LLM Leaderboard best models ❤️🔥 • A daily updated list of the models with the best evaluations on the LLM leaderboard • 64 items • Updated 1 day ago
Paper: Lost in the Middle: How Language Models Use Long Contexts • 2307.03172 • Published Jul 6, 2023