Song Qiang

namespace-sq

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Open-R1: a fully open reproduction of DeepSeek-R1

liked a Space 9 months ago

HuggingFaceFW/blogpost-fineweb-v1

liked a model 11 months ago

CohereForAI/c4ai-command-r-plus

View all activity

Organizations

None yet

namespace-sq's activity

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 808

liked a Space 9 months ago

875

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model 11 months ago

CohereForAI/c4ai-command-r-plus

Text Generation • Updated Sep 27, 2024 • 3.71k • 1.72k

upvoted a paper about 1 year ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 39

liked 2 models about 1 year ago

BAAI/bge-m3

togethercomputer/m2-bert-80M-32k-retrieval

upvoted a paper about 1 year ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

upvoted 2 collections about 1 year ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 234

MoEs papers reading list

Collection

60 items • Updated Nov 4, 2024 • 141

liked a model about 1 year ago

microsoft/phi-2

Text Generation • Updated Apr 29, 2024 • 391k • • 3.29k

liked a Space about 1 year ago

915

Model Memory Utility

🚀

Calculate memory needed to train AI models

liked a Space over 1 year ago

2.09k

Whisper

📉

Transcribe audio from microphone, file, or YouTube link

liked 3 models over 1 year ago

liked 2 datasets over 1 year ago

BAAI/COIG-PC

Viewer • Updated Jun 14, 2024 • 540M • 151 • 267

garage-bAInd/Open-Platypus

Viewer • Updated Jan 24, 2024 • 24.9k • 3.83k • 382

liked 3 models over 1 year ago

garage-bAInd/Platypus2-70B-instruct

Text Generation • Updated Jan 4, 2024 • 3.73k • 174

THUDM/codegeex2-6b

Updated Dec 10, 2024 • 363 • 253

stabilityai/stablecode-completion-alpha-3b-4k

Text Generation • Updated Aug 8, 2023 • 2.31k • 282