liudekai

ShadowWolf1999

letmego2022

AI & ML interests

None yet

Recent Activity

liked a Space 15 days ago

Ki-Seki/ultrascale-playbook-zh-cn

liked a model 15 days ago

moonshotai/Moonlight-16B-A3B-Instruct

Organizations

None yet

ShadowWolf1999's activity

reacted to onekq's post with 🤗 15 days ago

Post

2760

Necessity is the mother of invention. To understand ⚡FlashMLA⚡ by 🐋DeepSeek🐋, the first question to ask is why.

The keyword here is H800, a lower-end GPU tailored to comply with export controls. The purpose is to squeeze as much performance as possible out of it.

But here is the most important takeaway: this invention benefits EVERYONE.

2 replies

liked a Space 15 days ago

183

The Ultimate Guide to LLM Training | The Ultra-Scale Playbook

🔥

Learn every aspect of LLM training

liked 2 models 15 days ago

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 10 days ago • 4.36k • 131

Wan-AI/Wan2.1-T2V-1.3B

Text-to-Video • Updated 12 days ago • 21.9k • 272

New activity in deepseek-ai/DeepSeek-R1 about 2 months ago

this is the killer

#1 opened about 2 months ago by blackcat1402

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.75M • 11.3k

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 10 days ago • 333k • 1.04k

reacted to mitkox's post with 🔥 2 months ago

Post

2487

"Can it run DeepSeek V3 671B" is the new "Can it run Doom".

How minimal can I go with on-device AI and behemoth models? Here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimal setup. I love Mixture of Experts architectures. Typically I run my core LLM distributed across 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.

5 replies

liked 2 models 2 months ago

xey/sldr_flux_nsfw_v2-studio

Text-to-Image • Updated Jan 14 • 236k • 230

cognitivecomputations/Dolphin3.0-Llama3.1-8B

Updated Jan 5 • 3.9k • 154

liked a model 3 months ago

Datou1111/shou_xin

Text-to-Image • Updated Dec 9, 2024 • 1.99k • 866