Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
14
3
1
Simon Mo
simon-mo
Follow
ayub07's profile picture
Glowin's profile picture
shixiaoshi's profile picture
18 followers
Β·
10 following
https://github.com/simon-mo
simon_mo_
simon-mo
AI & ML interests
System for ML
Recent Activity
new
activity
about 7 hours ago
openai/gpt-oss-120b:
[v1 engine][flash_attn backend] TypeError: flash_attn_varlen_func() got an unexpected keyword argument 's_aux' when running gpt-oss-120b on H200
new
activity
1 day ago
openai/gpt-oss-120b:
VLLM - Flash-attn 3
reacted
to
erikkaum
's
post
with π€
20 days ago
We just released native support for @SGLang and @vllm-project in Inference Endpoints π₯ Inference Endpoints is becoming the central place where you deploy high performance Inference Engines. And that provides the managed infra for it. Instead of spending weeks configuring infrastructure, managing servers, and debugging deployment issues, you can focus on what matters most: your AI model and your users π
View all activity
Organizations
models
0
None public yet
datasets
0
None public yet