Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
ytol
's Collections
Multimodal agents (robotics)
Robotics stack
Vision-Language-Action Models
Robotics stack
updated
Apr 23, 2024
Upvote
-
openai/whisper-base
Automatic Speech Recognition
•
Updated
Feb 29, 2024
•
5.36M
•
•
205
HuggingFaceM4/idefics2-8b-AWQ
Image-Text-to-Text
•
Updated
May 6, 2024
•
174
•
26
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
•
Updated
Apr 30, 2024
•
11.7k
•
350
dora-rs/dora-idefics2
Updated
May 5, 2024
•
201
•
5
MIT/ast-finetuned-speech-commands-v2
Audio Classification
•
Updated
Sep 10, 2023
•
8.51k
•
14
jxu124/OpenX-Embodiment
Updated
Oct 16, 2024
•
3.35k
•
53
LiheYoung/depth-anything-small-hf
Depth Estimation
•
Updated
Jan 25, 2024
•
111k
•
28
ybelkada/segment-anything
Updated
Dec 26, 2023
•
96
Upvote
-
Share collection
View history
Collection guide
Browse collections