Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Ji-Xiang
's Collections
Reasoning datasets
Test-time scaling Datasets
RLVR Datasets
Thinking/Reasoning Datasets
WebGPU
RLHF Datasets
HTML to Markdown
Math Datasets
Logical Reasoning Datasets
Multilingual-dataset
Object Detection
rag dataset
image-to-video
Multilingual Large Language Models
SFT Datasets
Recommended Datasets
Coder LLM
Text-to-Video
Multimodal Language Models
Image Chatbot
traditional-chinese-dataset
Suggest Spaces
Suggestion Models
Chinese models
China models
Uncensored models
china-dataset
common-dataset
unfiltered dataset
Image Generator AI
Edge Computing
Voice
Medical
Big Language Models
GGUF Models
TTS
Visual Question Answering
Chat
Multi Tasks
Vision
DPO datasets
ORPO-DPO datasets
Code dataset
SLM (small language models)
automatic speech recognition (ASR)
Vision-Language dataset
MoE
Dense Passage Retrieval (DPR) Datasets
Audio-To-Text
background-removal
Extreme Quantization
Try on
common-dataset
updated
about 1 month ago
Upvote
-
HuggingFaceH4/ultrachat_200k
Viewer
•
Updated
Oct 16, 2024
•
515k
•
14.1k
•
507
epfl-llm/meditron-7b
Text Generation
•
Updated
Dec 7, 2023
•
1.68k
•
267
shareAI/ShareGPT-Chinese-English-90k
Preview
•
Updated
Aug 16, 2024
•
623
•
248
bigcode/starcoderdata
Viewer
•
Updated
May 16, 2023
•
207M
•
2.54k
•
412
lmsys/chatbot_arena_conversations
Viewer
•
Updated
Sep 30, 2023
•
33k
•
1.07k
•
364
tiiuae/falcon-refinedweb
Viewer
•
Updated
Jun 20, 2023
•
968M
•
23.9k
•
832
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer
•
Updated
Mar 10, 2024
•
70k
•
931
•
189
LargeWorldModel/LWM-Text-Chat-1M
Text Generation
•
Updated
Feb 11, 2024
•
1.4k
•
175
Salesforce/wikisql
Updated
Jan 18, 2024
•
1.55k
•
109
microsoft/orca-math-word-problems-200k
Viewer
•
Updated
Mar 4, 2024
•
200k
•
2.03k
•
433
shareAI/CodeChat
Preview
•
Updated
Mar 25, 2024
•
84
•
27
HuggingFaceFW/fineweb
Viewer
•
Updated
12 days ago
•
25B
•
492k
•
1.92k
Yukang/LongAlpaca-16k-length
Viewer
•
Updated
Nov 18, 2023
•
6.28k
•
67
•
25
yahma/alpaca-cleaned
Viewer
•
Updated
Apr 10, 2023
•
51.8k
•
17.5k
•
631
emozilla/dolma-v1_7-305B
Viewer
•
Updated
May 13, 2024
•
343M
•
341
•
9
NousResearch/json-mode-eval
Viewer
•
Updated
Feb 21, 2024
•
100
•
1.3k
•
33
NousResearch/func-calling-eval-singleturn
Viewer
•
Updated
Jan 31, 2024
•
112
•
58
•
6
NousResearch/func-calling-eval-glaive
Viewer
•
Updated
Feb 6, 2024
•
100
•
68
•
7
legacy-datasets/wikipedia
Updated
Mar 11, 2024
•
27.4k
•
574
allenai/c4
Viewer
•
Updated
Jan 9, 2024
•
10.4B
•
541k
•
366
open-web-math/open-web-math
Viewer
•
Updated
Oct 17, 2023
•
6.32M
•
5.93k
•
295
codeparrot/github-code-clean
Viewer
•
Updated
Jul 5, 2022
•
11M
•
7.55k
•
116
HuggingFaceFW/fineweb-edu-score-2
Viewer
•
Updated
12 days ago
•
13.1B
•
25.3k
•
70
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
12 days ago
•
3.3B
•
488k
•
617
tatsu-lab/alpaca
Viewer
•
Updated
May 22, 2023
•
52k
•
29.5k
•
728
YeungNLP/ultrachat
Viewer
•
Updated
Jun 19, 2023
•
772k
•
55
•
22
YeungNLP/WizardLM_evol_instruct_V2_143k
Viewer
•
Updated
Jul 2, 2023
•
143k
•
31
•
11
Open-Orca/OpenOrca
Viewer
•
Updated
Oct 21, 2023
•
2.91M
•
11k
•
1.36k
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer
•
Updated
Mar 10, 2024
•
143k
•
291
•
234
timdettmers/openassistant-guanaco
Viewer
•
Updated
May 27, 2023
•
10.4k
•
8.54k
•
425
garage-bAInd/Open-Platypus
Viewer
•
Updated
Jan 24, 2024
•
24.9k
•
4.05k
•
379
Salesforce/wikitext
Viewer
•
Updated
Jan 4, 2024
•
3.71M
•
434k
•
402
Salesforce/dialogstudio
Updated
19 days ago
•
1.03k
•
219
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
19 days ago
•
60k
•
2.22k
•
413
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
Sep 6, 2024
•
237M
•
10.4k
•
294
glaiveai/glaive-function-calling-v2
Viewer
•
Updated
Sep 27, 2023
•
113k
•
783
•
410
mlfoundations/dclm-baseline-1.0-parquet
Viewer
•
Updated
Jul 19, 2024
•
2.73B
•
11.6k
•
25
mlfoundations/dclm-baseline-1.0
Preview
•
Updated
Jul 22, 2024
•
175k
•
198
ruslanmv/ai-medical-chatbot
Viewer
•
Updated
Mar 23, 2024
•
257k
•
6.77k
•
211
mlabonne/FineTome-100k
Viewer
•
Updated
Jul 29, 2024
•
100k
•
12.5k
•
161
PleIAs/common_corpus
Viewer
•
Updated
about 21 hours ago
•
470M
•
11.3k
•
217
xzuyn/manythings-translations-alpaca
Viewer
•
Updated
Aug 1, 2023
•
6.33M
•
120
•
6
BAAI/Infinity-Instruct
Viewer
•
Updated
27 days ago
•
20.4M
•
5.4k
•
589
arcee-ai/The-Tome
Viewer
•
Updated
Aug 15, 2024
•
1.75M
•
260
•
82
mlabonne/open-perfectblend
Viewer
•
Updated
28 days ago
•
1.42M
•
156
•
47
mlabonne/orca-agentinstruct-1M-v1-cleaned
Viewer
•
Updated
18 days ago
•
1.05M
•
256
•
55
allenai/tulu-3-sft-mixture
Viewer
•
Updated
Dec 2, 2024
•
939k
•
4.51k
•
104
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
29 days ago
•
16.4k
•
4.68k
•
172
Upvote
-
Share collection
View history
Collection guide
Browse collections