LLuke777 committed on
Commit
26a26b3
·
verified ·
1 Parent(s): 1815c77

Update README.md

  1. README.md +59 -9
README.md CHANGED
@@ -1,29 +1,46 @@
1
  ---
2
  license: llama3.1
3
  language:
4
  - en
5
  base_model:
6
- - meta-llama/Llama-3.1-8B-Instruct
7
  ---
8
 
9
- # Dolphin Llama 3.1 8B Instruct 🐬
10
 
11
  [![Discord](https://img.shields.io/discord/1156064224225808488?logo=Discord&logoColor=%23ffffff&label=Discord&link=https%3A%2F%2Fdiscord.gg%2FtCMkMDDHwm)](https://discord.gg/cognitivecomputations)
12
  Discord: https://discord.gg/cognitivecomputations
13
 
14
- <img src="https://i.postimg.cc/bvWXwnz7/dolphin.webp" width="600" />
15
 
16
  ## Sponsors
17
- Our appreciation for the generous sponsors of Dolphin:
18
  - [Crusoe Cloud](https://crusoe.ai/) - provided 16x L40s for training and evals
19
  - [Akash](https://akash.network/) - provided on-demand 8x H100 for training
20
  - [Lazarus](https://www.lazarusai.com/) - provided 16x H100 for training
21
  - [Cerebras](https://cerebras.ai/) - provided excellent and fast inference services for data labeling
22
  - [Andreessen Horowitz](https://a16z.com/) - provided a [grant](https://a16z.com/supporting-the-open-source-ai-community/) that made Dolphin 1.0 possible and enabled me to bootstrap my homelab
23
 
24
- ## What is Dolphin Llama 3.1 8B Instruct?
25
 
26
- Dolphin Llama 3.1 8B Instruct is a result of our effort to directly uncensor Llama's 3.1 8B instruct-tuned model.
27
 
28
  Dolphin aims to be a general purpose model, similar to the models behind ChatGPT, Claude, and Gemini. But these models present problems for businesses seeking to include AI in their products.
29
  1) They maintain control of the system prompt, deprecating and changing things as they wish, often causing software to break.
@@ -39,7 +56,15 @@ https://erichartford.com/uncensored-models
39
 
40
  ## Chat Template
41
 
42
- We maintained the default Llama chat template for this model.
43
 
44
  ## System Prompt
45
 
@@ -59,7 +84,9 @@ Please implement A* using python<|im_end|>
59
 
60
  ## Sample Outputs
61
 
62
- **add sample outputs here**
63
 
64
  ## How to use
65
 
@@ -71,6 +98,29 @@ There are many ways to use a huggingface model including:
71
  - sglang
72
  - tgi
73
 
74
  ## Evals
75
 
76
- TBD
1
  ---
2
  license: llama3.1
3
+ datasets:
4
+ - OpenCoder-LLM/opc-sft-stage1
5
+ - OpenCoder-LLM/opc-sft-stage2
6
+ - microsoft/orca-agentinstruct-1M-v1
7
+ - microsoft/orca-math-word-problems-200k
8
+ - NousResearch/hermes-function-calling-v1
9
+ - AI-MO/NuminaMath-CoT
10
+ - AI-MO/NuminaMath-TIR
11
+ - allenai/tulu-3-sft-mixture
12
+ - cognitivecomputations/dolphin-coder
13
+ - HuggingFaceTB/smoltalk
14
+ - cognitivecomputations/samantha-data
15
+ - m-a-p/CodeFeedback-Filtered-Instruction
16
+ - m-a-p/Code-Feedback
17
  language:
18
  - en
19
  base_model:
20
+ - meta-llama/Llama-3.1-8B
21
  ---
22
 
23
+ # Dolphin 3.0 Llama 3.1 8B 🐬
24
+ Part of the [Dolphin 3.0 Collection](https://huggingface.co/collections/cognitivecomputations/dolphin-30-677ab47f73d7ff66743979a3)
25
+
26
+ Curated and trained by [Eric Hartford](https://huggingface.co/ehartford), [Ben Gitter](https://huggingface.co/bigstorm), [BlouseJury](https://huggingface.co/BlouseJury) and [Cognitive Computations](https://huggingface.co/cognitivecomputations)
27
 
28
  [![Discord](https://img.shields.io/discord/1156064224225808488?logo=Discord&logoColor=%23ffffff&label=Discord&link=https%3A%2F%2Fdiscord.gg%2FtCMkMDDHwm)](https://discord.gg/cognitivecomputations)
29
  Discord: https://discord.gg/cognitivecomputations
30
 
31
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/cNCs1TBD3FelWCJGkZ3cd.png" width="600" />
32
 
33
  ## Sponsors
34
+ Our appreciation for the generous sponsors of Dolphin 3.0:
35
  - [Crusoe Cloud](https://crusoe.ai/) - provided 16x L40s for training and evals
36
  - [Akash](https://akash.network/) - provided on-demand 8x H100 for training
37
  - [Lazarus](https://www.lazarusai.com/) - provided 16x H100 for training
38
  - [Cerebras](https://cerebras.ai/) - provided excellent and fast inference services for data labeling
39
  - [Andreessen Horowitz](https://a16z.com/) - provided a [grant](https://a16z.com/supporting-the-open-source-ai-community/) that made Dolphin 1.0 possible and enabled me to bootstrap my homelab
40
 
41
+ ## What is Dolphin?
42
 
43
+ Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. It is designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
44
 
45
  Dolphin aims to be a general purpose model, similar to the models behind ChatGPT, Claude, and Gemini. But these models present problems for businesses seeking to include AI in their products.
46
  1) They maintain control of the system prompt, deprecating and changing things as they wish, often causing software to break.
 
56
 
57
  ## Chat Template
58
 
59
+ We use ChatML for the chat template.
60
+
61
+ ```
62
+ <|im_start|>system
63
+ You are Dolphin, a helpful AI assistant.<|im_end|>
64
+ <|im_start|>user
65
+ {prompt}<|im_end|>
66
+ <|im_start|>assistant
67
+ ```
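
As a minimal sketch of how the ChatML layout above is assembled, the following Python renders a list of messages into that format by hand. The function name `to_chatml` and the message dictionary shape are illustrative assumptions, not part of this repository; in practice the chat template shipped with the tokenizer would normally do this for you.

```python
def to_chatml(messages, add_generation_prompt=True):
    """Render [{'role': ..., 'content': ...}, ...] as a ChatML prompt string."""
    parts = []
    for message in messages:
        # Each turn is wrapped as <|im_start|>{role}\n{content}<|im_end|>
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


prompt = to_chatml([
    {"role": "system", "content": "You are Dolphin, a helpful AI assistant."},
    {"role": "user", "content": "Please implement A* using python"},
])
print(prompt)
```

Passing `add_generation_prompt=False` produces only the closed turns, which is the form one would use when rendering a finished conversation rather than prompting for a new reply.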
68
 
69
  ## System Prompt
70
 
 
84
 
85
  ## Sample Outputs
86
 
87
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/C-r1X13UBjnUUNb0q2JLV.png" width="600" />
88
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/4l3KAZiKej2ON7i35PsOa.png" width="600" />
89
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/1ZalmR66LnwhEQQEFttlu.png" width="600" />
90
 
91
  ## How to use
92
 
 
98
  - sglang
99
  - tgi
100
 
101
+ ### ollama
102
+ - [Install ollama](https://ollama.com/download)
103
+ - ```ollama run hf.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B-GGUF:Q4_0```
104
+ - ```/set system <your system prompt>```
105
+
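
Once the model is running under ollama, it can also be queried from code. The sketch below builds a request for ollama's local `/api/chat` endpoint using only the Python standard library. It assumes the server is listening on the default `http://localhost:11434` and that the model tag matches the `ollama run` command above; the helper names `build_chat_payload` and `chat` are illustrative.

```python
import json
import urllib.request

MODEL = "hf.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B-GGUF:Q4_0"
OLLAMA_URL = "http://localhost:11434/api/chat"  # assumed default address


def build_chat_payload(user_prompt, system_prompt="You are Dolphin, a helpful AI assistant."):
    """Build the JSON body for a single-turn, non-streaming chat request."""
    return {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_prompt},
        ],
        "stream": False,  # ask for one complete JSON response
    }


def chat(user_prompt, system_prompt="You are Dolphin, a helpful AI assistant."):
    """Send the request to a locally running ollama server and return the reply text."""
    payload = build_chat_payload(user_prompt, system_prompt)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["message"]["content"]


# Requires a running ollama server with the model pulled:
# reply = chat("Please implement A* using python")
```

Setting the system message in the payload plays the same role as `/set system` in the interactive session.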
106
  ## Evals
107
 
108
+ TBD
109
+
110
+ ## Appreciation
111
+
112
+ Respect and thanks to the creators of the open source datasets that were used:
113
+ - [OpenCoder-LLM](https://huggingface.co/OpenCoder-LLM) (opc-sft-stage1, opc-sft-stage2)
114
+ - [microsoft](https://huggingface.co/microsoft) (orca-agentinstruct-1M-v1, orca-math-word-problems-200k)
115
+ - [NousResearch](https://huggingface.co/NousResearch) (hermes-function-calling-v1)
116
+ - [AI-MO](https://huggingface.co/AI-MO) (NuminaMath-CoT, NuminaMath-TIR)
117
+ - [allenai](https://huggingface.co/allenai) (tulu-3-sft-mixture)
118
+ - [HuggingFaceTB](https://huggingface.co/HuggingFaceTB) (smoltalk)
119
+ - [m-a-p](https://huggingface.co/m-a-p) (CodeFeedback-Filtered-Instruction, Code-Feedback)
120
+
121
+ Special thanks to
122
+ - Meta, Qwen, and OpenCoder, who wrote papers and published models that were instrumental in creating Dolphin 3.0.
123
+ - [RLHFlow](https://huggingface.co/RLHFlow) for the excellent reward model used to filter the datasets
124
+ - DeepSeek, for the ridiculously fast DeepSeek-V3 that we used to augment the data.
125
+
126
+