Update README.md
README.md
CHANGED
@@ -24,14 +24,14 @@ We introduce Bamba-9B-v2, a decoder-only language model based on the [Mamba-2](h
 
 
 The current release includes the following models:
-| **Stage** | **Bamba 9B**
-
-| **Base Model** | [ibm-fms/Bamba-9B-v2](https://huggingface.co/ibm-ai-platform/Bamba-9B-v2)
-| **Base Model** | [ibm-fms/Bamba-9B-v1](https://huggingface.co/ibm-
-| **Base Model** | [ibm-fms/Bamba-9B-2T](https://huggingface.co/ibm-fms/Bamba-9B-2T)
-| **Base Model** | [ibm-fms/Bamba-9B-1.8T](https://huggingface.co/ibm-fms/Bamba-9B-1.8T)| [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Intermediate checkpoints during Stage 1, more to come |
-| **SFT** | coming soon
-| **DPO** | coming soon
+| **Stage** | **Bamba 9B** | **Quantized** | **Note** |
+|----------------------|----------------------------------------------------------------------------|-------------------------------------------------------------------------|-------------------------------------------------------------------|
+| **Base Model** | [ibm-fms/Bamba-9B-v2](https://huggingface.co/ibm-ai-platform/Bamba-9B-v2) | coming soon | Stage 2 pretraining + Annealing |
+| **Base Model** | [ibm-fms/Bamba-9B-v1](https://huggingface.co/ibm-ai-platform/Bamba-9B-v1) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 2 pretraining |
+| **Base Model** | [ibm-fms/Bamba-9B-2T](https://huggingface.co/ibm-fms/Bamba-9B-2T) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 1 pretraining |
+| **Base Model** | [ibm-fms/Bamba-9B-1.8T](https://huggingface.co/ibm-fms/Bamba-9B-1.8T) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Intermediate checkpoints during Stage 1, more to come |
+| **SFT** | coming soon | coming soon | to be released in the next drop |
+| **DPO** | coming soon | coming soon | to be released in the next drop |
 
 Original checkpoints (in dcp format) were also uploaded to public bucket:
 ```