divykum committed on
Commit b42852d · verified · 1 Parent(s): ad081bc

Update README.md

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -24,14 +24,14 @@ We introduce Bamba-9B-v2, a decoder-only language model based on the [Mamba-2](h
 
 
  The current release includes the following models:
- | **Stage** | **Bamba 9B** | **Quantized** | **Note** |
- |----------------------|----------------------------------------------------------------------|-------------------------------------------------------------------------|-------------------------------------------------------------------|
- | **Base Model** | [ibm-fms/Bamba-9B-v2](https://huggingface.co/ibm-ai-platform/Bamba-9B-v2) | coming soon | Stage 2 pretraining + Annealing |
- | **Base Model** | [ibm-fms/Bamba-9B-v1](https://huggingface.co/ibm-fms/Bamba-9B-v1) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 2 pretraining |
- | **Base Model** | [ibm-fms/Bamba-9B-2T](https://huggingface.co/ibm-fms/Bamba-9B-2T) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 1 pretraining |
- | **Base Model** | [ibm-fms/Bamba-9B-1.8T](https://huggingface.co/ibm-fms/Bamba-9B-1.8T)| [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Intermediate checkpoints during Stage 1, more to come |
- | **SFT** | coming soon | coming soon | to be released in the next drop |
- | **DPO** | coming soon | coming soon | to be released in the next drop |
+ | **Stage** | **Bamba 9B** | **Quantized** | **Note** |
+ |----------------------|----------------------------------------------------------------------------|-------------------------------------------------------------------------|-------------------------------------------------------------------|
+ | **Base Model** | [ibm-fms/Bamba-9B-v2](https://huggingface.co/ibm-ai-platform/Bamba-9B-v2) | coming soon | Stage 2 pretraining + Annealing |
+ | **Base Model** | [ibm-fms/Bamba-9B-v1](https://huggingface.co/ibm-ai-platform/Bamba-9B-v1) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 2 pretraining |
+ | **Base Model** | [ibm-fms/Bamba-9B-2T](https://huggingface.co/ibm-fms/Bamba-9B-2T) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Stage 1 pretraining |
+ | **Base Model** | [ibm-fms/Bamba-9B-1.8T](https://huggingface.co/ibm-fms/Bamba-9B-1.8T) | [ibm-fms/Bamba-9B-fp8](https://huggingface.co/ibm-fms/Bamba-9B-fp8) | Intermediate checkpoints during Stage 1, more to come |
+ | **SFT** | coming soon | coming soon | to be released in the next drop |
+ | **DPO** | coming soon | coming soon | to be released in the next drop |
 
  Original checkpoints (in dcp format) were also uploaded to public bucket:
  ```