Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ The model has hybrid architecture with Mamba and Attention heads running in para
 
 This model is ready for commercial use.
 
-**[Model Weights Coming Soon]**
+**[Model Weights Coming Soon, expected Nov 25th]**
 
 **[Caution] During generation, the batch size needs to be 1. Our current implementation does not fully support padding of Meta tokens + SWA; this is a work in progress. Training and pre-filling support any batch size.**
 
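
The caution in this hunk means that callers with several prompts need to run generation one prompt at a time. Below is a minimal sketch of that pattern, not the model's actual API: `generate_sequentially`, `generate_fn`, and `fake_generate` are hypothetical names introduced for illustration, and `generate_fn` stands in for whatever generation call the released model exposes.

```python
from typing import Callable, Iterable, List


def generate_sequentially(
    prompts: Iterable[str],
    generate_fn: Callable[[List[str]], List[str]],
) -> List[str]:
    """Run generation with a batch size of exactly 1 for every prompt.

    `generate_fn` is a hypothetical stand-in for the real generation call.
    It is assumed to take a list of prompts and return one output per prompt.
    """
    outputs: List[str] = []
    for prompt in prompts:
        batch = [prompt]  # never more than one prompt, so no padding is needed
        result = generate_fn(batch)
        if len(result) != 1:
            raise ValueError("expected exactly one output for a batch of size 1")
        outputs.append(result[0])
    return outputs


def fake_generate(batch: List[str]) -> List[str]:
    """Placeholder generator for demonstration; it only upper-cases the prompt."""
    if len(batch) != 1:
        raise ValueError("generation supports batch size 1 only")
    return [batch[0].upper()]


if __name__ == "__main__":
    print(generate_sequentially(["hello", "world"], fake_generate))
```

Since the README states that training and pre-filling support any batch size, a wrapper like this would only be needed around the generation step.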