Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ C-RADIOv2 models are available in multiple sizes:
|
|
15 |
* Base (90M parameters).
|
16 |
* Large (320M parameters).
|
17 |
* Huge (653M parameters).
|
18 |
-
* Gigantic (1.
|
19 |
|
20 |
C-RADIOv2 was trained for 1M steps (400k more steps than v1), using inverse frequency sampling for data balancing, and [PHI Standardization](https://arxiv.org/abs/2410.01680) for teacher distribution balancing.
|
21 |
|
@@ -134,7 +134,7 @@ The resulting tensor will have shape `(B,D,H,W)`, as is typically seen with comp
|
|
134 |
* https://huggingface.co/nvidia/C-RADIOv2-B
|
135 |
* https://huggingface.co/nvidia/C-RADIOv2-L
|
136 |
* https://huggingface.co/nvidia/C-RADIOv2-H
|
137 |
-
* https://huggingface.co/nvidia/C-RADIOv2-
|
138 |
|
139 |
# Training and Evaluation Datasets
|
140 |
|
|
|
15 |
* Base (90M parameters).
|
16 |
* Large (320M parameters).
|
17 |
* Huge (653M parameters).
|
18 |
+
* Gigantic (1.1B parameters).
|
19 |
|
20 |
C-RADIOv2 was trained for 1M steps (400k more steps than v1), using inverse frequency sampling for data balancing, and [PHI Standardization](https://arxiv.org/abs/2410.01680) for teacher distribution balancing.
|
21 |
|
|
|
134 |
* https://huggingface.co/nvidia/C-RADIOv2-B
|
135 |
* https://huggingface.co/nvidia/C-RADIOv2-L
|
136 |
* https://huggingface.co/nvidia/C-RADIOv2-H
|
137 |
+
* https://huggingface.co/nvidia/C-RADIOv2-g
|
138 |
|
139 |
# Training and Evaluation Datasets
|
140 |
|