Commit
Β·
5e191d2
1
Parent(s):
349443b
modified model card
Browse files
README.md
CHANGED
@@ -25,15 +25,14 @@ Training was conducted on a single NVIDIA DGX node:
|
|
25 |
|
26 |
- Hardware: **8Γ NVIDIA A100 GPUs (80GB HBM each)**
|
27 |
- Training duration: **1440 GPU hours** (~180 hours wall-clock on 8 GPUs)
|
28 |
-
- Total
|
29 |
|
30 |
### π Training Loss Curve
|
31 |
|
32 |
Here's the training loss progression:
|
33 |
|
34 |
-
```markdown
|
35 |

|
36 |
-
|
37 |
|
38 |
### π Example Outputs
|
39 |
Below are generated examples illustrating Argonne-1.0's style and capability when prompted:
|
|
|
25 |
|
26 |
- Hardware: **8Γ NVIDIA A100 GPUs (80GB HBM each)**
|
27 |
- Training duration: **1440 GPU hours** (~180 hours wall-clock on 8 GPUs)
|
28 |
+
- Total steps: **160,000 global steps**
|
29 |
|
30 |
### π Training Loss Curve
|
31 |
|
32 |
Here's the training loss progression:
|
33 |
|
|
|
34 |

|
35 |
+
|
36 |
|
37 |
### π Example Outputs
|
38 |
Below are generated examples illustrating Argonne-1.0's style and capability when prompted:
|