innovation-hacking2 commited on
Commit
8d65448
·
verified ·
1 Parent(s): bfa2138

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -10,4 +10,6 @@ library_name: adapter-transformers
10
  tags:
11
  - reasoning
12
  - math
13
- ---
 
 
 
10
  tags:
11
  - reasoning
12
  - math
13
+ ---
14
+
15
+ TNG Technology Consulting fine-tuned the 32-billion-parameter OLMo-2 Large Language Model using AMD's MI300X GPUs and the Open R1 dataset, focusing on enhancing the model's reasoning capabilities. The MI300X accelerators, with their multi-chip module architecture and substantial memory bandwidth, facilitated efficient handling of the model's training requirements. The Open R1 dataset, curated by Hugging Face, provided a comprehensive collection of mathematical problems with detailed reasoning traces, serving as an ideal foundation for this fine-tuning endeavor. This collaborative effort underscores the potential of open-source initiatives and advanced hardware in advancing AI research.