kevin009
/

llama346

@@ -1,22 +1,36 @@
 ---
-base_model: unsloth/Meta-Llama-3.1-8B-Instruct
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- llama
-- trl
 license: apache-2.0
 language:
 - en
 ---
-# Uploaded  model
-- **Developed by:** kevin009
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/Meta-Llama-3.1-8B-Instruct
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
 license: apache-2.0
 language:
 - en
+base_model:
+- meta-llama/Llama-3.1-8B-instruct
+pipeline_tag: text-generation
+tags:
+- lora
+- adapter
+- Math
+- CoT
 ---
+## Model Details
+- Base Model: meta-llama/Llama-3.1-8B-instruct
+- SFT
+## Datasets:
+- 300st random passages human writing
+- 134st customized set (combined synthatic/human writing)
+- 18k MMLU style 1 epoch
+- 5K Math from continuation of llama343
+### Source Adapters
+All source adapters share the following configuration:
+- Rank (r): 16
+- Alpha: 16
+- Target Modules:
+  - q_proj (Query projection)
+  - k_proj (Key projection)
+  - v_proj (Value projection)
+  - o_proj (Output projection)
+  - up_proj (Upsampling projection)
+  - down_proj (Downsampling projection)
+  - gate_proj (Gate projection)