DeepAuto-AI
/

Explore_Llama-3.1-8B-Inst

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

bedio commited on Sep 24, 2024

Commit

5261536

·

verified ·

1 Parent(s): e259032

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -107,8 +107,7 @@ model-index:
 **DeepAutoAI/Explore_Llama-3.1-8B-Inst** is developed by **deepAuto.ai** by learning the distribution of llama-3.1-8B-instruct.
 Our approach leverages the base model’s pretrained weights and optimizes them for the **Winogrande** and **ARC-Challenge** datasets by
-training a latent diffusion model on the pretrained weights. specifically , this model is based on learning the distrinution of
-the last transformer block, 30, and 24th FFN layers of the original Llama model.
 Through this process, we learn the distribution of the base model's weight space, enabling us to explore optimal configurations.
 We then sample multiple sets of weights, using the **model-soup averaging technique** to identify the best-performing weights for both datasets.

 **DeepAutoAI/Explore_Llama-3.1-8B-Inst** is developed by **deepAuto.ai** by learning the distribution of llama-3.1-8B-instruct.
 Our approach leverages the base model’s pretrained weights and optimizes them for the **Winogrande** and **ARC-Challenge** datasets by
+training a latent diffusion model on the pretrained weights. specifically , this model is based on learning the distrinution of transformer layers from 16 to 31.
 Through this process, we learn the distribution of the base model's weight space, enabling us to explore optimal configurations.
 We then sample multiple sets of weights, using the **model-soup averaging technique** to identify the best-performing weights for both datasets.