Update README.md
README.md CHANGED
@@ -9,16 +9,16 @@ pipeline_tag: text-generation
 ---
 # Model Card: Neural-phi2
 
-![Poster Image](
+![Poster Image](poster.png)
 
 ## Model Details
 
 - **Model Name**: Neural-phi2
 - **Model Type**: Large Language Model (LLM)
 - **Model Architecture**: A finetuned version of the Phi2 model from Microsoft, utilizing Direct Preference Optimization (DPO) on the `distilabel-intel-orca-dpo-pairs` dataset.
-- **Model Size**: Approximately
+- **Model Size**: Approximately 2B parameters
 - **Training Data**: The model was finetuned on the `distilabel-intel-orca-dpo-pairs` dataset, which consists of chat-like prompts and responses.
-- **Training Procedure**: The Phi2 model was finetuned using the DPO technique
+- **Training Procedure**: The Phi2 model was finetuned using the DPO technique. The training process involved:
 - Loading and formatting the `distilabel-intel-orca-dpo-pairs` dataset
 - Defining the training configuration, including batch size, learning rate, and number of epochs
 - Initializing the DPO Trainer and training the model
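The training-procedure bullets above map onto the usual Hugging Face TRL workflow. The snippet below is a minimal, hypothetical sketch of that flow, not the authors' actual script: the dataset id (`argilla/distilabel-intel-orca-dpo-pairs`), the column names, and all hyperparameters (batch size, learning rate, epochs, `beta`) are assumptions for illustration only.

```python
# Minimal, hypothetical DPO finetuning sketch -- NOT the script used for Neural-phi2.
# Assumes a trl version (~0.9-0.11) where DPOConfig carries `beta` and DPOTrainer
# accepts `tokenizer=`, and that the dataset exposes "input"/"chosen"/"rejected"
# columns; verify both before running.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "microsoft/phi-2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_id)

# 1. Load and format the preference pairs into the prompt/chosen/rejected
#    schema that DPOTrainer expects.
dataset = load_dataset("argilla/distilabel-intel-orca-dpo-pairs", split="train")
dataset = dataset.map(
    lambda row: {
        "prompt": row["input"],
        "chosen": row["chosen"],
        "rejected": row["rejected"],
    }
)

# 2. Define the training configuration: batch size, learning rate, and number
#    of epochs (illustrative values, not the ones used for Neural-phi2).
config = DPOConfig(
    output_dir="neural-phi2-dpo",
    per_device_train_batch_size=2,
    gradient_accumulation_steps=8,
    learning_rate=5e-6,
    num_train_epochs=1,
    beta=0.1,  # strength of the regularisation toward the frozen reference model
)

# 3. Initialize the DPO Trainer and train; when ref_model is not passed,
#    the trainer builds the reference model from a copy of `model`.
trainer = DPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    tokenizer=tokenizer,
)
trainer.train()
trainer.save_model("neural-phi2-dpo")
```

For a single-GPU run, a LoRA/PEFT adapter on top of Phi2 is a common alternative to the full finetune sketched here; the card does not state which approach was used.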