Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-GGUF

Quantized GGUF model files for phi-2-sft-dpo-gpt4_en-ep1 from Yhyu13

Original Model Card:

This is the merged model for the LoRA adapter https://huggingface.co/Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-lora

This model is a DPO improvement over the base model https://huggingface.co/Yhyu13/phi-2-sft-alpaca_gpt4_en-ep1, which scores better than text-davinci-003 on AlpacaEval as judged by ChatGPT.

Format: GGUF
Model size: 2.78B params
Architecture: phi2
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
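
The parameter count and bit widths listed above give a rough lower bound on the size of each quantized file. The sketch below multiplies the two and converts to gigabytes. It is an approximation only: it assumes every weight is stored at exactly the nominal bit width, whereas real GGUF files carry metadata and typically keep some tensors at higher precision, so actual files will be somewhat larger. The function name `estimate_size_gb` is hypothetical and not part of any library.

```python
def estimate_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough lower bound on file size in gigabytes (1 GB = 1e9 bytes).

    Assumes every parameter is stored at exactly `bits_per_weight` bits,
    which ignores file metadata and mixed-precision tensors.
    """
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 2.78e9  # parameter count stated on the model card

# Nominal bit widths listed on the model card.
for bits in (2, 3, 4, 5, 6, 8):
    print(f"{bits}-bit: ~{estimate_size_gb(N_PARAMS, bits):.2f} GB")
```

This kind of estimate is mainly useful for checking whether a given quantization is likely to fit in available RAM or VRAM before downloading it.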

