Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
llama2thedog
/
mistral-7B-qlora-grpo-100
like
0
Text Generation
Transformers
Safetensors
mistral
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
mistral-7B-qlora-grpo-100
1 contributor
History:
3 commits
llama2thedog
Upload tokenizer
b02f1b9
verified
about 15 hours ago
.gitattributes
1.52 kB
initial commit
about 15 hours ago
README.md
5.17 kB
Upload MistralForCausalLM
about 15 hours ago
config.json
743 Bytes
Upload MistralForCausalLM
about 15 hours ago
generation_config.json
157 Bytes
Upload MistralForCausalLM
about 15 hours ago
model-00001-of-00003.safetensors
4.95 GB
LFS
Upload MistralForCausalLM
about 15 hours ago
model-00002-of-00003.safetensors
5 GB
LFS
Upload MistralForCausalLM
about 15 hours ago
model-00003-of-00003.safetensors
4.55 GB
LFS
Upload MistralForCausalLM
about 15 hours ago
model.safetensors.index.json
24 kB
Upload MistralForCausalLM
about 15 hours ago
special_tokens_map.json
560 Bytes
Upload tokenizer
about 15 hours ago
tokenizer.json
3.67 MB
Upload tokenizer
about 15 hours ago
tokenizer.model
587 kB
LFS
Upload tokenizer
about 15 hours ago
tokenizer_config.json
141 kB
Upload tokenizer
about 15 hours ago