Update README.md
README.md
CHANGED
@@ -10,6 +10,8 @@ datasets:
 language:
 - en
 base_model: trollek/LittleInstructionMaker-4B-v0.2
+tags:
+- not-for-all-audiences
 ---
 # Tyr-4B-v0.1
 
@@ -19,10 +21,15 @@ Merging seems like the way to go when it comes to training language models on a
 
 ## Model Description
 
-This model was created by training LoRAs and
+This model was created by training LoRAs ([rsLoRA](https://arxiv.org/abs/2312.03732) with [NEFTune](https://arxiv.org/abs/2310.05914) noise alpha at 5) and merging them with [DELLA](https://arxiv.org/abs/2406.11617). This approach saves both space and time, and the result is good. The WhiteRabbitNeo datasets are the focus of this one, along with coding.
 
 Incidentally, it seems uncensored. It was trained using the ChatML template and can be used with or without a system prompt.
 
+I am very grateful for all the different components that made this model possible: from H2O and their danube models, through Huggingface and Llama-Factory for making the fine-tuning easy, to all the great dataset creators. **Thank you!**
+
+### Quants
+- [trollek/Tyr-4B-v0.1-GGUF](https://huggingface.co/trollek/Tyr-4B-v0.1-GGUF)
+
 ## Apache-2.0 + WhiteRabbitNeo Extended Version
 
 ### Licence: Usage Restrictions