ohno-8x7B-fp16 / README.md
rAIfle's picture
Update README.md
472a0b1 verified
---
base_model:
- Envoid/Mixtral-Instruct-ITR-8x7B
- Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora
- Envoid/Mixtral-Instruct-ITR-8x7B
- retrieval-bar/Mixtral-8x7B-v0.1_case-briefs
- NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
- Envoid/Mixtral-Instruct-ITR-8x7B
tags:
- mergekit
- merge
---
# ohno-8x7b
this... will either be my magnum opus... or terrible. no inbetweens!
Post-test verdict: It's mostly braindamaged. Might be my settings or something, idk.
the `./output` mentioned below is my own merge using identical recipe as [Envoid/Mixtral-Instruct-ITR-8x7B](https://huggingface.co/Envoid/Mixtral-Instruct-ITR-8x7B).
# output_merge2
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method using [Envoid/Mixtral-Instruct-ITR-8x7B](https://huggingface.co/Envoid/Mixtral-Instruct-ITR-8x7B) as a base.
### Models Merged
The following models were included in the merge:
* ./output/ + /ai/LLM/tmp/pefts/daybreak-peft/mixtral-8x7b
* [Envoid/Mixtral-Instruct-ITR-8x7B](https://huggingface.co/Envoid/Mixtral-Instruct-ITR-8x7B) + [Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora](https://huggingface.co/Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora)
* [Envoid/Mixtral-Instruct-ITR-8x7B](https://huggingface.co/Envoid/Mixtral-Instruct-ITR-8x7B) + [retrieval-bar/Mixtral-8x7B-v0.1_case-briefs](https://huggingface.co/retrieval-bar/Mixtral-8x7B-v0.1_case-briefs)
* [NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss](https://huggingface.co/NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: ./output/+/ai/LLM/tmp/pefts/daybreak-peft/mixtral-8x7b
parameters:
density: 0.66
weight: 1.0
- model: Envoid/Mixtral-Instruct-ITR-8x7B+retrieval-bar/Mixtral-8x7B-v0.1_case-briefs
parameters:
density: 0.1
weight: 0.25
- model: Envoid/Mixtral-Instruct-ITR-8x7B+Doctor-Shotgun/limarp-zloss-mixtral-8x7b-qlora
parameters:
density: 0.66
weight: 0.5
- model: NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss
parameters:
density: 0.15
weight: 0.3
merge_method: dare_ties
base_model: Envoid/Mixtral-Instruct-ITR-8x7B
dtype: float16
```