TareksGraveyard
/

Ethos-Alpha-LLaMa-70B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Ethos-Alpha-LLaMa-70B / README.md

Tarek07's picture

Update README.md

3d62d96 verified 8 days ago

|

2.44 kB

	---
	base_model:
	- Sao10K/L3.3-70B-Euryale-v2.3
	- Sao10K/70B-L3.3-mhnnn-x1
	- EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
	- Doctor-Shotgun/L3.3-70B-Magnum-v4-SE
	- Sao10K/70B-L3.3-Cirrus-x1
	- meta-llama/Llama-3.3-70B-Instruct
	library_name: transformers
	tags:
	- mergekit
	- merge
	license: llama3.3
	---
	Part of a multi merge experiment.
	The idea behind it is to create 3 individual models:
	- Pathos: For ERP and uncensored NSFW content
	- Ethos: For prose and storytelling
	- Logos: For intelligence and awareness

	The three models above will then be combined into:
	- Kairos: The best of all three hopefully.

	I will be using differnet merge methods for these merges in an attempt to find the best combinations hence the Alpha, Beta and Delta tags you will see on each which represent different merge methods.
	# merge

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [Linear DELLA](https://arxiv.org/abs/2406.11617) merge method using [meta-llama/Llama-3.3-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) as a base.

	### Models Merged

	The following models were included in the merge:
	* [Sao10K/L3.3-70B-Euryale-v2.3](https://huggingface.co/Sao10K/L3.3-70B-Euryale-v2.3)
	* [Sao10K/70B-L3.3-mhnnn-x1](https://huggingface.co/Sao10K/70B-L3.3-mhnnn-x1)
	* [EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1](https://huggingface.co/EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1)
	* [Doctor-Shotgun/L3.3-70B-Magnum-v4-SE](https://huggingface.co/Doctor-Shotgun/L3.3-70B-Magnum-v4-SE)
	* [Sao10K/70B-L3.3-Cirrus-x1](https://huggingface.co/Sao10K/70B-L3.3-Cirrus-x1)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: Doctor-Shotgun/L3.3-70B-Magnum-v4-SE
	parameters:
	weight: 0.20
	density: 0.7
	- model: Sao10K/70B-L3.3-mhnnn-x1
	parameters:
	weight: 0.20
	density: 0.7
	- model: Sao10K/70B-L3.3-Cirrus-x1
	parameters:
	weight: 0.20
	density: 0.7
	- model: EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
	parameters:
	weight: 0.20
	density: 0.7
	- model: Sao10K/L3.3-70B-Euryale-v2.3
	parameters:
	weight: 0.20
	density: 0.7
	merge_method: della_linear
	base_model: meta-llama/Llama-3.3-70B-Instruct
	parameters:
	epsilon: 0.2
	lambda: 1.1
	out_dtype: bfloat16
	tokenizer:
	source: Sao10K/L3.3-70B-Euryale-v2.3
	```