---
license: llama3.3
base_model:
- deepseek-ai/DeepSeek-R1-Distill-Llama-70B
library_name: transformers
tags:
- not-for-all-audiences
- nsfw
- mergekit
- merge
---

# L3.3-70B-Lycosa-v0.1

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

An RP merge with a focus on:
- model intelligence
- removing positive bias
- creativity

This model was merged using the SCE merge method with deepseek-ai/DeepSeek-R1-Distill-Llama-70B as the base.

Note: forcing the Llama 3.3 chat template can sometimes yield better results; the DeepSeek chat template is the default provided in the config. One way to swap templates with `transformers` is sketched below.
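
A minimal sketch of forcing the Llama 3.3 chat template at prompt-build time, assuming the `transformers` library and access to a tokenizer that ships that template (the `meta-llama/Llama-3.3-70B-Instruct` repo id below is an assumption; any tokenizer carrying the llama3.3 template will do):

```python
from transformers import AutoTokenizer

# Tokenizer for this merge; it carries the DeepSeek chat template by default.
tokenizer = AutoTokenizer.from_pretrained("divinetaco/L3.3-70B-Lycosa-v0.1")

# Borrow the chat template from a Llama 3.3 Instruct tokenizer.
# NOTE: repo id is an assumption; substitute any tokenizer with the llama3.3 template.
llama_tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.3-70B-Instruct")
tokenizer.chat_template = llama_tokenizer.chat_template

messages = [
    {"role": "system", "content": "You are a creative roleplay partner."},
    {"role": "user", "content": "Open the scene in a rain-soaked harbor town."},
]

# Render the prompt with the forced llama3.3 template instead of the default one.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```

The rendered prompt can then be passed to whichever backend is serving the model.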

<img src="https://huggingface.co/divinetaco/L3.3-70B-Lycosa-v0.1/resolve/main/lycosa.png">

### Models Merged

The following models were included in the merge:
* deepseek-ai/DeepSeek-R1-Distill-Llama-70B
* Sao10K/70B-L3.3-Cirrus-x1
* TheDrummer/Nautilus-70B-v0.1
* Doctor-Shotgun/L3.3-70B-Magnum-v4-SE
* SicariusSicariiStuff/Negative_LLAMA_70B

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
# Pivot model
- model: llama-3.3-70b-instruct
# Target models
- model: Sao10K/70B-L3.3-Cirrus-x1
- model: TheDrummer/Nautilus-70B-v0.1
- model: Doctor-Shotgun/L3.3-70B-Magnum-v4-SE
- model: SicariusSicariiStuff/Negative_LLAMA_70B
merge_method: sce
base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
parameters:
  select_topk: 1.0
dtype: bfloat16
```
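
To reproduce or adapt the merge, the config above can be run through mergekit. A rough sketch using mergekit's Python entry point, assuming mergekit is installed and the YAML above has been saved to disk (the filename and output path below are placeholders; check the mergekit repository for the current API):

```python
import yaml
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

CONFIG_YML = "lycosa.yaml"          # the YAML above, saved to disk (placeholder name)
OUTPUT_PATH = "./L3.3-70B-Lycosa"   # where the merged weights land (placeholder path)

# Parse the merge recipe into mergekit's config object.
with open(CONFIG_YML, "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

# Execute the SCE merge; expect substantial disk and memory use for 70B weights.
run_merge(
    merge_config,
    out_path=OUTPUT_PATH,
    options=MergeOptions(
        cuda=True,            # set False to merge on CPU
        copy_tokenizer=True,  # copy a tokenizer into the output directory
        lazy_unpickle=True,   # reduce peak memory while loading shards
    ),
)
```

The `mergekit-yaml` CLI can run the same config if a command-line workflow is preferred.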