---
license: apache-2.0
base_model:
- Qwen/Qwen2.5-7B
pipeline_tag: text-generation
language:
- en
library_name: transformers
tags:
- text-generation-inference
---

## Model Description

Optimized Layer Merging (OLM) is a transformer optimization framework that implements automated layer recombination.

OLM creates a Frankenstein's monster out of language models by cherry-picking the best-performing layers from different models to build a superior hybrid.

The core mechanism (sketched in code after the list below):

- Takes multiple language models as input
- Uses a base model as the foundation
- Iteratively replaces individual layers, evaluating performance on specified datasets
- Keeps the best-performing layer at each position based on metrics like perplexity, exact match, and a custom "quality" score
- Builds a fusion model layer by layer while maintaining or improving performance
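
For intuition, here is a minimal Python sketch of that greedy selection loop. It is not the project's actual implementation: the `perplexity` helper, the perplexity-only scoring, and the `model.layers` access path are assumptions, and all candidate models are assumed to share the base architecture (e.g., Qwen2.5-7B fine-tunes).

```python
import copy

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer


def perplexity(model, tokenizer, texts, device="cuda"):
    """Approximate mean perplexity of `model` over a small evaluation set."""
    model.eval()
    total_nll, total_tokens = 0.0, 0
    with torch.no_grad():
        for text in texts:
            enc = tokenizer(text, return_tensors="pt").to(device)
            out = model(**enc, labels=enc["input_ids"])
            n = enc["input_ids"].numel()
            total_nll += out.loss.item() * n
            total_tokens += n
    return torch.exp(torch.tensor(total_nll / total_tokens)).item()


def olm_merge(base_id, donor_ids, eval_texts, device="cuda"):
    """Greedy layer-by-layer selection: at each position, keep whichever
    candidate layer gives the fused model the best evaluation score."""
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    fused = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16).to(device)
    donors = [AutoModelForCausalLM.from_pretrained(d, torch_dtype=torch.bfloat16) for d in donor_ids]

    for i in range(fused.config.num_hidden_layers):
        # Start from the base model's layer as the current best.
        best_score = perplexity(fused, tokenizer, eval_texts, device)
        best_state = copy.deepcopy(fused.model.layers[i].state_dict())
        for donor in donors:
            # Swap the donor's layer i into the fused model and re-score it.
            fused.model.layers[i].load_state_dict(donor.model.layers[i].state_dict())
            score = perplexity(fused, tokenizer, eval_texts, device)
            if score < best_score:
                best_score = score
                best_state = copy.deepcopy(fused.model.layers[i].state_dict())
        # Restore the best layer found for this position before moving on.
        fused.model.layers[i].load_state_dict(best_state)
    return fused
```

The actual framework ranks candidate layers using the metric mix listed above (perplexity, exact match, and the custom "quality" score) rather than perplexity alone.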

Repository: https://github.com/jeffmeloy/olm