RichardErkhov
/

choprahetarth_-_gemma-merged-one-layer-only-awq

4-bit precision

Model card Files Files and versions Community

RichardErkhov commited on Jan 10

Commit

77e1260

·

verified ·

1 Parent(s): 5e7d552

uploaded readme

Files changed (1) hide show

README.md +69 -0

README.md ADDED Viewed

	@@ -0,0 +1,69 @@

+Quantization made by Richard Erkhov.
+[Github](https://github.com/RichardErkhov)
+[Discord](https://discord.gg/pvy7H8DZMG)
+[Request more models](https://github.com/RichardErkhov/quant_request)
+gemma-merged-one-layer-only - AWQ
+- Model creator: https://huggingface.co/choprahetarth/
+- Original model: https://huggingface.co/choprahetarth/gemma-merged-one-layer-only/
+Original model description:
+---
+base_model:
+- google/gemma-2-2b
+- google/gemma-2-2b-it
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# Untitled Model (1)
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [google/gemma-2-2b](https://huggingface.co/google/gemma-2-2b) as a base.
+### Models Merged
+The following models were included in the merge:
+* [google/gemma-2-2b-it](https://huggingface.co/google/gemma-2-2b-it)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+base_model: google/gemma-2-2b
+dtype: bfloat16
+merge_method: ties
+parameters:
+  int8_mask: 1.0
+  normalize: 1.0
+slices:
+- sources:
+  - layer_range: [0, 26]
+    model: google/gemma-2-2b
+  - layer_range: [0, 26]
+    model: google/gemma-2-2b-it
+    parameters:
+      density:
+      - filter: self_attn.o_proj.9
+        value: 1.0
+      - value: 0.001
+      weight:
+      - value: 1.0
+tokenizer_source: union
+```