merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the passthrough merge method using microsoft/Phi-3.5-mini-instruct as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model: microsoft/Phi-3.5-mini-instruct  # Correct base model identifier
merge_method: passthrough  # Use passthrough to add layers
slices:
  - sources:
      - model: microsoft/Phi-3.5-mini-instruct  # Source model for the base layers
        layer_range: [0, 32]  # Use all existing layers (adjust if the base model has a different number of layers)
  - sources:
      - model: microsoft/Phi-3.5-mini-instruct  # Source model for the additional layers
        layer_range: [24, 32]  # Add 8 new layers initialized from the base model
tokenizer_source: microsoft/Phi-3.5-mini-instruct  # Use the tokenizer from the base model
dtype: float16  # Data type for the merged model
Downloads last month
18
Safetensors
Model size
4.73B params
Tensor type
FP16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for mergekit-community/mergekit-passthrough-smmjedo

Finetuned
(67)
this model