AIM Paper Checkpoints Uploaded For Replication

This repository includes one of the checkpoints used in the paper "Activation-Informed Merging of Large Language Models". Specifics of this model are as follows:

Merging Method: dare_linear
Models Used In Merging
- Base Model: unsloth/llama-2-13b
- Code: layoric/llama-2-13b-code-alpaca
- Math: vanillaOVO/WizardMath-13B-V1.0
AIM: True

Benchmark results and paper details can be found at the official GitHub.

Downloads last month: 29

Safetensors

Model size

13B params

Tensor type

FP16

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for ahn1376/DARETaskArithmetic_Code-Math_AIM

layoric/llama-2-13b-code-alpaca

unsloth/llama-2-13b

vanillaOVO/WizardMath-13B-V1.0

Merge model

this model

Collection including ahn1376/DARETaskArithmetic_Code-Math_AIM

AIM Merged Checkpoints (With AIM)

Collection

The full set of checkpoints merged and with AIM applied, used in Activation Informed Merging (AIM) merging paper experiments. • 21 items • Updated 14 days ago

AIM Paper Checkpoints Uploaded For Replication

Model tree for ahn1376/DARETaskArithmetic___Code-Math___AIM

Collection including ahn1376/DARETaskArithmetic___Code-Math___AIM

Model tree for ahn1376/DARETaskArithmetic_Code-Math_AIM

Collection including ahn1376/DARETaskArithmetic_Code-Math_AIM