--- base_model: - neopolita/jessi-v0.4-falcon3-7b-instruct - tiiuae/Falcon3-7B-Instruct library_name: transformers license: other license_name: falcon-llm-license license_link: https://falconllm.tii.ae/falcon-terms-and-conditions.html tags: - mergekit - merge - falcon3 language: - en - fr - es - pt --- # Merged Model This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ![Falcon-Merge-Logo](falcon-merge.png) ## Merge Details ### Merge Method This model was merged using the SLERP merge method. ### Models Merged The following models were included in the merge: * [neopolita/jessi-v0.4-falcon3-7b-instruct](https://huggingface.co/neopolita/jessi-v0.4-falcon3-7b-instruct) * [tiiuae/Falcon3-7B-Instruct](https://huggingface.co/tiiuae/Falcon3-7B-Instruct) ### Configuration The following YAML configuration was used to produce this model: ```yaml base_model: neopolita/jessi-v0.4-falcon3-7b-instruct dtype: bfloat16 merge_method: slerp parameters: t: - filter: self_attn value: [0.0, 0.5, 0.3, 0.7, 1.0] - filter: mlp value: [1.0, 0.5, 0.7, 0.3, 0.0] - value: 0.5 slices: - sources: - layer_range: [0, 28] model: tiiuae/Falcon3-7B-Instruct - layer_range: [0, 28] model: neopolita/jessi-v0.4-falcon3-7b-instruct ``` Buy Me A Coffee