  • Developed by: seniruk
  • License: apache-2.0
  • Finetuned from model: unsloth/Qwen2.5-Coder-1.5B-Instruct

This Qwen2 model was trained 2x faster with Unsloth and Hugging Face's TRL library.


base_model: unsloth/Qwen2.5-Coder-1.5B-Instruct
tags:
- text-generation-inference
- transformers
- unsloth
- qwen2
- trl
license: apache-2.0
language:
- en
datasets:
- bigcode/commitpackft

Purpose

Generates high-quality commit messages from a given git diff.

Model Description

  • Fine-tuned from Qwen2.5-Coder-1.5B-Instruct on the bigcode/commitpackft dataset for 2 epochs.
  • Trained on a total of 277 languages.
  • Final training loss ranged between 1 and 1.7; the spread arises because the dataset does not contain an equal number of rows for each language.
  • For common languages (Python, Java, JavaScript, C, etc.), the loss reached a minimum of 1.0335.

Environmental Impact

  • Hardware Type: GeForce RTX 4060 Ti (16 GB)
  • Hours used: 10
  • Cloud Provider: none (trained locally)

Results

[Two result images; only the alt text "Logo" survives in this text version.]

Inference input format (mainly relevant when calling the model through an API)

<|im_start|>system
You are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>
<|im_start|>user
{instructions}
{git_diff}<|im_end|>
<|im_start|>assistant

The model then generates the rest of the sequence: {assistant output}<|im_end|>
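The template above can be filled in programmatically. The following is a minimal sketch, not the author's own tooling: the helper names `build_prompt` and `extract_commit_message` are hypothetical, and it assumes the serving API accepts a raw prompt string and returns the raw completion text, which may include the `<|im_end|>` marker.

```python
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"

# System prompt copied verbatim from the template above.
SYSTEM_PROMPT = "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."


def build_prompt(instructions: str, git_diff: str, system: str = SYSTEM_PROMPT) -> str:
    """Fill the template and leave the assistant turn open for generation.

    Hypothetical helper: the name and signature are illustrative only.
    """
    return (
        f"{IM_START}system\n{system}{IM_END}\n"
        f"{IM_START}user\n{instructions}\n{git_diff}{IM_END}\n"
        f"{IM_START}assistant\n"
    )


def extract_commit_message(completion: str) -> str:
    """Keep only the text generated before the first end-of-turn marker.

    Assumes the API returns the raw completion; if it already strips
    special tokens, this simply trims surrounding whitespace.
    """
    return completion.split(IM_END, 1)[0].strip()


if __name__ == "__main__":
    # Illustrative diff; any unified diff text can be passed in.
    diff = "diff --git a/app.py b/app.py\n-print('helo')\n+print('hello')"
    print(build_prompt("Write a commit message for this change.", diff))
```

Leaving the prompt open after `<|im_start|>assistant` is what lets the model produce the commit message as the next tokens; trimming at `<|im_end|>` discards anything emitted past the end of the turn.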

Model Files

  • Format: Safetensors
  • Model size: 1.54B params
  • Tensor type: BF16

Model tree for seniruk/commitGen

  • Base model: Qwen/Qwen2.5-1.5B
  • Finetuned: this model

Dataset used to train seniruk/commitGen

  • bigcode/commitpackft