seniruk
/

commitGen

Model card Files Files and versions Community

commitGen / README.md

seniruk's picture

Update README.md

1b2d0d0 verified about 2 months ago

|

history blame contribute delete

1.67 kB

	---
	datasets:
	- bigcode/commitpackft
	language:
	- en
	base_model:
	- Qwen/Qwen2.5-Coder-1.5B-Instruct
	---
	- Developed by: seniruk
	- License: apache-2.0
	- Finetuned from model : unsloth/Qwen2.5-Coder-1.5B-Instruct

	This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

	---
	base_model: unsloth/Qwen2.5-Coder-1.5B-Instruct
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- qwen2
	- trl
	license: apache-2.0
	language:
	- en
	---

	---
	datasets:
	- bigcode/commitpackft
	---
	# Purpose

	Used for generating high quality commit messages for a given git difference


	### Model Description

	Generated by fine tuning Qwen2.5-Coder-1.5B-Instruct on bigcode/commitpackft dataset for 2 epochs
	Trained on a total of 277 Languages
	Achieved a final training loss in the range of 1- 1.7 (due to data set not containing equal data rows for each language)
	For common languages(python, java ,javascripts,c etc) loss went for a minimum of 1.0335


	## Environmental Impact

	- Hardware Type: geforce RTX 4060 TI - 16GB]
	- Hours used: 10 Hours
	- Cloud Provider: local


	### Results
	![Logo](./image1.png)
	![Logo](./image2.png)

	### Inference input format (If using API mostly)
	```
	<\|im_start\|>system
	You are Qwen, created by Alibaba Cloud. You are a helpful assistant.<\|im_end\|>
	<\|im_start\|>user
	{instructions}
	{git_diff}<\|im_end\|>
	<\|im_start\|>assistant
	```
	And the model will predict the rest of the content -> {assistant output}<\|im_end\|>