DEVCamiloSepulveda committed
Commit c9c7f56 · verified · 1 Parent(s): 3812e8f

Upload folder using huggingface_hub

Files changed (5):
  1. README.md +88 -0
  2. adapter_config.json +37 -0
  3. adapter_model.safetensors +3 -0
  4. pytorch_model.bin +3 -0
  5. vocab.txt +0 -0
README.md ADDED
@@ -0,0 +1,88 @@
+ ---
+ license: llama3.2
+ language:
+ - en
+ base_model: meta-llama/Llama-3.2-1B
+ pipeline_tag: text-classification
+ library_name: peft
+ tags:
+ - regression
+ - story-point-estimation
+ - software-engineering
+ datasets:
+ - talenddataquality
+ - appceleratorstudio
+ metrics:
+ - mae
+ - mdae
+ model-index:
+ - name: llama-3.2-1b-story-point-estimation
+   results:
+   - task:
+       type: regression
+       name: Story Point Estimation
+     dataset:
+       name: appceleratorstudio Dataset
+       type: appceleratorstudio
+       split: test
+     metrics:
+     - type: mae
+       value: 3.211
+       name: Mean Absolute Error (MAE)
+     - type: mdae
+       value: 3.152
+       name: Median Absolute Error (MdAE)
+ ---
+ # LLAMA 3 Story Point Estimator - talenddataquality - appceleratorstudio
+
+ This model is fine-tuned on issue descriptions from talenddataquality and tested on appceleratorstudio for story point estimation.
+
+ ## Model Details
+ - Base Model: LLAMA 3.2 1B
+ - Training Project: talenddataquality
+ - Test Project: appceleratorstudio
+ - Task: Story Point Estimation (Regression)
+ - Architecture: PEFT (LoRA)
+ - Tokenizer: SP WordPiece
+ - Input: Issue titles
+ - Output: Story point estimate (continuous value)
+
+ ## Usage
+ ```python
+ import torch
+ from transformers import AutoModelForSequenceClassification, BertTokenizer
+ from peft import PeftConfig, PeftModel
+
+ # Load the PEFT adapter configuration
+ config = PeftConfig.from_pretrained("DEVCamiloSepulveda/666-LLAMA3SP-talenddataquality-appceleratorstudio")
+
+ # Load tokenizer (WordPiece vocab shipped with this repo) and base model
+ tokenizer = BertTokenizer('vocab.txt')
+ base_model = AutoModelForSequenceClassification.from_pretrained(
+     config.base_model_name_or_path,
+     num_labels=1,
+     torch_dtype=torch.float16,
+     device_map='auto'
+ )
+
+ # Attach the fine-tuned LoRA adapter
+ model = PeftModel.from_pretrained(base_model, "DEVCamiloSepulveda/666-LLAMA3SP-talenddataquality-appceleratorstudio")
+
+ # Prepare input text (truncated to the 20-token training length)
+ text = "Your issue description here"
+ inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=20, padding="max_length")
+
+ # Get prediction: the single regression logit is the story point estimate
+ outputs = model(**inputs)
+ story_points = outputs.logits.item()
+ ```
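+
+ To score several issues at once, the same pipeline applies batch-wise; a minimal sketch, assuming the `tokenizer` and `model` loaded above (the example titles are illustrative placeholders):
+
+ ```python
+ # Batch scoring sketch; assumes `tokenizer` and `model` from the snippet
+ # above are already loaded. The issue titles are placeholders.
+ titles = [
+     "Fix NPE in data profiling wizard",
+     "Add export option for analysis results",
+ ]
+ batch = tokenizer(titles, return_tensors="pt", truncation=True,
+                   max_length=20, padding="max_length")
+ with torch.no_grad():
+     logits = model(**batch).logits  # shape: (batch_size, 1)
+ story_points = logits.squeeze(-1).tolist()
+ ```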
+
+ ## Training Details
+ - Fine-tuning method: LoRA (Low-Rank Adaptation)
+ - Sequence length: 20 tokens
+ - Best training epoch: 6 / 20 epochs
+ - Batch size: 32
+ - Training time: 484.246 seconds
+ - Mean Absolute Error (MAE): 3.211
+ - Median Absolute Error (MdAE): 3.152
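+
+ For reference, a minimal sketch of how the two reported error metrics are defined; the prediction and label arrays below are placeholders, not the actual test data:
+
+ ```python
+ # MAE / MdAE definition sketch; `preds` and `labels` are placeholder
+ # values, not the real appceleratorstudio test split.
+ import numpy as np
+
+ preds = np.array([2.5, 5.0, 1.0])
+ labels = np.array([3.0, 8.0, 1.0])
+
+ abs_err = np.abs(preds - labels)
+ mae = abs_err.mean()        # Mean Absolute Error
+ mdae = np.median(abs_err)   # Median Absolute Error
+ ```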
+
+ ### Framework versions
+
+ - PEFT 0.14.0
adapter_config.json ADDED
@@ -0,0 +1,37 @@
+ {
+   "alpha_pattern": {},
+   "auto_mapping": null,
+   "base_model_name_or_path": "meta-llama/Llama-3.2-1B",
+   "bias": "none",
+   "eva_config": null,
+   "exclude_modules": null,
+   "fan_in_fan_out": false,
+   "inference_mode": true,
+   "init_lora_weights": true,
+   "layer_replication": null,
+   "layers_pattern": null,
+   "layers_to_transform": null,
+   "loftq_config": {},
+   "lora_alpha": 16,
+   "lora_bias": false,
+   "lora_dropout": 0.1,
+   "megatron_config": null,
+   "megatron_core": "megatron.core",
+   "modules_to_save": [
+     "classifier",
+     "score"
+   ],
+   "peft_type": "LORA",
+   "r": 8,
+   "rank_pattern": {},
+   "revision": null,
+   "target_modules": [
+     "q_proj",
+     "o_proj",
+     "k_proj",
+     "v_proj"
+   ],
+   "task_type": "SEQ_CLS",
+   "use_dora": false,
+   "use_rslora": false
+ }
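
For reference, a minimal sketch of how an adapter with this configuration could be instantiated in PEFT; the hyperparameters mirror the JSON above, while the training loop, optimizer, and data handling are omitted assumptions:

```python
# Sketch: recreate the adapter setup described by adapter_config.json.
# Hyperparameters come from the JSON above; everything else is assumed.
import torch
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, get_peft_model

base_model = AutoModelForSequenceClassification.from_pretrained(
    "meta-llama/Llama-3.2-1B",
    num_labels=1,                 # single regression output (story points)
    torch_dtype=torch.float16,
)

lora_config = LoraConfig(
    task_type="SEQ_CLS",          # sequence classification/regression head
    r=8,
    lora_alpha=16,
    lora_dropout=0.1,
    bias="none",
    target_modules=["q_proj", "o_proj", "k_proj", "v_proj"],
    modules_to_save=["classifier", "score"],  # keep the head trainable
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()
```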
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6e10c07641a02af10de588627a97219ddf2caade4e93f834485ddeeb0f6534fd
+ size 6840816
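
This is a Git LFS pointer, not the weights themselves; the actual file is fetched on checkout or download. A minimal sketch of verifying a downloaded copy against the sha256 recorded above (the local file path is an assumption):

```python
# Verify a downloaded adapter_model.safetensors against the sha256 oid
# in the LFS pointer above. The local path is assumed for illustration.
import hashlib

expected = "6e10c07641a02af10de588627a97219ddf2caade4e93f834485ddeeb0f6534fd"

sha = hashlib.sha256()
with open("adapter_model.safetensors", "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):
        sha.update(chunk)

assert sha.hexdigest() == expected, "checksum mismatch"
print("OK:", sha.hexdigest())
```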
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5c02d56c991c7999d6a3229068ea76b7d8da4fa663b8913f3152ae08ae74d048
+ size 1560270490
vocab.txt ADDED
The diff for this file is too large to render. See raw diff