Surabhi-K/code_llama_library2

Generated from Trainer

Surabhi-K committed on Apr 17, 2024

Commit c9e7d5c (verified) · 1 Parent(s): 8e065a1

Delete README (1).md

Files changed (1)

README (1).md +0 -69 (DELETED)

@@ -1,69 +0,0 @@
----
-license: llama2
-library_name: peft
-tags:
-- generated_from_trainer
-base_model: codellama/CodeLlama-7b-hf
-model-index:
-- name: working
-  results: []
----
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# working
-This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.1536
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 3
-- eval_batch_size: 3
-- seed: 42
-- gradient_accumulation_steps: 5
-- total_train_batch_size: 15
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 20
-- num_epochs: 7
-- mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 2.0255        | 1.0   | 63   | 0.5661          |
-| 0.3616        | 2.0   | 126  | 0.3047          |
-| 0.1979        | 3.0   | 189  | 0.2129          |
-| 0.1565        | 4.0   | 252  | 0.1817          |
-| 0.1409        | 5.0   | 315  | 0.1644          |
-| 0.1319        | 6.0   | 378  | 0.1561          |
-| 0.1277        | 7.0   | 441  | 0.1536          |
-### Framework versions
-- PEFT 0.7.1
-- Transformers 4.36.2
-- Pytorch 2.1.2
-- Datasets 2.15.0
-- Tokenizers 0.15.2
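
The hyperparameters in the deleted card are related by simple arithmetic, which the sketch below spells out. It is a minimal illustration, not the training script used for this model: the function names are hypothetical, a single training device is assumed (the card does not state the device count), and the learning-rate formula follows the usual linear warmup then linear decay rule that `lr_scheduler_type: linear` refers to in Transformers.

```python
import math

# Values copied from the deleted model card.
TRAIN_BATCH_SIZE = 3       # train_batch_size
GRAD_ACCUM_STEPS = 5       # gradient_accumulation_steps
LEARNING_RATE = 5e-05      # learning_rate
WARMUP_STEPS = 20          # lr_scheduler_warmup_steps
NUM_EPOCHS = 7             # num_epochs
STEPS_PER_EPOCH = 63       # results table: step 63 is reached at epoch 1.0

# Assumption: one device. The card does not say how many were used.
NUM_DEVICES = 1


def effective_batch_size(per_device: int, accum_steps: int, devices: int = 1) -> int:
    """Number of examples that contribute to one optimizer update."""
    return per_device * accum_steps * devices


def linear_schedule_lr(step: int, base_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Linear warmup from 0 to base_lr, then linear decay back to 0."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# 3 * 5 * 1 = 15, the total_train_batch_size listed in the card.
total_batch = effective_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS, NUM_DEVICES)

# 7 * 63 = 441, the final Step in the training results table.
total_steps = NUM_EPOCHS * STEPS_PER_EPOCH

# Learning rate at the end of each epoch under the assumed schedule.
lr_by_epoch = {
    epoch: linear_schedule_lr(epoch * STEPS_PER_EPOCH, LEARNING_RATE, WARMUP_STEPS, total_steps)
    for epoch in range(1, NUM_EPOCHS + 1)
}
```

Under these assumptions the learning rate peaks at step 20 and reaches zero at step 441, so the shrinking per-epoch gains in validation loss toward the end of the table coincide with a learning rate that is itself approaching zero.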