trichter committed on
Commit e6abf8e · verified · 1 Parent(s): a82d1e2

Create README.md

Files changed (1)
  1. README.md +32 -0
README.md ADDED
@@ -0,0 +1,32 @@
+ ---
+ language:
+ - en
+ base_model:
+ - google-t5/t5-large
+ ---
+ Model: t5-DistillingSbS-ABSA
+
+ Task: Aspect-Based Sentiment Analysis (ABSA) - specifically, Aspect Pair Sentiment Extraction
+
+ Technique: Distilling Step-by-Step (DistillingSbS)
+
+ Model Description
+ t5-DistillingSbS-ABSA is a fine-tuned t5-large model for Aspect-Based Sentiment Analysis (ABSA), specifically the task of Aspect Pair Sentiment Extraction.
+ I used a training approach called Distilling Step-by-Step, originally proposed in [this paper](https://arxiv.org/abs/2305.02301) by Hsieh et al. at Google Research.
+
+ Data Annotation: The reviews were initially unannotated; aspect-sentiment pairs and rationales were labeled using the OpenAI API.
+
+ Dataset
+ The dataset consists of customer reviews of mobile apps that were originally unannotated. The data was annotated via the OpenAI API: each review was labeled with specific aspects (e.g., UI, functionality, performance) and the corresponding sentiment (positive, negative, or neutral). In addition, sentence-long rationales were extracted to justify each aspect-sentiment pair; these rationales serve as extra supervision in Distilling Step-by-Step training.
+
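For illustration, one annotated review could be represented as in the sketch below. The `AspectAnnotation` class, its field names, the example review, and the `aspect: sentiment` target format are all assumptions made for this example; the README does not specify the actual schema.

```python
from dataclasses import dataclass

# Sentiment classes named in the dataset description.
SENTIMENTS = {"positive", "negative", "neutral"}


@dataclass(frozen=True)
class AspectAnnotation:
    """One aspect-sentiment pair with its rationale (hypothetical schema)."""
    aspect: str
    sentiment: str
    rationale: str

    def __post_init__(self):
        # Reject labels outside the three sentiment classes.
        if self.sentiment not in SENTIMENTS:
            raise ValueError(f"unknown sentiment: {self.sentiment!r}")


def to_label_target(annotations):
    """Serialize the pairs into one target string (the format is an assumption)."""
    return "; ".join(f"{a.aspect}: {a.sentiment}" for a in annotations)


# Made-up review, for illustration only.
review = "The new UI looks great, but the app crashes on startup."
annotations = [
    AspectAnnotation("UI", "positive", "The reviewer says the new UI looks great."),
    AspectAnnotation("performance", "negative", "The reviewer reports crashes on startup."),
]
print(to_label_target(annotations))
```

Keeping the rationale next to each pair makes it straightforward to build both a label target and a rationale target from the same record.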
+ Training was performed with Hugging Face's Trainer API in Google Colaboratory on 1 A100 GPU with 40 GB of VRAM.
+ Training took around 6 hours and cost about 80 compute units.
+ It used a custom loss function, tokenization function, and training loop. All code can be found in [my GitHub repository](https://github.com/trichter93/ABSA-LLMs-DistillingSbS/).
+
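In Distilling Step-by-Step as described by Hsieh et al., the model is trained on two tasks at once, predicting the label and generating the rationale, and the two losses are combined as a weighted sum. A minimal sketch of that idea follows. The prefix strings, the function names, and the default weight are assumptions for illustration; the custom loss in the linked repository may be implemented differently.

```python
def build_multitask_examples(review, label_target, rationale_target):
    """Turn one annotated review into two text-to-text training examples.

    The task prefixes tell the model which output is expected
    (the exact prefix strings are an assumption).
    """
    return [
        {"input": f"[label] {review}", "target": label_target},
        {"input": f"[rationale] {review}", "target": rationale_target},
    ]


def combined_loss(label_loss, rationale_loss, rationale_weight=1.0):
    """Weighted sum of the two task losses: L_label + weight * L_rationale.

    Works on plain floats and on tensor types that support + and *.
    """
    return label_loss + rationale_weight * rationale_loss


examples = build_multitask_examples(
    "The app crashes on startup.",             # made-up review
    "performance: negative",                   # hypothetical label target
    "The reviewer reports crashes on startup.",
)
total = combined_loss(0.5, 0.25, rationale_weight=2.0)
```

At inference time only the label task is needed, so the rationale is a training-time signal and adds no cost to prediction.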
+ Hyperparameters
+ Some of the key hyperparameters used for fine-tuning:
+
+ Batch Size: 3
+ Learning Rate: 1e-4
+ Epochs: 5
+ Max Sequence Length: 512
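
The values listed above could be expressed with the argument names used by Hugging Face `transformers`, roughly as sketched here. Treating the batch size as per device, enabling truncation, and the `optimizer_steps` helper are assumptions; the README does not say how the values were passed or whether gradient accumulation was used.

```python
import math

# Keyword arguments that could be passed on as, for example,
# Seq2SeqTrainingArguments(output_dir=..., **training_kwargs).
training_kwargs = {
    "per_device_train_batch_size": 3,  # assumes "Batch Size" is per device (one GPU was used)
    "learning_rate": 1e-4,
    "num_train_epochs": 5,
}

# Keyword arguments for the tokenizer call; truncation=True is an assumption.
tokenizer_kwargs = {"max_length": 512, "truncation": True}


def optimizer_steps(num_examples, batch_size, epochs):
    """Number of optimizer steps, assuming no gradient accumulation (hypothetical helper)."""
    return math.ceil(num_examples / batch_size) * epochs
```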