Add model documentation
Browse files
README.md
CHANGED
@@ -19,8 +19,8 @@ Fine-tuned Pythia-14m model for HDFS log analysis, specifically for anomaly dete
|
|
19 |
|
20 |
## Model Description
|
21 |
|
22 |
-
This model is fine-tuned from EleutherAI/pythia-14m for analyzing HDFS log sequences. It's designed to understand and predict patterns in
|
23 |
-
HDFS log data so that we can detect anomalies using the perplexity of the log sequence.
|
24 |
so we can use it to validate that the model can predict anomalies.
|
25 |
|
26 |
We will use this model to understand the ability of a small model to predict anomalies in a specific dataset. We will study model scale
|
@@ -33,7 +33,7 @@ effectively predict anomalies. We will then attempt build a model that is more
|
|
33 |
- Batch size: 4
|
34 |
- Max sequence length: 343
|
35 |
- Learning rate: 0.0001
|
36 |
-
- Training steps:
|
37 |
|
38 |
## Special Tokens
|
39 |
- Added `<|sep|>` token for event ID separation
|
|
|
19 |
|
20 |
## Model Description
|
21 |
|
22 |
+
This model is fine-tuned from `EleutherAI/pythia-14m` for analyzing HDFS log sequences. It's designed to understand and predict patterns in
|
23 |
+
HDFS log data so that we can detect anomalies using the perplexity of the log sequence. THhe HDFS sequence is handy because it has labels
|
24 |
so we can use it to validate that the model can predict anomalies.
|
25 |
|
26 |
We will use this model to understand the ability of a small model to predict anomalies in a specific dataset. We will study model scale
|
|
|
33 |
- Batch size: 4
|
34 |
- Max sequence length: 343
|
35 |
- Learning rate: 0.0001
|
36 |
+
- Training steps: 2110
|
37 |
|
38 |
## Special Tokens
|
39 |
- Added `<|sep|>` token for event ID separation
|