Add model documentation
README.md (changed)
```diff
@@ -31,9 +31,9 @@ effectively predict anomalies. We will then attempt to build a model that is more
 - Base model: EleutherAI/pythia-14m
 - Dataset: https://zenodo.org/records/8196385/files/HDFS_v1.zip?download=1 + preprocessed data at honicky/log-analysis-hdfs-preprocessed
 - Batch size: 4
-- Max sequence length:
+- Max sequence length: 405
 - Learning rate: 0.0001
-- Training steps:
+- Training steps: 12000
 
 ## Special Tokens
 - Added `<|sep|>` token for event ID separation
@@ -46,6 +46,6 @@ This model is intended for:
 
 ## Limitations
 - Model is specifically trained on HDFS logs and may not generalize to other log formats
-- Limited to the context window size of
+- Limited to the context window size of 405 tokens
 
 
```
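The `<|sep|>` token documented above separates event IDs within a block's log sequence. The sketch below is a minimal illustration of that serialization; the join-based format and the specific event IDs are assumptions for illustration, not taken from the commit:

```python
# Hypothetical illustration of serializing an HDFS block's event-ID sequence
# with the `<|sep|>` separator token described in the model card.
SEP = "<|sep|>"

def serialize_events(event_ids):
    """Join a block's event IDs into a single training string."""
    return SEP.join(event_ids)

# Example event IDs (illustrative only)
events = ["E5", "E22", "E5", "E11", "E9"]
print(serialize_events(events))  # E5<|sep|>E22<|sep|>E5<|sep|>E11<|sep|>E9
```

Strings serialized this way would then be tokenized and truncated to the documented maximum sequence length of 405 tokens before training.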