danj0nes/dropout_gpt2

danj0nes committed on Jan 8, 2024

Commit 9844405 · 1 Parent(s): 88f1163

Update README.md

Files changed (1)

README.md +7 -22

@@ -4,30 +4,19 @@ base_model: gpt2
 tags:
 - generated_from_trainer
 model-index:
-- name: dropout_c-gpt2_s-0
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# dropout_c-gpt2_s-0
-This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -42,13 +31,9 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - training_steps: 4000
-### Training results
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
 - Datasets 2.16.1
-- Tokenizers 0.15.0

 tags:
 - generated_from_trainer
 model-index:
+- name: dropout_gpt2
   results: []
+language:
+- en
 ---
+# dropout_gpt2
+This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on a English Wikipedia dataset.
 ## Model description
+Dropout debiased GPT-2 using the hyperparameters specified in [Measuring and Reducing Gendered Correlations in Pre-trained Models (Webster et al. 2021)](https://arxiv.org/abs/2010.06032).
 ### Training hyperparameters
 - lr_scheduler_type: linear
 - training_steps: 4000
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
 - Datasets 2.16.1
+- Tokenizers 0.15.0
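
The hyperparameters kept in the card list a linear learning-rate scheduler over 4000 training steps. As a rough illustration of what that schedule implies, the sketch below computes the learning rate at a given step in plain Python. The base learning rate and warmup length are not visible in this diff, so `base_lr` and `warmup_steps` are hypothetical placeholders, and `linear_lr` is an illustrative helper, not a function from the Trainer.

```python
def linear_lr(step: int, base_lr: float, total_steps: int = 4000,
              warmup_steps: int = 0) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay to zero.

    `total_steps` defaults to the 4000 training steps listed in the card.
    `base_lr` and `warmup_steps` are assumptions: the diff does not show them.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Hypothetical base learning rate, chosen only for illustration.
    for step in (0, 1000, 2000, 4000):
        print(step, linear_lr(step, base_lr=5e-5))
```

With no warmup, the rate falls to half of `base_lr` at step 2000 and reaches zero at step 4000.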