bhargavis
/

fewshot-xsum-bart

Model card Files Files and versions Community

bhargavis commited on Feb 15

Commit

bb6bddd

·

verified ·

1 Parent(s): 3ea03e2

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -42,6 +42,8 @@ The small dataset size is intentional, as the focus is on few-shot learning rath
     - Max Input Length: 512 tokens
     - Max Output Length: 64 tokens
 ### Performance
 Due to the few-shot nature of this model, its performance is not directly comparable to models trained on the full XSUM dataset. However, it demonstrates the potential of few-shot learning for summarization tasks. Key metrics on the validation set (50 samples) include:
@@ -87,9 +89,6 @@ print(summary[0]["summary_text"])
 - The model is fine-tuned on BBC articles from the XSUM dataset. Its performance may vary on text from other domains.
 - The model may overfit to the training data due to the small dataset size.
-##### Full-Shot learning model- For a more general-purpose summarization model, check out the full model trained on the entire XSUM dataset: [fulltrain-xsum-bart](https://huggingface.co/bhargavis/fulltrain-xsum-bart).
 ### Citation
 If you use this model in your research please cite it as follows:

     - Max Input Length: 512 tokens
     - Max Output Length: 64 tokens
+##### Full-Shot learning model- For a more general-purpose summarization model, check out the full model trained on the entire XSUM dataset: [fulltrain-xsum-bart](https://huggingface.co/bhargavis/fulltrain-xsum-bart).
 ### Performance
 Due to the few-shot nature of this model, its performance is not directly comparable to models trained on the full XSUM dataset. However, it demonstrates the potential of few-shot learning for summarization tasks. Key metrics on the validation set (50 samples) include:
 - The model is fine-tuned on BBC articles from the XSUM dataset. Its performance may vary on text from other domains.
 - The model may overfit to the training data due to the small dataset size.
 ### Citation
 If you use this model in your research please cite it as follows: