Commit 41c6323
1 Parent(s): 2f63d6c
Update README.md
README.md CHANGED
---
title: README
emoji: 🏃
colorFrom: gray
colorTo: purple
sdk: static
pinned: false
---

# Model Description
TinyClinicalBERT is a distilled version of [BioClinicalBERT](https://huggingface.co/emilyalsentzer/Bio_ClinicalBERT), trained for 3 epochs of distillation with a total batch size of 192 on the MIMIC-III clinical notes dataset.
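As a quick usage sketch, the model can be loaded through the standard `transformers` Auto classes (the repo id below is an assumption for illustration; substitute this model's actual Hugging Face id):

```python
# Minimal usage sketch; the repo id is an assumption, not stated in this card.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("nlpie/tiny-clinicalbert")
model = AutoModel.from_pretrained("nlpie/tiny-clinicalbert")

inputs = tokenizer("Patient presents with chest pain.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, sequence_length, hidden_size)
```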
# Distillation Procedure
This model was trained with a layer-to-layer distillation method called ‘transformer-layer distillation’, applied at each layer of the student to align its attention maps and hidden states with those of the teacher.
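As a rough sketch of what this objective can look like for one layer (hypothetical code, assuming each student layer is mapped to a fixed teacher layer, the same number of attention heads on both sides, and a learned linear projection to bridge the hidden-dimension gap):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LayerDistillationLoss(nn.Module):
    """Sketch: align one student layer with its mapped teacher layer."""

    def __init__(self, student_dim: int = 312, teacher_dim: int = 768):
        super().__init__()
        # Learned projection so the student's hidden states can be
        # compared with the teacher's higher-dimensional ones.
        self.proj = nn.Linear(student_dim, teacher_dim)

    def forward(self, student_hidden: torch.Tensor, teacher_hidden: torch.Tensor,
                student_attn: torch.Tensor, teacher_attn: torch.Tensor) -> torch.Tensor:
        # Hidden-state alignment: tensors of shape (batch, seq, dim).
        hidden_loss = F.mse_loss(self.proj(student_hidden), teacher_hidden)
        # Attention-map alignment: tensors of shape (batch, heads, seq, seq),
        # assuming student and teacher use the same number of heads.
        attn_loss = F.mse_loss(student_attn, teacher_attn)
        return hidden_loss + attn_loss
```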
# Architecture and Initialisation
This model uses 4 hidden layers with a hidden dimension and embedding size of 312, resulting in a total of 15M parameters. Because this hidden dimension is smaller than the teacher's, the student cannot be initialised from the teacher's weights and is instead randomly initialised.
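For reference, the architecture can be written down as a `transformers` config; the vocabulary size, head count, and feed-forward size below are assumptions, since this card does not state them:

```python
from transformers import BertConfig, BertModel

config = BertConfig(
    vocab_size=28996,        # assumed: the bert-base-cased vocabulary
    hidden_size=312,         # hidden dimension and embedding size from this card
    num_hidden_layers=4,     # from this card
    num_attention_heads=12,  # assumed
    intermediate_size=1200,  # assumed
)
model = BertModel(config)    # randomly initialised, as described above

n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.1f}M parameters")  # close to the 15M reported above;
                                            # the exact value depends on the assumed sizes
```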
# Citation
If you use this model, please consider citing the following paper:
```bibtex
```