SRDdev commited on
Commit
8cc4b26
·
1 Parent(s): d560399

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +62 -0
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: afl-3.0
3
+ datasets:
4
+ - WillHeld/hinglish_top
5
+ language:
6
+ - en
7
+ metrics:
8
+ - accuracy
9
+ library_name: transformers
10
+ pipeline_tag: fill-mask
11
+ ---
12
+
13
+ ### HingMaskedLM
14
+ This is a BERT model trained for Masked Language Modeling for Hinglish Data.
15
+
16
+ ### Dataset
17
+ Hinglish-Top [Dataset](https://huggingface.co/datasets/WillHeld/hinglish_top) columns
18
+ - en_query
19
+ - cs_query
20
+ - en_parse
21
+ - cs_parse
22
+ - domain
23
+
24
+ ### Training
25
+ |Epoch|Loss|
26
+ |:--:|:--:|
27
+ |1 |0.0465|
28
+ |2 |0.0262|
29
+ |3 |0.0116|
30
+ |4 |0.00385|
31
+ |5 |0.0103|
32
+ |6 |0.00738|
33
+ |7 |0.00892|
34
+ |8 |0.00379|
35
+ |9 |0.00126|
36
+ |10 |0.000684|
37
+
38
+
39
+ ### Inference
40
+ ```python
41
+ from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline
42
+
43
+ tokenizer = AutoTokenizer.from_pretrained("SRDdev/HingMaskedLM")
44
+
45
+ model = AutoModelForMaskedLM.from_pretrained("SRDdev/HingMaskedLM")
46
+
47
+ fill = pipeline('fill-mask', model=model, tokenizer=tokenizer)
48
+ ```
49
+ ```python
50
+ fill(f'please {fill.tokenizer.mask_token} ko cancel kardo')
51
+ ```
52
+
53
+ ### Citation
54
+ Author: @[SRDdev](https://huggingface.co/SRDdev)
55
+ ```
56
+ Name : Shreyas Dixit
57
+ framework : Pytorch
58
+ Year: Jan 2023
59
+ Pipeline : fill-mask
60
+ Github : https://github.com/SRDdev
61
+ LinkedIn : https://www.linkedin.com/in/srddev/
62
+ ```