haryoaw committed · Commit ec17889 · Parent(s): c671738

Update README.md

Files changed (1): README.md (+14 -4)

README.md CHANGED
@@ -8,9 +8,9 @@ license: mit
 
 # Indonesia Recipe Ingredients Generator Model
 
-**WARNING: inference on Huggingface might not run since the tokenizer is not the default one. Currently, I want to build a spaces to run the inference.
-Please wait for it**
+**WARNING: inference on Huggingface might not run, since the tokenizer used is not the default `transformers` tokenizer.**
 
+Feel free to test the model [in this space](https://huggingface.co/spaces/haryoaw/id-recigen).
 
 😎 **Have fun generating ingredients** 😎
 
@@ -33,6 +33,9 @@ Since we use `indobart-v2`, we need to use their tokenizer.
 
 First, install the tokenizer by doing `pip install indobenchmark-toolkit`.
 
+
+
+
 After that, you can load the tokenizer:
 
 ```python
@@ -41,6 +44,11 @@ from indobenchmark.tokenization_indonlg import IndoNLGTokenizer
 tokenizer = IndoNLGTokenizer.from_pretrained("haryoaw/id-recigen-bart")
 ```
 
+**EDIT**:
+
+It seems that the tokenizer in the package is not the same as the one that I used to finetune the model.
+There are some noticeable bugs, such as some subword tokens not being treated as subwords. Nevertheless, it still works!
+
 ### Model
 
 The model can be loaded by using AutoModel.
@@ -52,7 +60,9 @@ model = AutoModelForSeq2SeqLM.from_pretrained("haryoaw/id-recigen-bart")
 ```
 
 
-## Example of input
+## Input Example
+
+Make sure to input a **LOWERCASE** food name. The tokenizer is case-sensitive!
 
 ```
 sayur asam
@@ -62,6 +72,6 @@ sayur asam
 nasi goreng ayam
 ```
 
-~To be continued
+~To be continued...