Saibo-backup committed
Commit 2f4db23 · 1 parent: 4d5cbc9

put on bold up to truncation

Files changed (1): app.py (+5 -2)
app.py CHANGED
@@ -72,15 +72,18 @@ if __name__ == "__main__":
         gr.Markdown(
             """
             # 👻 Transformers-CFG JSON Demo
-            This is a demo of how you can constrain the output of a GPT-2 model to be a valid JSON string(up to truncation).
+            This is a demo of how you can constrain the output of a GPT-2 model to be a *valid* JSON string (*up to max length truncation*).
             Here we use a simple JSON grammar to constrain the output of the model.
-            The grammar is defined in `json_minimal.ebnf` and is written in the Extended Backus-Naur Form (EBNF).
+            The grammar is defined in `json_minimal.ebnf` and is written in the **Extended Backus-Naur Form (EBNF)**.
 
             Internally, it relies on the library [`transformers-cfg`](https://github.com/epfl-dlab/transformers-CFG).
             For demo purposes, gpt2-large is used, but you can use much larger models for better performance.
 
             The inference is a bit slow because it is run on **CPU** (~20s for 30 tokens).
             The constraint itself **doesn't** introduce significant overhead to the inference.
+
+            The output may be *truncated* to 30 tokens due to the limit on the maximum output length.
+            In practice, with a sufficiently large `max_length` parameter, your JSON output will be *complete* and *valid*.
             """
         )
 
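For context on the demo text edited above, here is a minimal sketch of grammar-constrained generation with `transformers-cfg`, following the usage pattern in the library's README. The prompt and `max_length` value are illustrative, and the sketch assumes `json_minimal.ebnf` uses `root` as its start rule; the Space's actual `app.py` may differ in detail.

```python
# A minimal sketch of grammar-constrained decoding with transformers-cfg,
# following the library's README; exact module paths may vary by version.
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers_cfg.grammar_utils import IncrementalGrammarConstraint
from transformers_cfg.generation.logits_process import GrammarConstrainedLogitsProcessor

tokenizer = AutoTokenizer.from_pretrained("gpt2-large")
model = AutoModelForCausalLM.from_pretrained("gpt2-large")

# Parse the EBNF grammar; "root" is assumed to be the start rule.
with open("json_minimal.ebnf") as f:
    grammar_str = f.read()
grammar = IncrementalGrammarConstraint(grammar_str, "root", tokenizer)

# At each decoding step, the processor masks every token that would
# take the output outside the JSON grammar.
grammar_processor = GrammarConstrainedLogitsProcessor(grammar)

prompt = "This is a valid JSON string for a person:"  # illustrative prompt
input_ids = tokenizer([prompt], return_tensors="pt").input_ids

output = model.generate(
    input_ids,
    max_length=60,  # larger than the demo's 30 tokens, so the JSON can close
    logits_processor=[grammar_processor],
    repetition_penalty=1.1,
)
print(tokenizer.batch_decode(output, skip_special_tokens=True)[0])
```

Because the processor only filters the logits at each step, it adds little overhead on top of ordinary decoding, which matches the note above that the constraint itself doesn't slow inference significantly; a `max_length` above the Space's 30-token budget is what lets the grammar run to a complete, valid JSON object instead of being cut off mid-string.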