Update app.py
Lower max_new_tokens to 250. It still takes an eternity to respond with the 135M model.
app.py
CHANGED
@@ -4,7 +4,7 @@ import gradio as gr
 from transformers import pipeline
 import torch
 
-MAX_NEW_TOKENS =
+MAX_NEW_TOKENS = 250
 
 MODEL="HuggingFaceTB/SmolLM2-135M-Instruct"
 # MODEL="HuggingFaceTB/SmolLM2-360M-Instruct"
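For context, here is a minimal sketch of how a constant like MAX_NEW_TOKENS is typically wired into a transformers text-generation pipeline behind a Gradio app. The rest of app.py is not visible in this diff, so the respond function, the pipeline task, and the Interface setup below are assumptions rather than the Space's actual code:

```python
import gradio as gr
from transformers import pipeline
import torch

MAX_NEW_TOKENS = 250

MODEL = "HuggingFaceTB/SmolLM2-135M-Instruct"
# MODEL = "HuggingFaceTB/SmolLM2-360M-Instruct"

# Assumed wiring: build the pipeline once at startup and cap generation
# length so responses stop after MAX_NEW_TOKENS generated tokens.
generator = pipeline(
    "text-generation",
    model=MODEL,
    torch_dtype=torch.float32,  # assumption: CPU-only Space
)

def respond(prompt: str) -> str:
    # max_new_tokens bounds how long generation can run; lowering it is the
    # simplest lever against slow responses from a small CPU-bound model.
    outputs = generator(prompt, max_new_tokens=MAX_NEW_TOKENS)
    return outputs[0]["generated_text"]

demo = gr.Interface(fn=respond, inputs="text", outputs="text")
demo.launch()
```

With this kind of setup, the generation-length cap is the main knob for latency on CPU, since each new token costs a full forward pass of the model.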