Commit History

keep debug message
37f7787
Running

Luigi commited on

add debug to show web resarch result
a2f07a4

Luigi commited on

give 1 second for web search to grab data
9ad3ffd

Luigi commited on

inject web search result if web search enabled
bc257ff

Luigi commited on

refactor(app): improve streaming, background search, dtype fallback, and cleanup :contentReference[oaicite:0]{index=0}
293686e

Luigi commited on

bugfixc: not using pipeline for response generation
939895d

Luigi commited on

Add original SmolLM2 135M Instruct for comparaison
423dc1a

Luigi commited on

Add SmolLM2-135M-Instruct-TaiwanChat
38fcc03

Luigi commited on

Add SmolLM2-135M TaiwanChat
0d642b7

Luigi commited on

default to gemma-3-4b
88a6a62

Luigi commited on

model repo_id typo fix
89372fa

Luigi commited on

enable web search by default
6235e63

Luigi commited on

remove tinyllama which has bad response quality
a22cf42

Luigi commited on

make streaming response
5ea073d

Luigi commited on

apply history flatten before it goint to prompt
ef361b0

Luigi commited on

better management on system prompt
5f6306a

Luigi commited on

usue chat pipeline instead of model and tokenizer individually
ac8e9cc

Luigi commited on

bugfix to padding-related issues
f248fec

Luigi commited on

add attention mask
b6b3940

Luigi commited on

Clean model description
4731160

Luigi commited on

pin torch to 2.4.0
4c6b4c5

Luigi commited on

enable zerogpu with decorator
c5b2897

Luigi commited on

Apply ZeroGPU
d181b45

Luigi commited on

switch to gradio version for stability reason
a703203

Luigi commited on

add taiwan tinyllama
794ee70

Luigi commited on

Improve responsiveness by asynchronously retrieving web search context
acda3f1

Luigi commited on

add llama 3.2 taiwan 3b
b1544e2

Luigi commited on

add minicpm3 4b
f5c0811

Luigi commited on

increase xt length to max
629495e

Luigi commited on

remove all moe
fafc8cb

Luigi commited on

remove qwen 1.5 moe
6735035

Luigi commited on

adjust title style
e9559bd

Luigi commited on

use another version of qwen 1.5 moe
96e60d6

Luigi commited on

add Qwen1.5-MoE
e17afaf

Luigi commited on

Qwen2.5-MOE-6x1.5B
5eca666

Luigi commited on

remove under 3b models
617be26

Luigi commited on

Add model caching
d33dfcd

Luigi commited on

UI/UX Improvement
eb215ff

Luigi commited on

reset timeout timer once a new token is generated
35943b1

Luigi commited on

open web search settgins to user
c9fd924

Luigi commited on

add 2 more models
f7a541f

Luigi commited on

apply new settings on duckduck search
d9421eb

Luigi commited on

tune llama paramters
20484f3

Luigi commited on

increase max_chars_per_result to 600
1155897

Luigi commited on

increase max results to 6 for better web search
0c2fe1d

Luigi commited on

increase ctx lenght to 2k
9ba47d1

Luigi commited on

increase timeout to 5min
71d28c5

Luigi commited on

Code simplification
248f5a7

Luigi commited on

Enable speculattive decoding
a7fdfe6

Luigi commited on

fix role disorder error in history
06a162a

Luigi commited on