Inference seems to be broken with latest llama.cpp llama-server
1
#2 opened 7 months ago
by
AIWintermuteAI

-cml / --chatml has been discontinued in llama.cpp
2
#1 opened 9 months ago
by
algorithm
