Could you please tell how to inference this model?

#4
by carlosbdw - opened

thank you very much , I used to use vllm ,but it doesn't work with it.

Any specific ideas on how to infer with vllm?

you can't. it's tensorRT

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment