Qwen1.5-4B-Chat-rkllm

This is a conversion of Qwen/Qwen1.5-4B-Chat to the RKLLM format for chat on Rockchip devices.

Supported devices

  • RK3588/RK3588s

Conversion tools

To convert LLMs for Rockchip's NPUs, see articles 1 and 2 in the references below for details.
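The conversion is done with Rockchip's rkllm-toolkit on an x86 host. The sketch below follows the example scripts shipped with airockchip/rknn-llm (reference 1); parameter names and supported quantization types can differ between toolkit versions, so treat it as an outline rather than a drop-in script.

```python
# Sketch of a Qwen1.5-4B-Chat -> .rkllm conversion with rkllm-toolkit
# (API as in the rknn-llm 1.0.x examples; verify against your toolkit version).
from rkllm.api import RKLLM

llm = RKLLM()

# Load the original Hugging Face checkpoint
ret = llm.load_huggingface(model='Qwen/Qwen1.5-4B-Chat')
assert ret == 0, 'model load failed'

# Quantize and optimize for the RK3588 NPU
ret = llm.build(do_quantization=True, optimization_level=1,
                quantized_dtype='w8a8', target_platform='rk3588')
assert ret == 0, 'build failed'

# Export the .rkllm model file deployed to the device
ret = llm.export_rkllm('./qwen1.5-4b-chat.rkllm')
assert ret == 0, 'export failed'
```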

Converted with RKLLM runtime

  • RKLLM runtime 1.0.1

License

Same as the original Qwen/Qwen1.5-4B-Chat

Troubleshooting

  • E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
    firefly@firefly:~/Documents/rknn-llm$ rkllm ./chatglm3-6b.rkllm
    rkllm init start
    rkllm-runtime version: 1.0.1, rknpu driver version: 0.8.2, platform: RK3588
    Warning: Your rknpu driver version is too low, please upgrade to 0.9.6.
    E RKNN: [10:48:59.683] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
    
    can not create weight memory for domain1
    E RKNN: [10:49:00.480] failed to allocate handle, ret: -1, errno: 12, errstr: Cannot allocate memory
    
    can not create weight memory for domain2
    E RKNN: [10:49:05.216] failed to convert handle(1020) to fd, ret: -1, errno: 24, errstr: Too many open files
    
    # Solution: raise the per-process open-file limit before running rkllm
    firefly@firefly:~/Documents/rknn-llm$ ulimit -n 102400
    
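The `ulimit` fix above only affects the current shell session. If you launch the runtime from a Python process instead, the same limit can be raised in-process with the standard `resource` module before rkllm init. A minimal sketch (the 102400 value mirrors the shell command above, capped at the hard limit):

```python
import resource

# Current per-process open-file limits: (soft, hard)
soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)

# Raise the soft limit toward 102400, but never above the hard limit;
# this is the in-process equivalent of `ulimit -n 102400`
new_soft = min(102400, hard)
resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))

print(resource.getrlimit(resource.RLIMIT_NOFILE)[0])  # effective soft limit
```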

References

  1. airockchip/rknn-llm
  2. Pelochus/ezrknn-llm
  3. Qwen/Qwen1.5-4B-Chat
  4. 跑大模型遇到问题 ("Problems running large models") #62