qwen-7b-chat / requirements.txt
transformers==4.31.0
accelerate
tiktoken
einops
# flash-attention (optional, for faster attention); build from source:
# git clone -b v1.0.8 https://github.com/Dao-AILab/flash-attention
# cd flash-attention && pip install .
# pip install csrc/layer_norm
# pip install csrc/rotary
torch # 2.0.1
safetensors
# bitsandbytes
bitsandbytes-cuda116  # CUDA 11.6-specific build of bitsandbytes
transformers_stream_generator  # required by Qwen's streaming chat generation
scipy
loguru
about-time
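# Usage sketch (comments only, so this file stays a valid requirements.txt):
# a minimal example of how these dependencies fit together when loading the chat
# model. The model id "Qwen/Qwen-7B-Chat" and the chat() helper exposed via
# trust_remote_code are assumptions based on this Space's name, not pinned here.
#
#   from transformers import AutoModelForCausalLM, AutoTokenizer
#
#   model_id = "Qwen/Qwen-7B-Chat"  # assumed model id
#   # tiktoken is needed by the Qwen tokenizer pulled in via trust_remote_code
#   tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
#   model = AutoModelForCausalLM.from_pretrained(
#       model_id,
#       device_map="auto",       # accelerate handles device placement
#       trust_remote_code=True,  # loads Qwen's custom modeling code
#   ).eval()
#   response, history = model.chat(tokenizer, "Hello", history=None)
#   print(response)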