llmware
/

phi-3-onnx

Model card Files Files and versions Community

phi-3-onnx / README.md

doberst's picture

Update README.md

3e614c7 verified 3 months ago

|

history blame contribute delete

897 Bytes

	---
	license: apache-2.0
	inference: false
	base_model: microsoft/Phi-3-mini-4k-instruct
	base_model_relation: quantized
	tags: [green, llmware-chat, p3, onnx]
	---

	# phi-3-onnx

	phi-3-onnx is an ONNX int4 quantized version of [Microsoft Phi-3-mini-4k-instruct](https://www.huggingface.co/microsoft/Phi-3-mini-4k-instruct), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.


	### Model Description

	- Developed by: microsoft
	- Quantized by: llmware
	- Model type: phi3
	- Parameters: 3.8 billion
	- Model Parent: microsoft/Phi-3-mini-4k-instruct
	- Language(s) (NLP): English
	- License: Apache 2.0
	- Uses: Chat, general-purpose LLM
	- Quantization: int4


	## Model Card Contact

	[llmware on hf](https://www.huggingface.co/llmware)

	[llmware website](https://www.llmware.ai)