<!--Copyright 2024 The HuggingFace Team. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
specific language governing permissions and limitations under the License.
-->

# ONNX Runtime
🤗 [Optimum](https://github.com/huggingface/optimum) provides a Stable Diffusion pipeline compatible with ONNX Runtime. You'll need to install 🤗 Optimum with the following command for ONNX Runtime support:

```bash
pip install -q "optimum[onnxruntime]"
```
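To check which execution providers ONNX Runtime can use on your machine (CPU by default; GPU requires the `onnxruntime-gpu` package instead), you can query the runtime directly. A quick sanity check:

```python
import onnxruntime

# Lists the available execution providers, e.g. ["CPUExecutionProvider"]
print(onnxruntime.get_available_providers())
```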
This guide will show you how to use the Stable Diffusion and Stable Diffusion XL (SDXL) pipelines with ONNX Runtime.

## Stable Diffusion
To load and run inference, use the [`~optimum.onnxruntime.ORTStableDiffusionPipeline`]. If you want to load a PyTorch model and convert it to the ONNX format on-the-fly, set `export=True`:

```python
from optimum.onnxruntime import ORTStableDiffusionPipeline

model_id = "stable-diffusion-v1-5/stable-diffusion-v1-5"
pipeline = ORTStableDiffusionPipeline.from_pretrained(model_id, export=True)
prompt = "sailing ship in storm by Leonardo da Vinci"
image = pipeline(prompt).images[0]
pipeline.save_pretrained("./onnx-stable-diffusion-v1-5")
```
<Tip warning={true}>

Generating multiple prompts in a batch seems to take too much memory. While we look into it, you may need to iterate instead of batching.

</Tip>
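A simple workaround is to loop over the prompts and generate one image per call. A minimal sketch, reusing the pipeline saved above (the prompt list is illustrative):

```python
from optimum.onnxruntime import ORTStableDiffusionPipeline

pipeline = ORTStableDiffusionPipeline.from_pretrained("./onnx-stable-diffusion-v1-5")

# Generate one image per prompt instead of passing the whole list at once
prompts = ["sailing ship in storm by Leonardo da Vinci", "a watercolor lighthouse at dawn"]
images = [pipeline(prompt).images[0] for prompt in prompts]
```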
To export the pipeline in the ONNX format offline and use it later for inference, use the [`optimum-cli export`](https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#exporting-a-model-to-onnx-using-the-cli) command:

```bash
optimum-cli export onnx --model stable-diffusion-v1-5/stable-diffusion-v1-5 sd_v15_onnx/
```
Then to perform inference (you don't have to specify `export=True` again):

```python
from optimum.onnxruntime import ORTStableDiffusionPipeline

model_id = "sd_v15_onnx"
pipeline = ORTStableDiffusionPipeline.from_pretrained(model_id)
prompt = "sailing ship in storm by Leonardo da Vinci"
image = pipeline(prompt).images[0]
```

<div class="flex justify-center">
    <img src="https://huggingface.co/datasets/optimum/documentation-images/resolve/main/onnxruntime/stable_diffusion_v1_5_ort_sail_boat.png">
</div>
You can find more examples in the 🤗 Optimum [documentation](https://huggingface.co/docs/optimum/). Stable Diffusion is supported for text-to-image, image-to-image, and inpainting.
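For instance, image-to-image uses [`~optimum.onnxruntime.ORTStableDiffusionImg2ImgPipeline`] with the same exported weights. A minimal sketch, reusing the `sd_v15_onnx` export from above (the input image URL and generation parameters are illustrative):

```python
from diffusers.utils import load_image
from optimum.onnxruntime import ORTStableDiffusionImg2ImgPipeline

pipeline = ORTStableDiffusionImg2ImgPipeline.from_pretrained("sd_v15_onnx")

# Example input image; any RGB image resized to the model's resolution works
url = "https://raw.githubusercontent.com/CompVis/stable-diffusion/main/assets/stable-samples/img2img/sketch-mountains-input.jpg"
init_image = load_image(url).resize((768, 512))

prompt = "A fantasy landscape, trending on artstation"
image = pipeline(prompt=prompt, image=init_image, strength=0.75, guidance_scale=7.5).images[0]
```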
## Stable Diffusion XL
To load and run inference with SDXL, use the [`~optimum.onnxruntime.ORTStableDiffusionXLPipeline`]. As with Stable Diffusion, set `export=True` to convert the PyTorch checkpoint on-the-fly:

```python
from optimum.onnxruntime import ORTStableDiffusionXLPipeline

model_id = "stabilityai/stable-diffusion-xl-base-1.0"
pipeline = ORTStableDiffusionXLPipeline.from_pretrained(model_id, export=True)
prompt = "sailing ship in storm by Leonardo da Vinci"
image = pipeline(prompt).images[0]
```
To export the pipeline in the ONNX format and use it later for inference, use the [`optimum-cli export`](https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#exporting-a-model-to-onnx-using-the-cli) command:

```bash
optimum-cli export onnx --model stabilityai/stable-diffusion-xl-base-1.0 --task stable-diffusion-xl sd_xl_onnx/
```
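The exported directory can then be loaded directly, just like in the Stable Diffusion example above; a minimal sketch:

```python
from optimum.onnxruntime import ORTStableDiffusionXLPipeline

pipeline = ORTStableDiffusionXLPipeline.from_pretrained("sd_xl_onnx")
prompt = "sailing ship in storm by Leonardo da Vinci"
image = pipeline(prompt).images[0]
```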
SDXL in the ONNX format is supported for text-to-image and image-to-image.
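For image-to-image, [`~optimum.onnxruntime.ORTStableDiffusionXLImg2ImgPipeline`] follows the same pattern. A minimal sketch using the SDXL refiner checkpoint (the input image URL is illustrative; any RGB image works):

```python
from diffusers.utils import load_image
from optimum.onnxruntime import ORTStableDiffusionXLImg2ImgPipeline

pipeline = ORTStableDiffusionXLImg2ImgPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-refiner-1.0", export=True
)

# Example input image to refine; replace with your own
url = "https://huggingface.co/datasets/optimum/documentation-images/resolve/main/intel/openvino/sd_xl/castle_friedrich.png"
init_image = load_image(url)

prompt = "medieval castle by Caspar David Friedrich"
image = pipeline(prompt=prompt, image=init_image).images[0]
```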