easygoing0114
/

flan-t5-xxl-fused

GGUF

T5xxl

Google FLAN

Inference Endpoints

Model card Files Files and versions Community

easygoing0114 commited on 18 days ago

Commit

fdbfc65

verified ·

1 Parent(s): e08dad9

Update README.md

Browse files

Files changed (1) hide show

README.md +75 -46

README.md CHANGED Viewed

@@ -1,84 +1,113 @@
 ---
 license: apache-2.0
 tags:
-- T5xxl
-- Google FLAN
 ---
 # FLAN-T5-XXL Fused Model
-This repository contains a fused version of FLAN-T5-XXL, combining the split files available at [Google's FLAN-T5-XXL repository](https://huggingface.co/google/flan-t5-xxl). These files have been merged for easier use in AI applications, including image generation models.
----
-## Newly Added: TE-Only Models for Stable Diffusion WebUI Forge & ComfyUI (2025-03-04)
-Two additional files provide the **Text Encoder (TE) only** portion of FLAN-T5-XXL, specifically extracted for use with Stable Diffusion WebUI Forge and ComfyUI.
-<div style="display: flex; justify-content: center; align-items: center;">
-  <div style="text-align: center; margin-right: 1em;">
-    <img src="./flan_t5_xxl_TE-only_FP32_sample1.png" alt="flan_t5_xxl_TE-only_FP32_sample1" width="400px" height="400px">
   </div>
-  <div style="text-align: center; margin-left: 1em;">
-    <img src="./flan_t5_xxl_TE-only_FP32_sample2.png" alt="flan_t5_xxl_TE-only_FP32_sample2" width="400px" height="400px">
   </div>
 </div>
-- `flan_t5_xxl_TE-only_FP32.safetensors`: Full-precision FP32 TE-only model.
-- `flan_t5_xxl_TE-only_FP16.safetensors`: Half-precision FP16 TE-only model for memory-efficient inference.
-These models retain only the text encoding functionality of FLAN-T5-XXL, reducing resource consumption while maintaining high-quality prompt processing in AI image generation workflows.
-Also can be used as drop-in replacements for standard text encoders in Stable Diffusion-based workflows.
----
-## Full Models
-- `flan_t5_xxl_fp32.safetensors`: Full-precision FP32 Full model.
-- `flan_t5_xxl_fp16.safetensors`: Half-precision FP16 Full model for memory-efficient inference.
----
-## Comparison: FLAN-T5-XXL-FP32 vs FLAN-T5-XXL-FP16 on Flux.1[dev] (base model: [blue_pencil-flux1_v0.0.1](https://huggingface.co/bluepen5805/blue_pencil-flux1))
-<div style="text-align: center; margin-left: auto; margin-right: auto;width:600px;max-width:80%;">
-  <img src="./Flan-T5xxl-FP32_FP16_compare.png" alt="Flan-T5xxl-FP32_FP16_compare">
 </div>
----
 ## Comparison: FLAN-T5-XXL vs T5-XXL v1.1
-<div style="display: flex; justify-content: center; align-items: center;">
-  <div style="text-align: center; margin-right: 1em;">
-    <img src="./flan_t5_xxl_image.png" alt="FLAN-T5-XXL Image" width="400px" height="400px">
-    <p>FLAN-T5-XXL Output</p>
   </div>
-  <div style="text-align: center; margin-left: 1em;">
-    <img src="./t5_xxl_v1_1_image.png" alt="T5-XXL v1.1 Image" width="400px" height="400px">
-    <p>T5-XXL v1.1 Output</p>
   </div>
 </div>
-These example images generated using **FLAN-T5-XXL** and [**T5-XXL v1.1**](https://huggingface.co/google/t5-v1_1-xxl) models in Flux.1.
-FLAN-T5-XXL provides more accurate responses to prompts.
----
-## Further Comparison
-- [FLAN-T5-XXL vs T5-XXL v1.1](https://www.ai-image-journey.com/2024/12/clip-t5xxl-text-encoder.html)
-- [FLAN-T5-XXL FP32 vs FP16 and other quantization](https://www.ai-image-journey.com/2024/12/image-difference-t5xxl-clip-l.html)
 ---
 ## License
-This model is provided under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0).
-The uploader does not claim any rights over the model.
----
-## Acknowledgments
-- Original source: [Google's FLAN-T5-XXL repository](https://huggingface.co/google/flan-t5-xxl).
-- GGUF version: [dumb-dev's Hugging Face repository](https://huggingface.co/dumb-dev/flan-t5-xxl-gguf).

 ---
 license: apache-2.0
 tags:
+  - T5xxl
+  - Google FLAN
 ---
 # FLAN-T5-XXL Fused Model
+This repository hosts a fused version of the FLAN-T5-XXL model, created by combining the split files from [Google's FLAN-T5-XXL repository](https://huggingface.co/google/flan-t5-xxl). The files have been merged for convenience, making it easier to integrate into AI applications, including image generation workflows.
+<div style="display: flex; justify-content: center; align-items: center; gap: 2em;">
+  <div>
+    <img src="./images/flan_t5_xxl_TE-only_FP32_sample1.png" alt="FLAN-T5-XXL sample image 1" width="400px" height="400px">
   </div>
+  <div>
+    <img src="./images/flan_t5_xxl_TE-only_FP32_sample2.png" alt="FLAN-T5-XXL sample image 2" width="400px" height="400px">
   </div>
 </div>
+Sample pictures: Base Model [**blue_pencil-flux1_v0.0.1**](https://huggingface.co/bluepen5805/blue_pencil-flux1)
+## Key Features
+- **Fused for Simplicity:** Combines split model files into a single, ready-to-use format.
+- **Optimized Variants:** Available in FP32, FP16, and quantized GGUF formats to balance accuracy and resource usage.
+- **Enhanced Prompt Accuracy:** Outperforms the standard T5-XXL v1.1 in generating precise outputs for image generation tasks.
+## Model Variants
+| Model File                          | Size   | Accuracy (SSIM Similarity) | Recommended |
+|-------------------------------------|:--------:|:----------------------------:|:-------------:|
+| flan_t5_xxl_fp32.safetensors       | 44.1GB | 100%                       |             |
+| flan_t5_xxl_fp16.safetensors       | 22.1GB | 99.9%                      |             |
+| flan_t5_xxl_TE-only_FP32.safetensors| 18.7GB | 100%                       | 🔺           |
+| flan_t5_xxl_TE-only_FP16.safetensors| 9.4GB  | 99.9%                      | ✅          |
+| flan_t5_xxl_TE-only_Q8_0.gguf      | 5.5GB  | 99.8%                      | ✅          |
+| flan_t5_xxl_TE-only_Q6_K.gguf      | 4.4GB  | 99.7%                      | 🔺           |
+| flan_t5_xxl_TE-only_Q5_K_M.gguf    | 3.8GB  | 98.4%                      | 🔺           |
+| flan_t5_xxl_TE-only_Q4_K_M.gguf    | 3.2GB  | 95.2%                      |             |
+| flan_t5_xxl_TE-only_Q3_K_L.gguf    | 2.6GB  | 84.9%                      |             |
+### Comparison Graph
+<div style="text-align: center; margin-left: auto; margin-right: auto; width: 600px; max-width: 80%;">
+  <img src="./images/Flan-T5xxl MAE and SSIM Similarity.png" alt="Flan-T5xxl MAE and SSIM Similarity Graph">
+</div>
+For a detailed comparison, refer to [this blog post](https://www.ai-image-journey.com/2024/12/image-difference-t5xxl-clip-l.html).
+## Usage Instructions
+Place the downloaded model files in one of the following directories:
+- `installation_folder/models/text_encoder`
+- `installation_folder/models/clip`
+- `installation_folder/Models/CLIP`
+### Stable Diffusion WebUI Forge
+In Stable Diffusion WebUI Forge, select the FLAN-T5-XXL model instead of the default T5xxl_v1_1 text encoder.
+<div style="text-align: center; margin-left: auto; margin-right: auto; width: 800px; max-width: 80%;">
+  <img src="./images/Screenshot of Stable Diffusion WebUI Forge text encoder selection screen.png" alt="Stable Diffusion WebUI Forge Text Encoder Selection Screen">
 </div>
+**Note:** Stable Diffusion WebUI Forge does not support FP32 models. Use FP16 or GGUF formats instead.
+### ComfyUI
+**Sample Workflow**
+For ComfyUI, we recommend using the [ComfyUI-MultiGPU](https://github.com/neuratech-ai/ComfyUI-MultiGPU) custom node to load the model into system RAM instead of VRAM.
+<div style="text-align: center; margin-left: auto; margin-right: auto; width: 800px; max-width: 80%;">
+  <img src="./images/Screenshots of ComfyUI's DualCLIPLoaderMultiGPU and DualCLIPLoaderGGUFMultiGPU custom nodes.png" alt="ComfyUI DualCLIPLoaderMultiGPU and DualCLIPLoaderGGUFMultiGPU Custom Nodes">
+</div>
+Use the **DualCLIPLoaderMultiGPU** or **DualCLIPLoaderGGUFMultiGPU** node and set the device to **cpu** to load the model into system RAM.
+**FP32 Support:** To use FP32 text encoders in ComfyUI, launch with the `--fp32-text-enc` flag.
 ## Comparison: FLAN-T5-XXL vs T5-XXL v1.1
+<div style="display: flex; justify-content: center; align-items: center; gap: 2em;">
+  <div>
+    <img src="./images/flan_t5_xxl_image.png" alt="FLAN-T5-XXL Image" width="400px" height="400px">
   </div>
+  <div>
+    <img src="./images/t5_xxl_v1_1_image.png" alt="T5-XXL v1.1 Image" width="400px" height="400px">
   </div>
 </div>
+These example images were generated using **FLAN-T5-XXL** and [**T5-XXL v1.1**](https://huggingface.co/google/t5-v1_1-xxl) models in Flux.1. FLAN-T5-XXL delivers more accurate responses to prompts.
+## Further Comparisons
+- [FLAN-T5-XXL vs T5-XXL v1.1](https://ai-image-journey.blogspot.com/2024/12/clip-t5xxl-text-encoder.html)
+- [FLAN-T5-XXL FP32 vs FP16 and Quantization](https://ai-image-journey.blogspot.com/2024/12/image-difference-t5xxl-clip-l.html)
+### Tip: Upgrade CLIP-L Too
+For even better results, consider upgrading the CLIP-L text encoder alongside FLAN-T5-XXL:
+- [LongCLIP-SAE-ViT-L-14](https://huggingface.co/zer0int/LongCLIP-SAE-ViT-L-14) (ComfyUI only)
+- [CLIP-SAE-ViT-L-14](https://huggingface.co/zer0int/CLIP-SAE-ViT-L-14)
+Combining FLAN-T5-XXL with an upgraded CLIP-L can further enhance image quality.
 ---
 ## License
+- This model is distributed under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0).
+- The uploader claims no ownership or rights over the model.