# FLAN-T5-XXL Fused Model
This repository contains a fused, single-file version of FLAN-T5-XXL, created by merging the sharded checkpoint files from [Google's FLAN-T5-XXL repository](https://huggingface.co/google/flan-t5-xxl). The merged files are easier to use in AI applications, including as the text encoder for image-generation models.
Additionally, a GGUF version of this model is available at [dumb-dev's Hugging Face repository](https://huggingface.co/dumb-dev/flan-t5-xxl-gguf). That version targets GGUF-compatible runtimes, such as llama.cpp and inference frameworks built on it.
## Files
- `flan_t5_xxl_fp32.safetensors`: Full-precision FP32 model.
- `flan_t5_xxl_fp16.safetensors`: Half-precision FP16 model for memory-efficient inference.
## License
This model is provided under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0).
The uploader does not claim any rights over the model.
## Acknowledgments
- Original source: [Google's FLAN-T5-XXL repository](https://huggingface.co/google/flan-t5-xxl).
- GGUF version: [dumb-dev's Hugging Face repository](https://huggingface.co/dumb-dev/flan-t5-xxl-gguf).