xlsd16-phase2 / README.md
ppbrown's picture
Update README.md
7cb6c67 verified
metadata
datasets:
  - opendiffusionai/cc12m-cleaned
  - opendiffusionai/laion2b-en-aesthetic-square
base_model:
  - opendiffusionai/XLSD-V0.0

Work in progress... unlike the other uploaded models, this is a bf16 precision model. IF someone wants to mess around with a bf16 version of my xlsd project, here it is.

image/png

I am continually refining my datasets. For phase 2, this uses a subset of https://huggingface.co/datasets/opendiffusionai/laion2b-en-aesthetic-square with the original LAION captions. However, I am currently working on recaptioning with InternLM 7b, and then filtering some more. I will then redo training of this "phase 2" step with the updated dataset.

Long term ideal plans would be, once I have optimally cleaned datasets, to redo from scratch, with phase one using the 200k laion square subset, then phase 2 using a square subset of my CC12M https://huggingface.co/datasets/opendiffusionai/cc12m-xlsd (and possibly adding back 1mp sized images) (or possibly switching the order. Not sure yet)

Finally, doing a cleanup with https://huggingface.co/datasets/opendiffusionai/pexels-janpf-sharp