license: mit


A semi-custom network based on the paper Simpler Diffusion (SiD2).

This network uses the optimal transport flow matching objective outlined in Flow Matching for Generative Modeling.
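As a rough illustration of that objective, here is a minimal, dependency-free sketch of the optimal transport conditional path from the flow matching paper: a noise sample `x0` and a data sample `x1` are linearly interpolated, and the regression target is the constant velocity between them. The function names, the `sigma_min` default, and the list-based vectors are assumptions made for illustration; this is not the code in `train.py`.

```python
import random


def ot_flow_matching_pair(x0, x1, t, sigma_min=0.0):
    """Return (x_t, u_t) for the optimal transport conditional path.

    x0: noise sample, x1: data sample (equal-length lists), t in [0, 1].
    x_t = (1 - (1 - sigma_min) * t) * x0 + t * x1
    u_t = x1 - (1 - sigma_min) * x0
    """
    scale = 1.0 - (1.0 - sigma_min) * t
    x_t = [scale * a + t * b for a, b in zip(x0, x1)]
    u_t = [b - (1.0 - sigma_min) * a for a, b in zip(x0, x1)]
    return x_t, u_t


def flow_matching_loss(predict_velocity, x1, rng, sigma_min=0.0):
    """Mean squared error between the predicted and target velocity.

    predict_velocity is a hypothetical stand-in for the network:
    a callable taking (x_t, t) and returning a list shaped like x_t.
    """
    t = rng.random()                          # t ~ U[0, 1)
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]    # x0 ~ N(0, I)
    x_t, u_t = ot_flow_matching_pair(x0, x1, t, sigma_min)
    v = predict_velocity(x_t, t)
    return sum((p - q) ** 2 for p, q in zip(v, u_t)) / len(x1)


if __name__ == "__main__":
    rng = random.Random(0)
    zero_model = lambda x_t, t: [0.0 for _ in x_t]  # placeholder "network"
    print(flow_matching_loss(zero_model, [3.0, 0.5], rng))
```

With `sigma_min=0.0` the path reduces to a straight line from noise to data, so the target velocity is simply `x1 - x0` at every `t`.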

xATGLU layers are used instead of linear layers for entry into the transformer MLP layer, following Expanded Gating Ranges Improve Activation Functions.
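To make the idea concrete, here is a small pure-Python sketch of an expanded arctan gated linear unit. It assumes the formulation from the cited paper: an arctan gate rescaled to (0, 1), then widened to (-alpha, 1 + alpha) by a parameter `alpha` that would be trainable in a real network, and multiplied by a separate linear value path. The helper names and the list-of-lists weights are hypothetical; the layer in this repository may differ in detail.

```python
import math


def expanded_arctan_gate(z, alpha=0.0):
    """Arctan gate in (0, 1), expanded to the range (-alpha, 1 + alpha)."""
    gate = (math.atan(z) + math.pi / 2) / math.pi
    return gate * (1.0 + 2.0 * alpha) - alpha


def linear(x, weight, bias):
    """Plain affine map: one output per (row of weight, bias entry)."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weight, bias)]


def xatglu(x, w_gate, b_gate, w_value, b_value, alpha=0.0):
    """Gated linear unit: expanded arctan gate times a linear value path."""
    gate_path = linear(x, w_gate, b_gate)
    value_path = linear(x, w_value, b_value)
    return [expanded_arctan_gate(g, alpha) * v
            for g, v in zip(gate_path, value_path)]


if __name__ == "__main__":
    x = [1.0, 2.0]
    # Zero gate weights give a gate of exactly 0.5 for every input.
    print(xatglu(x, [[0.0, 0.0]], [0.0], [[1.0, 1.0]], [0.0]))
```

With `alpha = 0` the gate stays in (0, 1), like a sigmoid-style GLU; a positive `alpha` lets the gate go slightly negative or above one.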

Running `python train.py` trains a new image network on the provided dataset. A pretrained model, `step_1799.safetensors`, is provided here. To test it, run: `python test_sample.py step_1799.safetensors`

[Images: four sample images]