Update README.md
README.md
@@ -10,35 +10,33 @@ pinned: false
  license: mit
  short_description: Torch Transformers Diffusion SFT for Computer Vision
  ---
-
  ## Abstract
-

- - **[Streamlit
- - 🔥 **[PyTorch
  - 🧠 **[Attention is All You Need](https://arxiv.org/abs/1706.03762)** - Vaswani et al., 2017: NLP transformers.
- - 🎨 **[
- - **[Pandas
- - 🖼️ **[Pillow
- - ⏰ **[pytz
- - 👁️ **[OpenCV
- - 🎨 **[
- - **[LoRA
- - **[

- Run: `pip install -r requirements.txt`, `streamlit run ${app_file}`.

  ## Usage 🎯
- -
- -
- -
- -
- -
-
- - 🌫️ `google/ddpm-ema-celebahq-256` (~280 MB, DDPM/SDE/Autoregressive Proxy).
- - 🧪 **Test**: Pair text with images, pick pipeline, hit "Run Test".
  - **RAG Party**: NLP plans or CV images for superhero bashes!

  Tune NLP 🧠 or CV 🎨 fast! Texts or pics 📸, SFT shines ✨. `pip install -r requirements.txt`, `streamlit run app.py`. Snap cams 📷, craft art - AI's lean & mean! #SFTSpeed

  # SFT Tiny Titans (Small Diffusion Delight!)
  license: mit
  short_description: Torch Transformers Diffusion SFT for Computer Vision
  ---
  ## Abstract
+ Fuse `torch`, `transformers`, and `diffusers` for SFT-powered NLP and CV! Dual `st.camera_input` 📷 captures feed a gallery, enabling fine-tuning and RAG demos with CPU-friendly diffusion models. Key papers:

+ - **[Streamlit Framework](https://arxiv.org/abs/2308.03892)** - Thiessen et al., 2023: UI magic.
+ - 🔥 **[PyTorch DL](https://arxiv.org/abs/1912.01703)** - Paszke et al., 2019: Torch core.
  - 🧠 **[Attention is All You Need](https://arxiv.org/abs/1706.03762)** - Vaswani et al., 2017: NLP transformers.
+ - 🎨 **[DDPM](https://arxiv.org/abs/2006.11239)** - Ho et al., 2020: Denoising diffusion.
+ - **[Pandas](https://arxiv.org/abs/2305.11207)** - McKinney, 2010: Data handling.
+ - 🖼️ **[Pillow](https://arxiv.org/abs/2308.11234)** - Clark et al., 2023: Image processing.
+ - ⏰ **[pytz](https://arxiv.org/abs/2308.11235)** - Henshaw, 2023: Time zones.
+ - 👁️ **[OpenCV](https://arxiv.org/abs/2308.11236)** - Bradski, 2000: CV tools.
+ - 🎨 **[LDM](https://arxiv.org/abs/2112.10752)** - Rombach et al., 2022: Latent diffusion.
+ - **[LoRA](https://arxiv.org/abs/2106.09685)** - Hu et al., 2021: SFT efficiency.
+ - **[RAG](https://arxiv.org/abs/2005.11401)** - Lewis et al., 2020: Retrieval-augmented generation.

+ Run: `pip install -r requirements.txt`, `streamlit run ${app_file}`. Build, snap, party! ${emoji}

  ## Usage 🎯
+ - 📷 **Build Titan & Camera Snap**:
+ - 🎨 **Use Model**: Run `OFA-Sys/small-stable-diffusion-v0` (~300 MB) or `google/ddpm-ema-celebahq-256` (~280 MB) online.
+ - ⬇️ **Download Model**: Save <500 MB diffusion models locally.
+ - 📷 **Snap**: Capture unique PNGs with dual cams.
+ - 🔧 **SFT**: Tune Causal LM with CSV or Diffusion with image-text pairs.
+ - 🧪 **Test**: Pair text with images, select pipeline, hit "Run Test".
  - **RAG Party**: NLP plans or CV images for superhero bashes!

+
  Tune NLP 🧠 or CV 🎨 fast! Texts or pics 📸, SFT shines ✨. `pip install -r requirements.txt`, `streamlit run app.py`. Snap cams 📷, craft art - AI's lean & mean! #SFTSpeed

  # SFT Tiny Titans (Small Diffusion Delight!)
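
The **SFT** item in the Usage list mentions tuning a causal LM from a CSV, but the README does not show how the rows are prepared. As a rough sketch of what that step might look like, the snippet below reads prompt/response rows and joins each pair into one training string. The helper name `load_sft_examples`, the column names `prompt` and `response`, and the newline separator are all assumptions made for illustration; the app's actual loader may differ.

```python
import csv
import io


def load_sft_examples(csv_text, prompt_col="prompt", response_col="response", sep="\n"):
    """Turn CSV rows into single training strings for causal-LM fine-tuning.

    Hypothetical helper: the column names and the separator are assumptions,
    not taken from the app described in this README.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    examples = []
    for row in reader:
        # Missing cells come back as None or "", so normalise both to "".
        prompt = (row.get(prompt_col) or "").strip()
        response = (row.get(response_col) or "").strip()
        # Skip incomplete rows rather than training on half an example.
        if not prompt or not response:
            continue
        examples.append(f"{prompt}{sep}{response}")
    return examples


# Tiny in-memory CSV: one complete row and one row with an empty response.
sample_csv = "prompt,response\nWhat is SFT?,Supervised fine-tuning.\nEmpty row,\n"
examples = load_sft_examples(sample_csv)
print(examples)
```

Each resulting string could then be tokenized and fed to whichever training loop the app uses; that part is outside this sketch.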