awacke1's picture
Update README.md
292333e verified
|
raw
history blame
3.63 kB
metadata
title: TorchTransformers Diffusion CV SFT
emoji: 
colorFrom: yellow
colorTo: indigo
sdk: streamlit
sdk_version: 1.43.2
app_file: app.py
pinned: false
license: mit
short_description: Torch Transformers Diffusion SFT for Computer Vision

Tune NLP 🧠 or CV 🎨 fast! Texts 📝 or pics 📸, SFT shines ✨. pip install -r requirements.txt, streamlit run app.py. Snap cams 📷, craft art—AI’s lean & mean! 🎉 #SFTSpeed

SFT Tiny Titans 🚀 (Small Diffusion Delight!)

A Streamlit app for Supervised Fine-Tuning (SFT) of small diffusion models, featuring multi-camera capture, model testing, and agentic RAG demos with a playful UI.

Features 🎉

  • Build Titan 🌱: Spin up tiny diffusion models from Hugging Face (Micro Diffusion, Latent Diffusion, FLUX.1 Distilled).
  • Camera Snap 📷: Snap pics with 6 cameras using a 4-column grid UI per cam—witty, emoji-packed controls for device, label, hint, and visibility! 📸✨
  • Fine-Tune Titan (CV) 🔧: Tune models with 3 use cases—denoising, stylization, multi-angle generation—using your camera captures, with CSV/MD exports.
  • Test Titan (CV) 🧪: Generate images from prompts with your tuned diffusion titan.
  • Agentic RAG Party (CV) 🌐: Craft superhero party visuals from camera-inspired prompts.
  • Media Gallery 🎨: View, download, or zap captured images with flair.

Installation 🛠️

  1. Clone the repo:
    git clone <repository-url>
    cd sft-tiny-titans
    

Abstract

TorchTransformers Diffusion SFT Titans harnesses torch, transformers, and diffusers for cutting-edge NLP and CV, powered by supervised fine-tuning (SFT). Dual st.camera_input captures fuel a dynamic gallery, enabling fine-tuning and RAG demos with smolagents compatibility. Key papers illuminate the stack:

Run: pip install -r requirements.txt, streamlit run ${app_file}. Snap, tune, party! ${emoji}