README.md · ginipick/Realtime-FLUX at 644e6635ed849c92c979ccc2ef88c89c3f8131e7

metadata

title: Realtime FLUX Image
emoji: 💬⚡
colorFrom: yellow
colorTo: pink
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: true
license: mit
short_description: mcp_server & High quality Images in Realtime

Looking at this code, it's a Gradio-based application for real-time image generation using the FLUX.1-schnell model. Here's a detailed explanation:

English Explanation

Overview

This application provides a real-time image generation interface using the FLUX.1-schnell diffusion model. It features instant preview capabilities where images are generated as you type, making it highly interactive and user-friendly.

Key Features

Real-time Generation
- Images are generated automatically as you type in the prompt
- Uses GPU acceleration with @spaces.GPU decorator
- Optimized for fast inference with only 1-4 steps
User Interface Components
- Prompt Input: Text area for describing desired images
- Generated Image: Real-time display of generated results
- Enhance Button: Manual trigger for image generation
- Latency Display: Shows processing time for each generation
Advanced Options
- Seed Control: For reproducible results (0 to 2³²-1)
- Randomize Seed: Toggle for random seed generation
- Width/Height Sliders: Image dimensions (256-2048 pixels)
- Inference Steps: Control generation quality/speed (1-4 steps)
Special Features
- Snow Effect: Animated snowflakes falling across the interface
- Korean Text Detection: Warns when Korean text is detected in prompts
- Example Gallery: Pre-defined creative prompts for inspiration
- Automatic CUDA Cache Clearing: Prevents memory overflow

Technical Implementation

Model Configuration
- Uses FLUX.1-schnell with float16 precision for efficiency
- Custom pipeline with intermediate outputs capability
- GPU duration limited to 15 seconds per generation
Input Validation
- Automatic size constraints (256-2048 pixels)
- Seed validation and randomization
- Error handling with graceful fallbacks
Performance Optimizations
- Automatic Mixed Precision (AMP) for faster computation
- CUDA cache clearing after each generation
- Minimal inference steps for real-time performance

Example Prompts Included

Steampunk owl in Victorian clothing
Floating island made of books
Bioluminescent cyberpunk forest
Ancient temple with robot archaeologists
Cosmic coffee shop with constellation baristas

한글 설명

개요

이 애플리케이션은 FLUX.1-schnell 확산 모델을 사용한 실시간 이미지 생성 인터페이스입니다. 타이핑하는 동안 즉시 이미지가 생성되는 기능을 제공하여 매우 상호작용적이고 사용자 친화적입니다.

주요 기능

실시간 생성
- 프롬프트를 입력하는 동안 자동으로 이미지 생성
- @spaces.GPU 데코레이터를 통한 GPU 가속
- 1-4 단계만으로 빠른 추론 최적화
사용자 인터페이스 구성요소
- 프롬프트 입력: 원하는 이미지를 설명하는 텍스트 영역
- 생성된 이미지: 생성 결과의 실시간 표시
- 향상 버튼: 수동 이미지 생성 트리거
- 지연 시간 표시: 각 생성의 처리 시간 표시
고급 옵션
- 시드 제어: 재현 가능한 결과를 위한 설정 (0 ~ 2³²-1)
- 시드 무작위화: 무작위 시드 생성 토글
- 너비/높이 슬라이더: 이미지 크기 (256-2048 픽셀)
- 추론 단계: 생성 품질/속도 제어 (1-4 단계)
특별 기능
- 눈 효과: 인터페이스 전체에 떨어지는 애니메이션 눈송이
- 한글 텍스트 감지: 프롬프트에 한글이 감지되면 경고 표시
- 예제 갤러리: 영감을 위한 사전 정의된 창의적 프롬프트
- 자동 CUDA 캐시 정리: 메모리 오버플로 방지

기술적 구현

모델 구성
- 효율성을 위한 float16 정밀도의 FLUX.1-schnell 사용
- 중간 출력 기능이 있는 커스텀 파이프라인
- 생성당 GPU 시간을 15초로 제한
입력 검증
- 자동 크기 제약 (256-2048 픽셀)
- 시드 검증 및 무작위화
- 우아한 폴백을 통한 오류 처리
성능 최적화
- 빠른 계산을 위한 자동 혼합 정밀도(AMP)
- 각 생성 후 CUDA 캐시 정리
- 실시간 성능을 위한 최소 추론 단계

포함된 예제 프롬프트

빅토리아 시대 의상을 입은 스팀펑크 올빼미
책으로 만들어진 떠다니는 섬
생물발광 사이버펑크 숲
로봇 고고학자가 있는 고대 사원
별자리 바리스타가 있는 우주 커피숍

사용 팁

한글 프롬프트는 지원되지만 영어 프롬프트가 더 나은 결과를 생성합니다
빠른 미리보기를 위해 추론 단계를 낮게 유지하세요
고품질 이미지를 위해서는 "향상" 버튼을 클릭하세요
시드 값을 고정하면 동일한 이미지를 재생성할 수 있습니다