# Shasha AI: Architecture Overview

A high-level view of the components and data flow in the Shasha AI code-generation platform.
## 1. Core Layers

### 1.1 Frontend (Gradio UI)

- `app.py`: Defines the Gradio Blocks UI: input panels (prompt, file, website, image), model selector, language selector, buttons, and output panes (Code, Preview, History). A minimal layout sketch follows this list.
- `static/`: Holds `index.html`, `index.js`, and `style.css` for any transformers.js demos and custom styling.
- `assets/`: Images, logos, and other static assets.
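One way such a Blocks layout could be wired is sketched below. The component names, tab arrangement, and dropdown choices are illustrative assumptions, not the exact contents of `app.py`.

```python
import gradio as gr

# Illustrative layout: inputs on the left, Code / Preview / History tabs on the right.
with gr.Blocks(title="Shasha AI") as demo:
    with gr.Row():
        with gr.Column():
            prompt = gr.Textbox(label="Prompt", lines=4)
            file_in = gr.File(label="File")
            url_in = gr.Textbox(label="Website URL")
            image_in = gr.Image(label="Image", type="filepath")
            model = gr.Dropdown(label="Model", choices=["(populated from models.py)"])
            language = gr.Dropdown(label="Language", choices=["html", "python"], value="html")
            generate = gr.Button("Generate")
        with gr.Column():
            with gr.Tab("Code"):
                code_out = gr.Code(label="Generated code")
            with gr.Tab("Preview"):
                preview = gr.HTML()
            with gr.Tab("History"):
                history = gr.Chatbot()

if __name__ == "__main__":
    demo.launch()
```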
### 1.2 Backend Services

#### 1.2.1 Inference Routing

- `hf_client.py`: `get_inference_client(model_id, provider)` (see the sketch below)
  - Wraps HF/OpenAI/Gemini/Groq providers behind a unified interface.
  - Automatically selects or falls back to a provider based on the model prefix and available credentials.
- `inference.py`: `chat_completion(...)` and `stream_chat_completion(...)`
  - Encapsulates request/response logic, streaming vs. blocking calls, and error handling.
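A hedged sketch of the routing idea behind `get_inference_client()`. The model prefixes, environment variables, and fallback order are assumptions; the real `hf_client.py` may differ.

```python
import os

from huggingface_hub import InferenceClient
from openai import OpenAI


def get_inference_client(model_id: str, provider: str | None = None):
    """Pick a client based on the model prefix and whichever credentials are set."""
    if model_id.startswith("gpt-") and os.getenv("OPENAI_API_KEY"):
        return OpenAI()  # reads OPENAI_API_KEY from the environment
    if model_id.startswith("gemini") and os.getenv("GEMINI_API_KEY"):
        # Gemini via its OpenAI-compatible endpoint.
        return OpenAI(
            api_key=os.environ["GEMINI_API_KEY"],
            base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
        )
    if model_id.startswith("groq/") and os.getenv("GROQ_API_KEY"):
        return OpenAI(
            api_key=os.environ["GROQ_API_KEY"],
            base_url="https://api.groq.com/openai/v1",
        )
    # Fallback: Hugging Face Inference Providers.
    return InferenceClient(model=model_id, provider=provider, token=os.getenv("HF_TOKEN"))
```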
#### 1.2.2 Model Registry

- `models.py`: `ModelInfo` dataclass & `AVAILABLE_MODELS` registry (illustrated below)
  - Central source of truth for supported models, identifiers, descriptions, and default providers.
  - Helper `find_model()` for lookup by name or ID.
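An illustrative shape for the registry; the field names and the single example entry are placeholders, not the real contents of `models.py`.

```python
from dataclasses import dataclass


@dataclass
class ModelInfo:
    name: str                      # display name shown in the UI
    model_id: str                  # provider-specific identifier
    description: str = ""
    default_provider: str = "auto"


AVAILABLE_MODELS = [
    ModelInfo(name="Example Coder", model_id="example-org/example-coder"),
]


def find_model(query: str) -> ModelInfo | None:
    """Case-insensitive lookup by display name or model ID."""
    q = query.strip().lower()
    for m in AVAILABLE_MODELS:
        if q in (m.name.lower(), m.model_id.lower()):
            return m
    return None
```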
#### 1.2.3 Prompt & History Management

- `constants.py`
  - System prompts for HTML/transformers.js modes, search/replace tokens, Gradio language support.
- `utils.py` (sketched below)
  - `history_to_messages()`, `remove_code_block()`, multimodal image encoding, parsing transformers.js outputs, search/replace utilities.
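A rough sketch of the two most-used helpers; the `(user, assistant)` history tuple format and the fence-stripping regex are assumptions about how `utils.py` works.

```python
import re


def history_to_messages(history, system_prompt: str):
    """Convert (user, assistant) pairs into an OpenAI-style message list."""
    messages = [{"role": "system", "content": system_prompt}]
    for user_msg, assistant_msg in history:
        messages.append({"role": "user", "content": user_msg})
        if assistant_msg:
            messages.append({"role": "assistant", "content": assistant_msg})
    return messages


def remove_code_block(text: str) -> str:
    """Strip a surrounding Markdown code fence from a model reply, if present."""
    match = re.search(r"```[\w.+-]*\n(.*?)```", text, re.DOTALL)
    return match.group(1).strip() if match else text.strip()
```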
#### 1.2.4 Deployment & Project Import

- `deploy.py`
  - `send_to_sandbox()`: live HTML preview in an iframe (see the sketch below)
  - `load_project_from_url()`: import existing HF Spaces (app.py/index.html)
  - `deploy_to_spaces*()`: create or update an HF Space via the Hub API
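A minimal sketch of the iframe-preview technique behind `send_to_sandbox()`: encode the generated HTML as a base64 data URI and embed it in a sandboxed iframe. The sandbox attributes and sizing are assumptions.

```python
import base64


def send_to_sandbox(html_code: str) -> str:
    """Wrap generated HTML in a sandboxed data-URI iframe for the Preview pane."""
    encoded = base64.b64encode(html_code.encode("utf-8")).decode("utf-8")
    data_uri = f"data:text/html;charset=utf-8;base64,{encoded}"
    return (
        f'<iframe src="{data_uri}" width="100%" height="800px" '
        f'sandbox="allow-scripts allow-forms"></iframe>'
    )
```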
## 2. Extensions & Plugins

- `plugins.py` (future)
  - Plugin discovery/loading for community-created extensions (e.g., DB runners, snippet libraries); one possible shape is sketched below.
- `auth.py` (future)
  - OAuth flows for GitHub, Google Drive, Slack → enable private file loading.
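Since `plugins.py` is still planned, the following is only one possible discovery mechanism, assuming community plugins are modules inside a `plugins/` package that each expose a `register()` hook.

```python
import importlib
import pkgutil


def discover_plugins(package_name: str = "plugins"):
    """Import every module in the plugins package and collect its register() hook."""
    registry = {}
    package = importlib.import_module(package_name)
    for mod_info in pkgutil.iter_modules(package.__path__):
        module = importlib.import_module(f"{package_name}.{mod_info.name}")
        register = getattr(module, "register", None)
        if callable(register):
            registry[mod_info.name] = register()
    return registry
```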
## 3. Project Structure

```
.
├── app.py           # Gradio application
├── constants.py     # System prompts & app-wide constants
├── hf_client.py     # Inference client factory
├── models.py        # Model registry & metadata
├── inference.py     # Chat completion wrappers
├── utils.py         # Helpers: history, code parsing, multimodal
├── deploy.py        # Sandbox preview & HF Spaces import/deploy
├── plugins.py       # (planned) Plugin architecture
├── auth.py          # (planned) OAuth integrations
├── notebooks/       # Demo Jupyter notebooks
├── tests/           # pytest suites & UI smoke tests
├── ci.yaml          # CI pipeline for lint/test/deploy
├── docs/
│   ├── QUICKSTART.md
│   ├── ARCHITECTURE.md
│   └── API_REFERENCE.md
└── static/          # transformers.js demos & CSS/JS assets
```
## 4. Data Flow

- User Interaction: the user enters a prompt, uploads a file, or selects a model & language.
- Preprocessing:
  - File → `extract_text_from_file()`
  - Website → `extract_website_content()`
  - Image → `process_image_for_model()`
- Message Assembly:
  - System prompt + history → OpenAI-style message list
  - Optionally enriched via web search (Tavily)
- Inference Call:
  - `get_inference_client()` → select provider & billing
  - `chat_completion()` or its streaming variant
- Postprocessing:
  - Parse code blocks / transformers.js multi-file output
  - Apply search/replace edits to existing code when editing
- UI Update:
  - Code pane
  - Live preview iframe (`send_to_sandbox`)
  - Chat history pane
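Putting the steps above together, here is a hedged end-to-end sketch that reuses the helper names from the earlier sections. The function signatures, `SYSTEM_PROMPT` constant, module locations, and history format are assumptions rather than the real API.

```python
# Assumed imports; the real module layout may group these helpers differently:
# from models import find_model
# from hf_client import get_inference_client
# from inference import chat_completion
# from utils import (history_to_messages, remove_code_block,
#                    extract_text_from_file, extract_website_content,
#                    process_image_for_model)
# from deploy import send_to_sandbox
# from constants import SYSTEM_PROMPT


def generate(prompt, file, url, image, model_name, history):
    # 1. Preprocessing: fold any uploaded context into the prompt.
    context = ""
    if file:
        context += extract_text_from_file(file)
    if url:
        context += extract_website_content(url)
    if image is not None:
        context += process_image_for_model(image)

    # 2. Message assembly: system prompt + history + current request.
    messages = history_to_messages(history, SYSTEM_PROMPT)
    messages.append({"role": "user", "content": f"{prompt}\n\n{context}".strip()})

    # 3. Inference call through the routed client.
    model = find_model(model_name)
    client = get_inference_client(model.model_id, model.default_provider)
    raw = chat_completion(client, model.model_id, messages)

    # 4. Postprocessing: extract clean code for the UI panes.
    code = remove_code_block(raw)
    preview_html = send_to_sandbox(code)
    history = history + [(prompt, code)]

    # 5. UI update: Code pane, Preview iframe, History pane.
    return code, preview_html, history
```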