---
title: Small Model Chatbot
emoji: 😻
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: mit
short_description: Some small models chatbot
---
=======
# Multi-Model Tiny Chatbot

A lightweight, multi-model chat application featuring several small language models optimized for different tasks. Built with Gradio for an intuitive web interface and designed for local deployment.

## 🌟 Features

- **Multiple Model Support**: Choose from 4 specialized small language models
- **Lazy Loading**: Models are loaded only when selected, optimizing memory usage
- **Real-time Chat Interface**: Smooth conversational experience with Gradio
- **Lightweight**: All models are under 200M parameters for fast inference
- **Local Deployment**: Run entirely on your local machine

## 🤖 Available Models

### 1. SmolLM2 (135M Parameters)
- **Purpose**: General conversation and instruction following
- **Architecture**: HuggingFace SmolLM2-135M-Instruct
- **Best For**: General Q&A, creative writing, coding help
- **Language**: English

### 2. NanoLM-25M (25M Parameters)
- **Purpose**: Ultra-lightweight instruction following
- **Architecture**: Mistral-based with chat template support
- **Best For**: Quick responses, simple tasks, resource-constrained environments
- **Language**: English

### 3. NanoTranslator-S (9M Parameters)
- **Purpose**: English to Chinese translation
- **Architecture**: LLaMA-based translation model
- **Best For**: Translating English text to Chinese
- **Language**: English → Chinese

### 4. NanoTranslator-XL (78M Parameters)
- **Purpose**: Enhanced English to Chinese translation
- **Architecture**: LLaMA-based with improved accuracy
- **Best For**: High-quality English to Chinese translation
- **Language**: English → Chinese

## 🚀 Quick Start

### Prerequisites

- Python 3.8 or higher
- 4GB+ RAM recommended
- Internet connection for initial model downloads

### Installation

1. **Run the application**
   ```bash
   uv run app.py
   ```

2. **Open your browser**
   - Navigate to `http://localhost:7860`
   - Select a model and start chatting!


## 🎯 Use Cases

### General Conversation
- Use **SmolLM2** or **NanoLM-25M** for general chat, Q&A, and assistance

### Translation Tasks
- Use **NanoTranslator-S** for quick English→Chinese translations
- Use **NanoTranslator-XL** for higher quality English→Chinese translations

### Resource-Constrained Environments
- **NanoLM-25M** (25M params) for ultra-lightweight deployment
- **NanoTranslator-S** (9M params) for minimal translation needs

## 💡 Model Performance

| Model | Parameters | Use Case | Memory Usage | Speed |
|-------|------------|----------|--------------|-------|
| SmolLM2 | 135M | General Chat | ~500MB | Fast |
| NanoLM-25M | 25M | Lightweight Chat | ~100MB | Very Fast |
| NanoTranslator-S | 9M | Quick Translation | ~50MB | Very Fast |
| NanoTranslator-XL | 78M | Quality Translation | ~300MB | Fast |


### Model Sources
- SmolLM2: `HuggingFaceTB/SmolLM2-135M-Instruct`
- NanoLM-25M: `Mxode/NanoLM-25M-Instruct-v1.1`
- NanoTranslator-S: `Mxode/NanoTranslator-S`
- NanoTranslator-XL: `Mxode/NanoTranslator-XL`

## 📝 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🙏 Acknowledgments

- [HuggingFace](https://huggingface.co/) for the Transformers library and model hosting
- [Mxode](https://huggingface.co/Mxode) for the Nano series models
- [Gradio](https://gradio.app/) for the amazing web interface framework