Spaces:
Running
Running
title: Small Model Chatbot | |
emoji: 😻 | |
colorFrom: indigo | |
colorTo: green | |
sdk: gradio | |
sdk_version: 5.31.0 | |
app_file: app.py | |
pinned: false | |
license: mit | |
short_description: Some small models chatbot | |
======= | |
# Multi-Model Tiny Chatbot | |
A lightweight, multi-model chat application featuring several small language models optimized for different tasks. Built with Gradio for an intuitive web interface and designed for local deployment. | |
## 🌟 Features | |
- **Multiple Model Support**: Choose from 4 specialized small language models | |
- **Lazy Loading**: Models are loaded only when selected, optimizing memory usage | |
- **Real-time Chat Interface**: Smooth conversational experience with Gradio | |
- **Lightweight**: All models are under 200M parameters for fast inference | |
- **Local Deployment**: Run entirely on your local machine | |
## 🤖 Available Models | |
### 1. SmolLM2 (135M Parameters) | |
- **Purpose**: General conversation and instruction following | |
- **Architecture**: HuggingFace SmolLM2-135M-Instruct | |
- **Best For**: General Q&A, creative writing, coding help | |
- **Language**: English | |
### 2. NanoLM-25M (25M Parameters) | |
- **Purpose**: Ultra-lightweight instruction following | |
- **Architecture**: Mistral-based with chat template support | |
- **Best For**: Quick responses, simple tasks, resource-constrained environments | |
- **Language**: English | |
### 3. NanoTranslator-S (9M Parameters) | |
- **Purpose**: English to Chinese translation | |
- **Architecture**: LLaMA-based translation model | |
- **Best For**: Translating English text to Chinese | |
- **Language**: English → Chinese | |
### 4. NanoTranslator-XL (78M Parameters) | |
- **Purpose**: Enhanced English to Chinese translation | |
- **Architecture**: LLaMA-based with improved accuracy | |
- **Best For**: High-quality English to Chinese translation | |
- **Language**: English → Chinese | |
## 🚀 Quick Start | |
### Prerequisites | |
- Python 3.8 or higher | |
- 4GB+ RAM recommended | |
- Internet connection for initial model downloads | |
### Installation | |
1. **Run the application** | |
```bash | |
uv run app.py | |
``` | |
2. **Open your browser** | |
- Navigate to `http://localhost:7860` | |
- Select a model and start chatting! | |
## 🎯 Use Cases | |
### General Conversation | |
- Use **SmolLM2** or **NanoLM-25M** for general chat, Q&A, and assistance | |
### Translation Tasks | |
- Use **NanoTranslator-S** for quick English→Chinese translations | |
- Use **NanoTranslator-XL** for higher quality English→Chinese translations | |
### Resource-Constrained Environments | |
- **NanoLM-25M** (25M params) for ultra-lightweight deployment | |
- **NanoTranslator-S** (9M params) for minimal translation needs | |
## 💡 Model Performance | |
| Model | Parameters | Use Case | Memory Usage | Speed | | |
|-------|------------|----------|--------------|-------| | |
| SmolLM2 | 135M | General Chat | ~500MB | Fast | | |
| NanoLM-25M | 25M | Lightweight Chat | ~100MB | Very Fast | | |
| NanoTranslator-S | 9M | Quick Translation | ~50MB | Very Fast | | |
| NanoTranslator-XL | 78M | Quality Translation | ~300MB | Fast | | |
### Model Sources | |
- SmolLM2: `HuggingFaceTB/SmolLM2-135M-Instruct` | |
- NanoLM-25M: `Mxode/NanoLM-25M-Instruct-v1.1` | |
- NanoTranslator-S: `Mxode/NanoTranslator-S` | |
- NanoTranslator-XL: `Mxode/NanoTranslator-XL` | |
## 📝 License | |
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details. | |
## 🙏 Acknowledgments | |
- [HuggingFace](https://huggingface.co/) for the Transformers library and model hosting | |
- [Mxode](https://huggingface.co/Mxode) for the Nano series models | |
- [Gradio](https://gradio.app/) for the amazing web interface framework | |