--- title: Small Model Chatbot emoji: 😻 colorFrom: indigo colorTo: green sdk: gradio sdk_version: 5.31.0 app_file: app.py pinned: false license: mit short_description: Some small models chatbot --- ======= # Multi-Model Tiny Chatbot A lightweight, multi-model chat application featuring several small language models optimized for different tasks. Built with Gradio for an intuitive web interface and designed for local deployment. ## 🌟 Features - **Multiple Model Support**: Choose from 4 specialized small language models - **Lazy Loading**: Models are loaded only when selected, optimizing memory usage - **Real-time Chat Interface**: Smooth conversational experience with Gradio - **Lightweight**: All models are under 200M parameters for fast inference - **Local Deployment**: Run entirely on your local machine ## 🤖 Available Models ### 1. SmolLM2 (135M Parameters) - **Purpose**: General conversation and instruction following - **Architecture**: HuggingFace SmolLM2-135M-Instruct - **Best For**: General Q&A, creative writing, coding help - **Language**: English ### 2. NanoLM-25M (25M Parameters) - **Purpose**: Ultra-lightweight instruction following - **Architecture**: Mistral-based with chat template support - **Best For**: Quick responses, simple tasks, resource-constrained environments - **Language**: English ### 3. NanoTranslator-S (9M Parameters) - **Purpose**: English to Chinese translation - **Architecture**: LLaMA-based translation model - **Best For**: Translating English text to Chinese - **Language**: English → Chinese ### 4. NanoTranslator-XL (78M Parameters) - **Purpose**: Enhanced English to Chinese translation - **Architecture**: LLaMA-based with improved accuracy - **Best For**: High-quality English to Chinese translation - **Language**: English → Chinese ## 🚀 Quick Start ### Prerequisites - Python 3.8 or higher - 4GB+ RAM recommended - Internet connection for initial model downloads ### Installation 1. **Run the application** ```bash uv run app.py ``` 2. **Open your browser** - Navigate to `http://localhost:7860` - Select a model and start chatting! ## 🎯 Use Cases ### General Conversation - Use **SmolLM2** or **NanoLM-25M** for general chat, Q&A, and assistance ### Translation Tasks - Use **NanoTranslator-S** for quick English→Chinese translations - Use **NanoTranslator-XL** for higher quality English→Chinese translations ### Resource-Constrained Environments - **NanoLM-25M** (25M params) for ultra-lightweight deployment - **NanoTranslator-S** (9M params) for minimal translation needs ## 💡 Model Performance | Model | Parameters | Use Case | Memory Usage | Speed | |-------|------------|----------|--------------|-------| | SmolLM2 | 135M | General Chat | ~500MB | Fast | | NanoLM-25M | 25M | Lightweight Chat | ~100MB | Very Fast | | NanoTranslator-S | 9M | Quick Translation | ~50MB | Very Fast | | NanoTranslator-XL | 78M | Quality Translation | ~300MB | Fast | ### Model Sources - SmolLM2: `HuggingFaceTB/SmolLM2-135M-Instruct` - NanoLM-25M: `Mxode/NanoLM-25M-Instruct-v1.1` - NanoTranslator-S: `Mxode/NanoTranslator-S` - NanoTranslator-XL: `Mxode/NanoTranslator-XL` ## 📝 License This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details. ## 🙏 Acknowledgments - [HuggingFace](https://huggingface.co/) for the Transformers library and model hosting - [Mxode](https://huggingface.co/Mxode) for the Nano series models - [Gradio](https://gradio.app/) for the amazing web interface framework