DeepMount00's picture
Update README.md
a174bbb verified
|
raw
history blame
4.1 kB

๐Ÿค– Alireo-400M Model Card

    <h2>๐Ÿ“ Model Description</h2>
    <p>Alireo-400M is a lightweight yet powerful Italian language model with 400M parameters, designed to provide efficient natural language processing capabilities while maintaining a smaller footprint compared to larger models.</p>
    
    <h2>โœจ Key Features</h2>
    <div class="features-list">
        <ul>
            <li>๐Ÿ—๏ธ <strong>Architecture:</strong> Transformer-based language model</li>
            <li>๐Ÿ“Š <strong>Parameters:</strong> 400M</li>
            <li>๐ŸชŸ <strong>Context Window:</strong> 8K tokens</li>
            <li>๐Ÿ“š <strong>Training Data:</strong> Curated Italian text corpus (books, articles, web content)</li>
            <li>๐Ÿ’พ <strong>Model Size:</strong> ~800MB</li>
        </ul>
    </div>
    
    <h2>๐Ÿ“ˆ Performance</h2>
    <div class="performance">
        <p>Despite its compact size, Alireo-400M demonstrates impressive performance:</p>
        <ul>
            <li>๐Ÿ† Outperforms Qwen 0.5B across multiple benchmarks</li>
            <li>๐ŸŽฏ Maintains high accuracy in Italian language understanding tasks</li>
            <li>โšก Efficient inference speed due to optimized architecture</li>
        </ul>
    </div>
    
    <h2>โš ๏ธ Limitations</h2>
    <div class="limitations">
        <ul>
            <li>Limited context window compared to larger models</li>
            <li>May struggle with highly specialized technical content</li>
            <li>Performance may vary on dialectal variations</li>
            <li>Not suitable for multilingual tasks</li>
        </ul>
    </div>
    
    <h2>๐Ÿ’ป Hardware Requirements</h2>
    <div class="requirements">
        <ul>
            <li>๐ŸŽฎ <strong>Minimum RAM:</strong> 2GB</li>
            <li>๐Ÿ’ช <strong>Recommended RAM:</strong> 4GB</li>
            <li>๐ŸŽจ <strong>GPU:</strong> Optional, but recommended for faster inference</li>
            <li>๐Ÿ’ฟ <strong>Disk Space:</strong> ~1GB (including model and dependencies)</li>
        </ul>
    </div>
    
    <h2>๐Ÿ“œ License</h2>
    <p>Apache 2.0</p>
    
    <h2>๐Ÿ“„ Citation</h2>
    <div class="citation">@software{alireo2024,

author = {[Michele Montebovi]}, title = {Alireo-400M: A Lightweight Italian Language Model}, year = {2024}, }