๐ค Alireo-400M Model Card
<h2>๐ Model Description</h2>
<p>Alireo-400M is a lightweight yet powerful Italian language model with 400M parameters, designed to provide efficient natural language processing capabilities while maintaining a smaller footprint compared to larger models.</p>
<h2>โจ Key Features</h2>
<div class="features-list">
<ul>
<li>๐๏ธ <strong>Architecture:</strong> Transformer-based language model</li>
<li>๐ <strong>Parameters:</strong> 400M</li>
<li>๐ช <strong>Context Window:</strong> 8K tokens</li>
<li>๐ <strong>Training Data:</strong> Curated Italian text corpus (books, articles, web content)</li>
<li>๐พ <strong>Model Size:</strong> ~800MB</li>
</ul>
</div>
<h2>๐ Performance</h2>
<div class="performance">
<p>Despite its compact size, Alireo-400M demonstrates impressive performance:</p>
<ul>
<li>๐ Outperforms Qwen 0.5B across multiple benchmarks</li>
<li>๐ฏ Maintains high accuracy in Italian language understanding tasks</li>
<li>โก Efficient inference speed due to optimized architecture</li>
</ul>
</div>
<h2>โ ๏ธ Limitations</h2>
<div class="limitations">
<ul>
<li>Limited context window compared to larger models</li>
<li>May struggle with highly specialized technical content</li>
<li>Performance may vary on dialectal variations</li>
<li>Not suitable for multilingual tasks</li>
</ul>
</div>
<h2>๐ป Hardware Requirements</h2>
<div class="requirements">
<ul>
<li>๐ฎ <strong>Minimum RAM:</strong> 2GB</li>
<li>๐ช <strong>Recommended RAM:</strong> 4GB</li>
<li>๐จ <strong>GPU:</strong> Optional, but recommended for faster inference</li>
<li>๐ฟ <strong>Disk Space:</strong> ~1GB (including model and dependencies)</li>
</ul>
</div>
<h2>๐ License</h2>
<p>Apache 2.0</p>
<h2>๐ Citation</h2>
<div class="citation">@software{alireo2024,
author = {[Michele Montebovi]}, title = {Alireo-400M: A Lightweight Italian Language Model}, year = {2024}, }