
metadata

title: README
emoji: 📚
colorFrom: purple
colorTo: blue
sdk: static
pinned: false

Welcome to the llmware HuggingFace page. We believe that the ascendance of LLMs creates a major new application pattern, along with new data pipelines, that will be transformative in the enterprise, especially in knowledge-intensive industries. Our open source research efforts focus both on the new "ware" (the "middleware" and "software" that will wrap and integrate LLMs) and on building high-quality, automation-focused small specialized language models for enterprise Agent, RAG and embedding use cases.

Our model training initiatives fall into four major categories:

- SLIMs - small, specialized function-calling models for stacking in multi-model, Agent-based workflows
- BLING/DRAGON - highly accurate fact-based question-answering models (see SMALL MODEL ACCURACY BENCHMARK | OUR JOURNEY BUILDING ACCURATE ENTERPRISE SMALL MODELS)
- Industry-BERT - industry fine-tuned embedding models
- Private Inference - self-hosting, packaging and quantization - GGUF, ONNX, OpenVINO

Please check out a few of our recent blog posts related to these initiatives:
THINKING DOES NOT HAPPEN ONE TOKEN AT A TIME | RAG-INSTRUCT-TEST-DATASET | LLMWARE EMERGING STACK | BECOMING A MASTER FINETUNING CHEF

Interested? Join us on Discord