view article Article Agentic RAG Stack (3/5) - Generate responses using a SmolLM By davidberenstein1957 • 6 days ago • 6
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 16 days ago • 16
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • 7 days ago • 7
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 20 days ago • 62
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 28 days ago • 142
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 138
GLiREL -- Generalist Model for Zero-Shot Relation Extraction Paper • 2501.03172 • Published Jan 6 • 1
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 18
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated Dec 18, 2024 • 53
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9, 2024 • 42
Positions Datasets Collection Datasets where each row is a chess position • 4 items • Updated Jan 9 • 6
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 29
Tucano Collection Tucano is a series of decoder-transformers based on the Llama 2 architecture, natively pre-trained in Portuguese. • 17 items • Updated Nov 13, 2024 • 2
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 145
LLM2Encoder Collection Collection of initial models and models that use converted decoders to encoders as backbones • 11 items • Updated Sep 10, 2024 • 6
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10, 2024 • 13