Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. β’ 5 items β’ Updated 5 days ago β’ 45
Reasoning Datasets Collection Distilled synthetic Reasoning datasets β’ 7 items β’ Updated 10 days ago β’ 50
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 3 items β’ Updated 16 days ago β’ 337
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 β’ 98
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper β’ 2501.05366 β’ Published Jan 9 β’ 92
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 28 days ago β’ 142
view article Article Mastering Tensor Dimensions in Transformers By not-lain β’ about 1 month ago β’ 43
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization β’ 16 items β’ Updated 13 days ago β’ 26
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ By merve β’ Aug 25, 2023 β’ 27
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data Paper β’ 2402.15343 β’ Published Feb 23, 2024 β’ 13
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated Dec 19, 2024 β’ 134
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 β’ 11 items β’ Updated 29 days ago β’ 71
OLMo 2 Collection Artifacts for the second set of OLMo models. β’ 22 items β’ Updated 1 day ago β’ 82