Ofir Zafrir

ofirzaf

AI & ML interests

Sparsity, Quantization, Model Compression

Recent Activity

upvoted an article 6 days ago

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

authored a paper 3 months ago

Q8BERT: Quantized 8Bit BERT

authored a paper 3 months ago

FastDraft: How to Train Your Draft

ofirzaf's activity

upvoted an article 6 days ago

Article

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

Mar 20, 2024

• 6

authored 2 papers 3 months ago

Q8BERT: Quantized 8Bit BERT

Paper • 1910.06188 • Published Oct 14, 2019 • 2

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10

upvoted a paper 3 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10

upvoted a paper 6 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 36

New activity in microsoft/Phi-3-mini-4k-instruct 9 months ago

Changed instruction/chat template

#54 opened 9 months ago by ofirzaf

published an article 11 months ago

Article

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

Mar 20, 2024

• 6

published an article about 1 year ago

Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30, 2024

• 9

authored a paper over 1 year ago

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

Paper • 2306.16601 • Published Jun 28, 2023 • 4

liked a Space over 1 year ago

12.4k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

updated 5 models over 2 years ago

updated 2 models almost 3 years ago

ofirzaf/bert-large-uncased-mnli

Text Classification • Updated May 9, 2022 • 6

ofirzaf/bert-large-uncased-squad

Question Answering • Updated Apr 26, 2022 • 6

updated a model about 3 years ago

Intel/bert-large-uncased-squadv1.1-sparse-90-unstructured

Question Answering • Updated Dec 5, 2021 • 87

updated 2 models over 3 years ago

Intel/bert-base-uncased-mnli-sparse-70-unstructured-no-classifier

Fill-Mask • Updated Jun 29, 2021 • 12

Intel/bert-base-uncased-sparse-1_2

Updated Jun 24, 2021 • 9