Zack Li's picture

Zack Li PRO

zackli4ai

·

AI & ML interests

LLM, on-device AI

Recent Activity

updated a model 6 days ago

nexa-collaboration/output_llama3.1_8b_instruct_torchao_sparsity0.7

updated a model 6 days ago

nexa-collaboration/output_llama3.1_8b_instruct_torchao_sparsity0.2

updated a model 6 days ago

nexa-collaboration/output_llama3.1_8b_instruct_torchao_sparsity0.3

View all activity

Organizations

zackli4ai's activity

upvoted a paper 20 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 23 days ago • 318

upvoted a paper about 2 months ago

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 41

upvoted a collection 4 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted a paper 6 months ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 43

upvoted a paper 8 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

upvoted a paper 10 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

upvoted an article 10 months ago

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 41

upvoted a paper 10 months ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30, 2024 • 117

upvoted 2 papers 11 months ago

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2, 2024 • 57

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126