Anomaly

anomalydavid

AI & ML interests

None yet

Recent Activity

liked a model 23 days ago

nvidia/parakeet-rnnt-0.6b

liked a Space 23 days ago

lllyasviel/iclight-v2

liked a model 23 days ago

NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant

Organizations

None yet

anomalydavid's activity

upvoted 4 collections 4 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 576

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated Jan 17 • 39

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 78

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 16 days ago • 560

upvoted 5 papers 9 months ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14, 2024 • 39

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Paper • 2406.11230 • Published Jun 17, 2024 • 34

HARE: HumAn pRiors, a key to small language model Efficiency

Paper • 2406.11410 • Published Jun 17, 2024 • 39

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Paper • 2406.12624 • Published Jun 18, 2024 • 37

upvoted 4 papers 10 months ago

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30, 2024 • 117

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 54

Nearest Neighbor Speculative Decoding for LLM Generation and Attribution

Paper • 2405.19325 • Published May 29, 2024 • 14

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 22