Phil

phil111

AI & ML interests

None yet

Organizations

None yet

phil111's activity

New activity in mistralai/Mistral-Small-24B-Instruct-2501 13 days ago

This Mistral Small has FAR less knowledge than the last.

#5 opened 15 days ago by

liked a model 25 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 3.71M • 8.83k

New activity in internlm/internlm3-8b-instruct 28 days ago

English tests and tasks are absurdly overfit.

#8 opened 30 days ago by

New activity in microsoft/phi-4 about 1 month ago

A heavily filtered corpus simply doesn't work.

#19 opened about 1 month ago by

I Don't Understand This Model

#9 opened about 1 month ago by

New activity in matteogeniaccio/phi-4 about 2 months ago

Notably better than Phi3.5 in many ways, but something is wrong.

#5 opened about 2 months ago by

liked a model about 2 months ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 21 days ago • 1.52M • 3.41k

New activity in deepseek-ai/DeepSeek-V3-Base about 2 months ago

Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.

#27 opened about 2 months ago by

liked a model about 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 21 days ago • 69.6k • 1.55k

New activity in NyxKrage/Microsoft_Phi-4 about 2 months ago

SimpleQA score

#1 opened 2 months ago by

New activity in ibm-granite/granite-3.1-8b-instruct about 2 months ago

Exceptional creative writer

#1 opened about 2 months ago by

liked 2 models about 2 months ago

ibm-granite/granite-3.1-8b-instruct

Text Generation • Updated 14 days ago • 88.5k • 146

QuantFactory/granite-3.1-8b-instruct-GGUF

Text Generation • Updated Dec 19, 2024 • 663 • 7

New activity in tiiuae/Falcon3-7B-Instruct about 2 months ago

Very High English MMLU scores, Yet Extremely Low Broad English Knowledge

#8 opened about 2 months ago by

New activity in CohereForAI/c4ai-command-r7b-12-2024 about 2 months ago

How was r7b?

#3 opened 2 months ago by

Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks

#1 opened 2 months ago by

New activity in meta-llama/Llama-3.3-70B-Instruct 2 months ago

local Llama + GPU(cuda)

#34 opened 2 months ago by

Base Model?

#32 opened 2 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 2 months ago

Add Hymba-1.5B to the leaderboard

#1030 opened 2 months ago by

liked a model 2 months ago

lmstudio-community/Llama-3.3-70B-Instruct-GGUF

Text Generation • Updated Dec 6, 2024 • 38.3k • 44