Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
32
1
39
Phil
phil111
Follow
Mi6paulino's profile picture
mondalsurojit's profile picture
tahamajs's profile picture
10 followers
·
12 following
AI & ML interests
None yet
Recent Activity
new
activity
13 days ago
mistralai/Mistral-Small-24B-Instruct-2501:
This Mistral Small has FAR less knowledge than the last.
liked
a model
25 days ago
deepseek-ai/DeepSeek-R1
new
activity
28 days ago
internlm/internlm3-8b-instruct:
English tests and tasks are absurdly overfit.
View all activity
Organizations
None yet
phil111
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
mistralai/Mistral-Small-24B-Instruct-2501
13 days ago
This Mistral Small has FAR less knowledge than the last.
20
#5 opened 15 days ago by
phil111
liked
a model
25 days ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
5 days ago
•
3.71M
•
•
8.83k
New activity in
internlm/internlm3-8b-instruct
28 days ago
English tests and tasks are absurdly overfit.
21
#8 opened 30 days ago by
phil111
New activity in
microsoft/phi-4
about 1 month ago
A heavily filtered corpus simply doesn't work.
4
#19 opened about 1 month ago by
phil111
I Don't Understand This Model
16
#9 opened about 1 month ago by
phil111
New activity in
matteogeniaccio/phi-4
about 2 months ago
Notably better than Phi3.5 in many ways, but something is wrong.
8
#5 opened about 2 months ago by
phil111
liked
a model
about 2 months ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
21 days ago
•
1.52M
•
•
3.41k
New activity in
deepseek-ai/DeepSeek-V3-Base
about 2 months ago
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened about 2 months ago by
phil111
liked
a model
about 2 months ago
deepseek-ai/DeepSeek-V3-Base
Updated
21 days ago
•
69.6k
•
1.55k
New activity in
NyxKrage/Microsoft_Phi-4
about 2 months ago
SimpleQA score
2
#1 opened 2 months ago by
frappuccino
New activity in
ibm-granite/granite-3.1-8b-instruct
about 2 months ago
Exceptional creative writer
5
#1 opened about 2 months ago by
SubtleOne
liked
2 models
about 2 months ago
ibm-granite/granite-3.1-8b-instruct
Text Generation
•
Updated
14 days ago
•
88.5k
•
146
QuantFactory/granite-3.1-8b-instruct-GGUF
Text Generation
•
Updated
Dec 19, 2024
•
663
•
7
New activity in
tiiuae/Falcon3-7B-Instruct
about 2 months ago
Very High English MMLU scores, Yet Extremely Low Broad English Knowledge
2
#8 opened about 2 months ago by
phil111
New activity in
CohereForAI/c4ai-command-r7b-12-2024
about 2 months ago
How was r7b?
6
#3 opened 2 months ago by
MRU4913
Add Qwen 2.5 7B & Tulu 3 8B results to OLLM benchmarks
12
#1 opened 2 months ago by
Fizzarolli
New activity in
meta-llama/Llama-3.3-70B-Instruct
2 months ago
local Llama + GPU(cuda)
7
#34 opened 2 months ago by
Luciolla
Base Model?
3
#32 opened 2 months ago by
User8213
New activity in
open-llm-leaderboard/open_llm_leaderboard
2 months ago
Add Hymba-1.5B to the leaderboard
3
#1030 opened 2 months ago by
pmolchanov
liked
a model
2 months ago
lmstudio-community/Llama-3.3-70B-Instruct-GGUF
Text Generation
•
Updated
Dec 6, 2024
•
38.3k
•
44
Load more