Aritra Dutta's picture

Aritra Dutta

dutta18

·

https://vpnleaderboard.com/

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

lmms-lab/LongVA-7B:TypeError: unsupported operand type(s) for //: 'int' and 'NoneType' while calling the processor

liked a model 3 days ago

mPLUG/mPLUG-Owl3-7B-240728

new activity 3 days ago

mPLUG/mPLUG-Owl3-7B-240728:KeyError: None for when the model.generate() method is executed in mPlugOwl3

View all activity

Organizations

New activity in lmms-lab/LongVA-7B 3 days ago

TypeError: unsupported operand type(s) for //: 'int' and 'NoneType' while calling the processor

#1 opened 3 days ago by

liked a model 3 days ago

mPLUG/mPLUG-Owl3-7B-240728

Image-Text-to-Text • 8B • Updated Sep 29, 2024 • 532 • 43

New activity in mPLUG/mPLUG-Owl3-7B-240728 3 days ago

KeyError: None for when the model.generate() method is executed in mPlugOwl3

#8 opened 3 days ago by

New activity in aws-prototyping/long-llava-qwen2-7b 5 days ago

Model's processor have small bug in the integer division.

#2 opened 5 days ago by

liked a model 5 days ago

aws-prototyping/long-llava-qwen2-7b

8B • Updated Dec 5, 2024 • 275 • 11

New activity in aws-prototyping/long-llava-qwen2-7b 5 days ago

How to use this model for inference on videos?

#1 opened 5 days ago by

liked 2 models 5 days ago

LanguageBind/Video-LLaVA-7B-hf

Image-to-Text • 7B • Updated May 16, 2024 • 7.13k • 50

LanguageBind/Video-LLaVA-7B

Text Generation • 7B • Updated Apr 9, 2024 • 3.15k • 89

upvoted an article 20 days ago

Article

Running Large Transformer Models on Mobile and Edge Devices

Nov 3, 2025

•

13

liked 3 models about 2 months ago

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 3 days ago • 2.39k • 101

mistralai/Ministral-3-3B-Instruct-2512-BF16

4B • Updated 14 days ago • 10.5k • 19

mistralai/Ministral-3-3B-Base-2512

4B • Updated 14 days ago • 10.9k • 53

upvoted an article about 2 months ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

Feb 17, 2025

•

28

New activity in HuggingFaceTB/SmolVLM-256M-Instruct about 2 months ago

can not believe, but seems 256M is slower then internvl-1B ?

#25 opened 3 months ago by

updated a Space 2 months ago

Trackio

Display tracking information

published a Space 2 months ago

Trackio

Display tracking information

upvoted an article 2 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

274

updated a dataset 2 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Nov 23, 2025 • 23.7k • 162

published a dataset 2 months ago

dutta18/Quantity-Reasoning-VQA-23K

Viewer • Updated Nov 23, 2025 • 23.7k • 162

upvoted an article 2 months ago

Article

Preference Optimization for Vision Language Models

+2

Jul 10, 2024

•

93