Melih Özcan's picture

64

Melih Özcan

staycoolish

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

upvoted a paper 1 day ago

UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

upvoted a paper 1 day ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

View all activity

Organizations

None yet

staycoolish's activity

upvoted 4 papers 1 day ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 3 days ago • 28

UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models

Paper • 2503.08120 • Published 2 days ago • 26

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 3 days ago • 65

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 2 days ago • 52

upvoted 3 papers 2 days ago

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published 3 days ago • 62

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published 4 days ago • 20

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 3 days ago • 51

upvoted 2 papers 3 days ago

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 6 days ago • 26

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 6 days ago • 103

upvoted a paper 6 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 7 days ago • 83

upvoted 4 papers 8 days ago

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published 9 days ago • 14

Wikipedia in the Era of LLMs: Evolution and Risks

Paper • 2503.02879 • Published 9 days ago • 20

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published 9 days ago • 23

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published 9 days ago • 18

upvoted a paper 9 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 10 days ago • 64

upvoted 2 papers 10 days ago

Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids

Paper • 2502.20396 • Published 14 days ago • 12

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Paper • 2502.20900 • Published 13 days ago • 7

upvoted 3 papers 16 days ago

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published 18 days ago • 24

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published 18 days ago • 34

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 17 days ago • 67