David Stanojevic's picture

2

David Stanojevic

david-stan

·

david-stan

AI & ML interests

None yet

Recent Activity

updated a model 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

published a model 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

upvoted a paper 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

View all activity

Organizations

updated a model 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

Text Generation • 8B • Updated Nov 6, 2025 • 5 • 1

published a model 2 months ago

JetBrains-Research/premia-nes-7B-unsloth-mixed-v9-zeta-prompt

Text Generation • 8B • Updated Nov 6, 2025 • 5 • 1

upvoted a paper 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27, 2025 • 20

upvoted a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 37

updated a model 11 months ago

david-stan/roberta-large-lora-class

Updated Feb 15, 2025

published a model 11 months ago

david-stan/roberta-large-lora-class

Updated Feb 15, 2025