Zikang Shan's picture

Zikang Shan

zkshan2002
·

AI & ML interests

Reinforcement Learning

Recent Activity

published a model 2 days ago
RTO-RL/Llama3-8B-TDPO
updated a model 2 days ago
RTO-RL/Llama3-8B-TDPO
published a model 2 days ago
RTO-RL/Llama3-8B-SimPO
View all activity

Organizations

Reinforced Token Optimization's profile picture