Zikang Shan's picture

Zikang Shan

zkshan2002
·

AI & ML interests

Reinforcement Learning

Recent Activity

published a model 1 day ago
RTO-RL/Llama3-8B-TDPO
updated a model 1 day ago
RTO-RL/Llama3-8B-TDPO
published a model 1 day ago
RTO-RL/Llama3-8B-SimPO
View all activity

Organizations

Reinforced Token Optimization's profile picture