Zhexin Zhang's picture

4 3 1

Zhexin Zhang

nonstopfor

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

commented on a paper 15 days ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

published a model 22 days ago

thu-coai/ShieldAgent

View all activity

Organizations

nonstopfor's activity

commented a paper 15 days ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published 18 days ago • 5 •

New activity in thu-coai/AISafetyLab_Datasets 3 months ago

Upload 6 files

#2 opened 3 months ago by

yangjunxiao2021

commented a paper 3 months ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 12 •

commented a paper 8 months ago

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 13 •