Seungju Han's picture

3 8

Seungju Han

Seungjuhan

·

https://seungjuhan.me

AI & ML interests

None yet

Organizations

Seungjuhan's activity

upvoted 2 papers 8 months ago

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Paper • 2406.18510 • Published Jun 26, 2024 • 9

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13

upvoted a paper over 1 year ago

CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos

Paper • 2303.09713 • Published Mar 17, 2023 • 1