Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
daqc 's Collections
Dataset Best Practices
LRMs
Agents
Thinkers
Low-Resource Data
Reasoning LLMs
Multilingual
Read later
SLMs
Safety
Reinforcement
on-Device (phone)
Frameworks
Domain-specific

Safety

updated 15 days ago
Upvote
-

  • Agent-SafetyBench: Evaluating the Safety of LLM Agents

    Paper • 2412.14470 • Published Dec 19, 2024 • 13

  • nvidia/Aegis-AI-Content-Safety-Dataset-2.0

    Viewer • Updated Jun 9 • 33.4k • 2.61k • 44
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs