Text Classification
Transformers
Safetensors
distilbert
Inference Endpoints
Al-Chan's picture
Create README.md
293ab0a verified
metadata
datasets:
  - davanstrien/aart-ai-safety-dataset
  - obalcells/advbench
  - databricks/databricks-dolly-15k

Malicious & Jailbreaking Prompt Classifer

Datasets Used

MaliciousInstruct

AART

StrongREJECT

DAN

AdvBench

Databricks-Dolly