Model Card for X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter
X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter is an LoRA adapter of DeepSeek-R1-Distill-Llama-8B trained by X-Boundary.
X-Boundary is a method to strike a balance between robust defense against multi-turn jailbreak attacks and the usability of Large Language Model (LLM) by establishing exact distinction boundary between safe and harmful representations.
Quick Start
from transformers import AutoModelForCausalLM, AutoTokenizer
base_model_name = 'deepseek-ai/DeepSeek-R1-Distill-Llama-8B'
adapter_name = 'Ursulalala/X_Boundary_DeepSeek_R1_Distill_Llama3_8B-adapter'
model = AutoModelForCausalLM.from_pretrained(
base_model_name,
torch_dtype='auto',
device_map='auto'
)
tokenizer = AutoTokenizer.from_pretrained(base_model_name)
model.load_adapter(adapter_name)
Framework versions