Models & datasets from the paper "Tamper-Resistant Safeguards for Open-Weight LLMs" (https://arxiv.org/pdf/2408.00761)
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Collections
1
models
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/631d279aa31a0a462ca46913/fvNAiVoio4fYrH6MH_ogd.png)
lapisrocks/Llama-3-8B-Instruct-TAR-Bio-v2
Updated
ā¢
1.02k
![](https://cdn-avatars.huggingface.co/v1/production/uploads/631d279aa31a0a462ca46913/fvNAiVoio4fYrH6MH_ogd.png)
lapisrocks/Llama-3-8B-Instruct-TAR-Refusal
Text Generation
ā¢
Updated
ā¢
120
![](https://cdn-avatars.huggingface.co/v1/production/uploads/631d279aa31a0a462ca46913/fvNAiVoio4fYrH6MH_ogd.png)
lapisrocks/Llama-3-8B-Instruct-TAR-Bio
Text Generation
ā¢
Updated
ā¢
31
![](https://cdn-avatars.huggingface.co/v1/production/uploads/631d279aa31a0a462ca46913/fvNAiVoio4fYrH6MH_ogd.png)
lapisrocks/Llama-3-8B-Instruct-Random-Mapped-Cyber
Text Generation
ā¢
Updated
ā¢
28
![](https://cdn-avatars.huggingface.co/v1/production/uploads/631d279aa31a0a462ca46913/fvNAiVoio4fYrH6MH_ogd.png)
lapisrocks/Llama-3-8B-Instruct-Random-Mapped-Bio
Text Generation
ā¢
Updated
ā¢
159