mirror of
https://github.com/The-Art-of-Hacking/h4cker
synced 2024-11-21 10:23:02 +00:00
Update ai_security_tools.md
This commit is contained in:
parent
1e7d9dd1f3
commit
5f144a640a
1 changed files with 3 additions and 0 deletions
|
@ -42,3 +42,6 @@ _Products that intercept prompts and responses and apply security or privacy rul
|
||||||
|
|
||||||
## AI Red Teaming Datasets
|
## AI Red Teaming Datasets
|
||||||
- [AttaQ Dataset](https://huggingface.co/datasets/ibm/AttaQ) - a red teaming dataset consisting of 1402 carefully crafted adversarial questions
|
- [AttaQ Dataset](https://huggingface.co/datasets/ibm/AttaQ) - a red teaming dataset consisting of 1402 carefully crafted adversarial questions
|
||||||
|
|
||||||
|
## AI Red Teaming Guidance
|
||||||
|
- [HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal](https://arxiv.org/pdf/2402.04249)
|
||||||
|
|
Loading…
Reference in a new issue