Update ai_security_tools.md

This commit is contained in:
Omar Santos 2024-09-22 00:46:59 -04:00 committed by GitHub
parent 1e7d9dd1f3
commit 5f144a640a
No known key found for this signature in database
GPG key ID: B5690EEEBB952194

View file

@ -42,3 +42,6 @@ _Products that intercept prompts and responses and apply security or privacy rul
## AI Red Teaming Datasets ## AI Red Teaming Datasets
- [AttaQ Dataset](https://huggingface.co/datasets/ibm/AttaQ) - a red teaming dataset consisting of 1402 carefully crafted adversarial questions - [AttaQ Dataset](https://huggingface.co/datasets/ibm/AttaQ) - a red teaming dataset consisting of 1402 carefully crafted adversarial questions
## AI Red Teaming Guidance
- [HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal](https://arxiv.org/pdf/2402.04249)