Update ai_security_tools.md

2024-11-23 19:33:02 +00:00 · 2024-09-22 00:46:59 -04:00 · 5f144a640a
commit 5f144a640a
parent 1e7d9dd1f3
1 changed file with 3 additions and 0 deletions
--- a/ai_research/ai_security_tools.md
+++ b/ai_research/ai_security_tools.md
@@ -42,3 +42,6 @@ _Products that intercept prompts and responses and apply security or privacy rul

 ## AI Red Teaming Datasets
 - [AttaQ Dataset](https://huggingface.co/datasets/ibm/AttaQ) - a red teaming dataset consisting of 1402 carefully crafted adversarial questions
+
+## AI Red Teaming Guidance
+- [HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal](https://arxiv.org/pdf/2402.04249)