Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
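A minimal evasion-attack sketch using ART's documented FastGradientMethod API; the toy linear model and random inputs below are placeholders for illustration, not part of ART:

```python
import numpy as np
import torch
import torch.nn as nn
from art.estimators.classification import PyTorchClassifier
from art.attacks.evasion import FastGradientMethod

# Toy MNIST-shaped classifier; in practice this would be a trained model.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
classifier = PyTorchClassifier(
    model=model,
    loss=nn.CrossEntropyLoss(),
    optimizer=torch.optim.Adam(model.parameters(), lr=1e-3),
    input_shape=(1, 28, 28),
    nb_classes=10,
    clip_values=(0.0, 1.0),
)

x = np.random.rand(8, 1, 28, 28).astype(np.float32)  # stand-in test inputs
attack = FastGradientMethod(estimator=classifier, eps=0.1)
x_adv = attack.generate(x=x)  # adversarial counterparts of x
```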
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
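A short sketch of TextAttack's recipe-based workflow, here with the TextFooler recipe; the HuggingFace model name and the five-example budget are illustrative choices, assuming textattack 0.3+:

```python
import transformers
from textattack import Attacker, AttackArgs
from textattack.attack_recipes import TextFoolerJin2019
from textattack.datasets import HuggingFaceDataset
from textattack.models.wrappers import HuggingFaceModelWrapper

# Wrap a HuggingFace sentiment classifier so TextAttack can query it.
name = "textattack/bert-base-uncased-imdb"
model = transformers.AutoModelForSequenceClassification.from_pretrained(name)
tokenizer = transformers.AutoTokenizer.from_pretrained(name)
wrapper = HuggingFaceModelWrapper(model, tokenizer)

attack = TextFoolerJin2019.build(wrapper)          # prebuilt attack recipe
dataset = HuggingFaceDataset("imdb", split="test")
attacker = Attacker(attack, dataset, AttackArgs(num_examples=5))
attacker.attack_dataset()                          # prints per-example results
```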
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
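This entry matches Foolbox's tagline; a minimal PGD sketch against a pretrained torchvision model, assuming foolbox 3.x and its bundled ImageNet sample batch:

```python
import torchvision.models as models
import foolbox as fb

# Pretrained classifier wrapped with the input bounds Foolbox expects.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
preprocessing = dict(mean=[0.485, 0.456, 0.406],
                     std=[0.229, 0.224, 0.225], axis=-3)
fmodel = fb.PyTorchModel(model, bounds=(0, 1), preprocessing=preprocessing)

# Foolbox ships a small ImageNet sample batch for quick experiments.
images, labels = fb.utils.samples(fmodel, dataset="imagenet", batchsize=4)
attack = fb.attacks.LinfPGD()
raw, clipped, is_adv = attack(fmodel, images, labels, epsilons=8 / 255)
```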
A unified evaluation framework for large language models
PyTorch implementation of adversarial attacks [torchattacks]
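Torchattacks exposes each attack as a callable object; a minimal PGD sketch, where the untrained linear model and random CIFAR-10-shaped batch are placeholders:

```python
import torch
import torch.nn as nn
import torchattacks

# Toy classifier; a trained model would be used in practice.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
images = torch.rand(4, 3, 32, 32)        # inputs expected in [0, 1]
labels = torch.randint(0, 10, (4,))

atk = torchattacks.PGD(model, eps=8 / 255, alpha=2 / 255, steps=10)
adv_images = atk(images, labels)         # perturbed batch, same shape as images
```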
Must-read Papers on Textual Adversarial Attack and Defense
A PyTorch adversarial library for attack and defense methods on images and graphs
A collection of anomaly detection methods (i.i.d./point-based, graph, and time series), including active learning for anomaly detection/discovery, Bayesian rule-mining, and description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convol…
An Open-Source Package for Textual Adversarial Attack.
Code accompanying the paper "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
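This is the AutoAttack repository; a brief sketch of its standard evaluation entry point, assuming a trained classifier `model` and tensors `x_test`, `y_test` (all placeholders here):

```python
import torch
import torch.nn as nn
from autoattack import AutoAttack

# Placeholders; substitute a trained model and a real test set.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10)).eval()
x_test = torch.rand(16, 3, 32, 32)
y_test = torch.randint(0, 10, (16,))

adversary = AutoAttack(model, norm="Linf", eps=8 / 255, version="standard")
x_adv = adversary.run_standard_evaluation(x_test, y_test, bs=16)
```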
A Harder ImageNet Test Set (CVPR 2021)
A Model for Natural Language Attack on Text Classification and Inference
A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.
Implementation of Papers on Adversarial Examples
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
Implementation of the KDD 2020 paper "Graph Structure Learning for Robust Graph Neural Networks"
TrojanZoo provides a universal PyTorch platform for conducting security research (especially backdoor attacks/defenses) on image classification in deep learning.
A simple PyTorch implementation of FGSM and I-FGSM
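For reference, both attacks fit in a few lines of plain PyTorch; a minimal sketch (function names and the alpha default are this sketch's choices, not taken from the repo):

```python
import torch
import torch.nn.functional as F

def fgsm(model, x, y, eps):
    """One-step FGSM: x_adv = x + eps * sign(grad_x loss(model(x), y))."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    grad = torch.autograd.grad(loss, x)[0]
    return (x + eps * grad.sign()).clamp(0, 1).detach()

def ifgsm(model, x, y, eps, alpha=None, steps=10):
    """I-FGSM: repeated small FGSM steps, projected into the eps-ball around x."""
    alpha = alpha if alpha is not None else eps / steps
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv = fgsm(model, x_adv, y, alpha)
        # Keep the iterate within the L-inf ball of radius eps around x.
        x_adv = torch.max(torch.min(x_adv, x + eps), x - eps).clamp(0, 1)
    return x_adv
```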