Home Knowledge Base Data Poisoning

Data Poisoning is the adversarial attack that corrupts machine learning models by injecting malicious examples into training data — exploiting the fundamental dependence of ML systems on training data integrity to degrade model performance, embed backdoors, or manipulate predictions toward attacker-specified targets, without requiring access to the model itself during deployment.

What Is Data Poisoning?

Types of Data Poisoning Attacks

Availability Attacks (Denial of Service):

Integrity Attacks (Targeted):

Backdoor Attacks:

Poisoning in Specific Settings

Web-Scraped Pre-training Data:

Federated Learning Poisoning:

LLM Training Data Poisoning:

Detection and Defense

Data Sanitization:

Certified Defenses:

Data Provenance:

Poisoning Resistance through Architecture:

Data poisoning is the training-time attack that corrupts AI at its foundation — while adversarial examples require attacker access at inference time, data poisoning requires only the ability to influence what data enters the training pipeline, making it a realistic threat for any organization relying on internet-scraped, crowdsourced, or federated training data without cryptographic integrity verification.

data poisoningtrainingmalicious

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.