Active Research — AI Security & Jailbreaking

AI Security Lab Where models break. Prompt Injection · Jailbreaking · Input Infection · Adversarial ML

Rigorous, documented research into AI security threats — from prompt injection and jailbreaking to adversarial machine learning and LLM governance. All findings include ethical boundaries, official references, and defensive recommendations.

Read Research ← Main Site

monitoring

Prompt Injection CRITICAL

Jailbreaking (DAN / GPT4) HIGH

Input Infection RESEARCH

Training Data Poisoning HIGH

Model Inversion RESEARCH

EU AI Act Compliance ACTIVE

Agentic AI Threats EMERGING

Research Areas

What This Lab Covers

Seven active research streams — each documented with methodology, examples, and defensive recommendations.

01 // PROMPT INJECTION

Prompt Injection Attacks

Direct and indirect prompt injection — how attackers hijack LLM behaviour through carefully crafted inputs.

LLMInjectionOWASP LLM01

02 // JAILBREAKING

Jailbreaking Techniques

DAN prompts, roleplay exploits, and systematic bypass techniques — documented for defensive research.

DANGPT-4ClaudeGemini

03 // INPUT INFECTION

Input Infection Research

How malicious inputs propagate through AI pipelines — infecting outputs, downstream systems, and agentic workflows.

AgentsRAGPipeline

04 // ADVERSARIAL ML

Adversarial Machine Learning

Model inversion, membership inference, training data extraction, and adversarial examples.

Model InversionExtractionMITRE ATLAS

05 // DATA POISONING

Training Data Poisoning

How attackers corrupt training datasets to embed backdoors, biases, and persistent vulnerabilities.

BackdoorSupply ChainDataset

06 // AI GOVERNANCE

AI Act & Governance

EU AI Act compliance, NIST AI RMF, and the regulatory landscape shaping AI security obligations.

EU AI ActNIST AI RMFISO 42001

Published Research

Latest Articles

AI SECURITY

AI Jailbreaking & Prompt Injection: From Basics to Advanced Techniques

A comprehensive technical guide to AI jailbreaking and prompt injection — covering how these attacks work, real documented techniques, input infection, and how to defend against them.

jailbreakingprompt-injection