Research · Private

llm-attack-atlas

A structured research corpus of documented LLM attack techniques, built for analytical queries about category frequency, target-system architecture, and where defensive boundaries need to sit.

Status v1.5 · repository private during research phase

A structured research corpus of documented LLM attack techniques across the OWASP LLM Top 10, vendor red-team disclosures, and arXiv research papers. The corpus supports analytical queries about category frequency, target-system architecture, and where defensive boundaries need to sit in modern AI system architecture.

Design principle

Safety-by-default schema design — every new entry defaults to quarantine, status changes route through an audit trail enforced at the database level, and extraction precision is measured via Wilson confidence intervals against operator-validated ground truth.

Status

v1.5 complete: 34 corpus entries across 3 sources, two validation runs with 100% precision on technique extraction (95% CI lower bound 83.9%). v2 in planning. Repository private during the research phase.