Merge pull request #1260 from HackTricks-wiki/update_AIxCC_finals__Tale_of_the_tape_20250808_124240

AIxCC finals Tale of the tape
Commit c394e547f1 by SirBroccoli, 2025-08-13 16:02:42 +02:00 (committed via GitHub)
3 changed files with 164 additions and 0 deletions

AI/AI-Assisted-Fuzzing-and-Vulnerability-Discovery.md (new file)
@@ -0,0 +1,157 @@
# AI-Assisted Fuzzing & Automated Vulnerability Discovery
{{#include ../banners/hacktricks-training.md}}
## Overview
Large language models (LLMs) can super-charge traditional vulnerability-research pipelines by generating semantically rich inputs, evolving grammars, reasoning over crash data, and even proposing multi-bug patches. This page collects the most effective patterns observed during DARPA's AI Cyber Challenge (AIxCC) finals and other public research.
What follows is not a description of one specific competition system, but an abstraction of the techniques so you can reproduce them in your own workflows.
---
## 1. LLM-Generated Seed Inputs
Traditional coverage-guided fuzzers (AFL++, libFuzzer, Honggfuzz…) start with a small corpus of seeds and mutate bytes blindly. When the target input format is complex (SQL, URLs, custom binary protocols), random mutations usually break the syntax before interesting branches are reached.
LLMs can solve this bootstrap problem by emitting *seed generators*: short scripts that output **syntax-correct but security-relevant inputs**. For example:
```prompt
SYSTEM: You are a helpful security engineer.
USER:
Write a Python3 program that prints 200 unique SQL injection strings targeting common anti-pattern mistakes (missing quotes, numeric context, stacked queries). Ensure length ≤ 256 bytes / string so they survive common length limits.
```
```python
# gen_sqli_seeds.py (truncated)
PAYLOADS = [
    "1 OR 1=1 -- ",
    "' UNION SELECT NULL,NULL--",
    "0; DROP TABLE users;--",
    ...  # remaining payloads truncated
]

for p in PAYLOADS:
    print(p)
```
Run once and feed the output directly into the fuzzer's initial corpus:
```bash
python3 gen_sqli_seeds.py > seeds.txt
afl-fuzz -i seeds.txt -o findings/ -- ./target @@
```
Benefits:
1. Semantic validity → deeper coverage early.
2. Re-generatable: tweak the prompt to focus on XSS, path traversal, binary blobs, etc.
3. Cheap (< 1 ¢ with GPT-3.5).
### Tips
* Instruct the model to *diversify* payload length and encoding (UTF-8, URL-encoded, UTF-16-LE) to bypass superficial filters, as sketched below.
* Ask for a *single self-contained script*; this avoids JSON formatting hiccups.
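A minimal sketch of the encoding idea, using only the standard library (file and directory names are illustrative, not from the source):
```python
# encode_variants.py - emit each payload in several encodings (sketch)
import pathlib
import urllib.parse

def variants(p: str):
    yield p.encode("utf-8")                # plain UTF-8
    yield urllib.parse.quote(p).encode()   # URL-encoded, dodges naive quote filters
    yield p.encode("utf-16-le")            # UTF-16-LE, dodges byte-level signatures

out = pathlib.Path("seeds")
out.mkdir(exist_ok=True)
for i, v in enumerate(variants("' OR 1=1 --")):
    (out / f"seed_{i}").write_bytes(v)     # one file per variant for afl-fuzz -i seeds/
```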
---
## 2. Grammar-Evolution Fuzzing
A more powerful variant is to let the LLM **evolve a grammar** instead of concrete seeds. The workflow (“Grammar Guy” pattern) is:
1. Generate an initial ANTLR/Peach/LibFuzzer grammar via prompt.
2. Fuzz for N minutes and collect coverage metrics (edges / blocks hit).
3. Summarise uncovered program areas and feed the summary back into the model:
```prompt
The previous grammar triggered 12 % of the program edges. Functions not reached: parse_auth, handle_upload. Add / modify rules to cover these.
```
4. Merge the new rules, re-fuzz, repeat.
Pseudo-code skeleton:
```python
coverage_stats = None  # no coverage feedback before the first epoch
for epoch in range(MAX_EPOCHS):
    grammar = llm.refine(grammar, feedback=coverage_stats)
    save(grammar, f"grammar_{epoch}.txt")
    coverage_stats = run_fuzzer(grammar)
```
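How `run_fuzzer` turns a campaign into feedback is up to you; one possible sketch measures edge coverage of the corpus with `afl-showmap`, assuming an AFL-instrumented target that reads stdin (helper names are illustrative):
```python
# coverage.py - collect the set of edges hit by a corpus (sketch)
import glob
import subprocess

def edge_coverage(corpus_dir: str, target: str) -> set[str]:
    edges = set()
    for seed in glob.glob(f"{corpus_dir}/*"):
        with open(seed, "rb") as f:
            subprocess.run(["afl-showmap", "-q", "-o", "trace.txt", "--", target],
                           stdin=f, check=False)
        with open("trace.txt") as trace:
            edges.update(line.split(":")[0] for line in trace)  # lines look like "edge:count"
    return edges
```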
Key points:
* Keep a *budget*: each refinement costs tokens.
* Use `diff` + `patch` instructions so the model edits rather than rewrites.
* Stop when Δcoverage < ε.
---
## 3. Agent-Based PoV (Exploit) Generation
After a crash is found you still need a **proof-of-vulnerability (PoV)** that deterministically triggers it.
A scalable approach is to spawn *thousands* of lightweight agents (each a process, thread, or container), each running a different LLM (GPT-4, Claude, Mixtral) or temperature setting.
Pipeline:
1. Static/dynamic analysis produces *bug candidates* (a struct with the crash PC, input slice, and sanitizer message).
2. Orchestrator distributes candidates to agents.
3. Agent reasoning steps:
   a. Reproduce the bug locally with `gdb` + input.
   b. Suggest a minimal exploit payload.
   c. Validate the exploit in a sandbox. If it succeeds → submit.
4. Failed attempts are **re-queued as new seeds** for coverage fuzzing (feedback loop).
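A toy orchestrator for this pipeline, assuming hypothetical helpers (`llm_propose_pov`, `reproduces_in_sandbox`, `requeue_as_seed`) that wrap whatever model APIs and sandbox you use:
```python
# orchestrator.py - fan bug candidates out to many diverse agents (sketch)
import random
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass

@dataclass
class BugCandidate:
    crash_pc: int
    input_slice: bytes
    sanitizer_msg: str

MODELS = ["gpt-4", "claude", "mixtral"]  # whichever backends you have access to

def agent(cand: BugCandidate) -> bytes | None:
    model = random.choice(MODELS)
    temperature = random.uniform(0.2, 1.2)          # diversity hides single-agent flakiness
    pov = llm_propose_pov(model, temperature, cand) # hypothetical LLM call
    if reproduces_in_sandbox(pov):                  # hypothetical sandbox check
        return pov
    requeue_as_seed(pov)                            # failed attempts feed the fuzzer
    return None

with ThreadPoolExecutor(max_workers=64) as pool:
    povs = [p for p in pool.map(agent, candidates) if p]  # candidates: list[BugCandidate]
```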
Advantages:
* Parallelisation hides single-agent unreliability.
* Auto-tuning of temperature / model size based on observed success rate.
---
## 4. Directed Fuzzing with Fine-Tuned Code Models
Fine-tune an open-weight model (e.g. Llama-7B) on C/C++ source labelled with vulnerability patterns (integer overflow, buffer copy, format string). Then:
1. Run static analysis to get function list + AST.
2. Prompt model: *“Give mutation dictionary entries that are likely to break memory safety in function X”*.
3. Insert those tokens into a custom AFL++ mutator (loaded via `AFL_CUSTOM_MUTATOR_LIBRARY` or, for Python, `AFL_PYTHON_MODULE`); a mutator sketch follows the example output below.
Example output for a `sprintf` wrapper:
```
{"pattern":"%99999999s"}
{"pattern":"AAAAAAAA....<1024>....%n"}
```
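One way to consume such a dictionary is an AFL++ Python custom mutator; this is a minimal sketch, with the token list standing in for whatever the fine-tuned model suggests:
```python
# llm_dict_mutator.py - AFL++ Python custom mutator (sketch)
# usage: AFL_PYTHON_MODULE=llm_dict_mutator afl-fuzz -i seeds -o out -- ./target @@
import random

TOKENS = [b"%99999999s", b"%n%n%n%n", b"A" * 1024]  # model-suggested tokens

def init(seed):
    random.seed(seed)

def fuzz(buf, add_buf, max_size):
    data = bytearray(buf)
    pos = random.randint(0, len(data))
    data[pos:pos] = random.choice(TOKENS)           # splice a token at a random offset
    return data[:max_size]

def deinit():
    pass
```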
Empirically this shrinks time-to-crash by >2× on real targets.
---
## 5. AI-Guided Patching Strategies
### 5.1 Super Patches
Ask the model to *cluster* crash signatures and propose a **single patch** that removes the common root cause. Submit once, fix several bugs → fewer accuracy penalties in environments where each wrong patch costs points.
Prompt outline:
```
Here are 10 stack traces + file snippets. Identify the shared mistake and generate a unified diff fixing all occurrences.
```
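Before prompting, you need the clusters themselves; naive grouping by top in-project stack frame is often enough (the `/src/` heuristic and `crash_traces` input are assumptions, not from the source):
```python
# cluster_crashes.py - group stack traces by their top in-project frame (sketch)
from collections import defaultdict

def top_frame(stack_trace: str) -> str:
    for line in stack_trace.splitlines():
        if "/src/" in line:               # assumption: project code lives under /src/
            return line.strip()
    return "unknown"

clusters = defaultdict(list)
for trace in crash_traces:                # crash_traces: list of sanitizer reports
    clusters[top_frame(trace)].append(trace)
# each cluster's traces + file snippets become one "super patch" prompt
```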
### 5.2 Speculative Patch Ratio
Implement a queue where confirmed PoV-validated patches and *speculative* patches (no PoV) are interleaved at a 1:N ratio tuned to scoring rules (e.g. 2 speculative : 1 confirmed). A cost model monitors penalties vs. points and self-adjusts N.
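A minimal sketch of such a self-adjusting queue; the cost model is deliberately crude and illustrative:
```python
# patch_queue.py - interleave confirmed and speculative patches at a 1:N ratio (sketch)
class PatchQueue:
    def __init__(self, n_speculative: int = 2):
        self.n = n_speculative            # speculative patches per confirmed one
        self.confirmed: list = []
        self.speculative: list = []

    def next_batch(self) -> list:
        batch = self.confirmed[:1] + self.speculative[:self.n]
        self.confirmed = self.confirmed[1:]
        self.speculative = self.speculative[self.n:]
        return batch

    def feedback(self, points: int, penalties: int) -> None:
        # crude self-adjustment: speculate less when penalties outweigh points
        self.n = max(0, self.n - 1) if penalties > points else self.n + 1
```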
---
## Putting It All Together
An end-to-end CRS (Cyber Reasoning System) may wire the components like this:
```mermaid
graph TD
subgraph Discovery
A[LLM Seed/Grammar Gen] --> B[Fuzzer]
C[Fine-Tuned Model Dicts] --> B
end
B --> D[Crash DB]
D --> E[Agent PoV Gen]
E -->|valid PoV| PatchQueue
D -->|cluster| F[LLM Super-Patch]
PatchQueue --> G[Patch Submitter]
```
---
## References
* [Trail of Bits AIxCC finals: Tale of the tape](https://blog.trailofbits.com/2025/08/07/aixcc-finals-tale-of-the-tape/)
* [CTF Radiooo AIxCC finalist interviews](https://www.youtube.com/@ctfradiooo)
{{#include ../banners/hacktricks-training.md}}

AI/README.md
@@ -64,4 +64,10 @@ MCP (Model Context Protocol) is a protocol that allows AI agent clients to conne
AI-MCP-Servers.md
{{#endref}}
### AI-Assisted Fuzzing & Automated Vulnerability Discovery
{{#ref}}
AI-Assisted-Fuzzing-and-Vulnerability-Discovery.md
{{#endref}}
{{#include ../banners/hacktricks-training.md}}

SUMMARY.md
@@ -825,6 +825,7 @@
# 🤖 AI
- [AI Security](AI/README.md)
- [Ai Assisted Fuzzing And Vulnerability Discovery](AI/AI-Assisted-Fuzzing-and-Vulnerability-Discovery.md)
- [AI Security Methodology](AI/AI-Deep-Learning.md)
- [AI MCP Security](AI/AI-MCP-Servers.md)
- [AI Model Data Preparation](AI/AI-Model-Data-Preparation-and-Evaluation.md)