SaaS red-team harness that stress-tests backdoor defenses against unseen trigger variants — so ML security engineers know if their defense actually generalizes
Customer: ML security engineer at a mid-size AI lab or fintech/healthtech company — has deployed a backdoor defense (e.g. ONION, STRIP, or fine-pruning), needs to prove it to an internal audit or external compliance review, no dedicated red-team budget
Problem: Backdoor defenses are evaluated on the same trigger distribution they were trained on. Engineer has no fast way to probe whether defense breaks when trigger moves position, gets paraphrased, or shifts to a synonym surface — so audit sign-off is theater
Pricing: saas-mrr — $800 MRR in 4 months (8 customers at $99/mo)
Why now
Recent papers (2025-2026) show backdoor defenses generalize beyond known triggers AND fail on shifted ones — reviewers and auditors are starting to ask ‘did you test trigger variants?’ and engineers have no tooling to answer that question fast
Go-to-market
- Post reproducibility breakdown of one published defense (e.g. ONION) failing on paraphrased triggers on /r/MachineLearning and ML Twitter — link to open-source CLI version of the harness
- File issues or comment on HuggingFace model cards for popular ‘backdoor-robust’ models showing untested variant classes — drives targeted traffic from engineers actively working the problem
- DM 20 authors of backdoor defense papers on Semantic Scholar offering free audit of their eval suite using the harness — 2-3 will share it; that’s enough for early signups
- Ship a free tier: run up to 3 trigger variant classes against any HF-hosted classifier, paywall variant depth, custom trigger injection scripts, and W&B export
Moat (or lack thereof)
No moat. Any ML engineer can replicate the trigger mutation logic in a weekend. Defensible position is only speed-to-insight (prebuilt trigger libraries, W&B dashboard templates) and being the default tool cited in audit checklists — first-mover on that citation loop is the only real edge, and it’s fragile