Cookies Notice
This site uses cookies to deliver services and to analyze traffic.
📣 Introducing AI Threat Modeling: Preventing Risks Before Code Exists
Every application security audit your team has run was built on the assumption that a human wrote the code, understood it, and intended every line.
But the way teams write and ship code has changed. 84% of developers now use or plan to use AI coding tools, and the code these tools produce fails security checks at alarming rates. Formal verification research has found that 55.8% of AI-generated code contains at least one vulnerability, with no model scoring above a D grade. The audit challenge has shifted: codebases are growing faster, introducing unfamiliar patterns, and pulling in dependencies that may not even exist on public registries.
The teams running effective audits in this environment have expanded their scope well beyond source code scanning. They evaluate supply chain integrity, runtime reachability, AI governance policies, and the security posture of non-human identities like AI agents, all within a continuous audit framework rather than a periodic review cycle.
AI-generated code changes what an application security audit needs to catch. Teams need a practical way to find the risky parts, test what the AI missed, and turn the results into fixes developers can actually use.
An application security review is a systematic, evidence-based evaluation of an application’s security controls, architecture, and deployment configuration. It examines how security is built into the software development lifecycle, not just whether individual vulnerabilities exist. AppSec teams or external auditors typically run it, and the output is a compliance attestation, a risk assessment, and a prioritized remediation roadmap.
Audits are often confused with penetration tests and vulnerability scans. All three serve different purposes and operate at different depths.
| Assessment | Scope | Method | Output |
| Vulnerability scan | Known flaws across a broad surface | Automated pattern matching against CVE databases | List of CVEs with CVSS scores |
| Penetration test | Specific targets, depth over breadth | Manual adversarial simulation and exploitation | Attack path documentation and proof of business impact |
| Application security audit | Full security posture across the SDLC | Holistic review of controls, code, policy, and architecture | Compliance attestation and remediation roadmap |
Vulnerability scans reveal flaws, Penetration tests tell you which ones an attacker can exploit, and audits tell you whether your security program is working and where the structural gaps are.
Meet with our team of application security experts and learn how Apiiro is transforming the way modern applications and software supply chains are secured.
The volume of code hitting production has changed by an order of magnitude. GitHub processed 1 billion commits in all of 2025, with the platform handling 275 million commits per week, on pace for 14 billion this year.
Much of that surge is driven by AI coding agents, and Gartner now predicts that 90% of enterprise software engineers will use AI code assistants by 2028.
The downstream pressure is already visible. NIST announced in April 2026 that it can no longer enrich all CVEs in the National Vulnerability Database, citing a 263% surge in submissions between 2020 and 2025, with Q1 2026 running a third higher than the same period last year.
The infrastructure the industry relies on to score and prioritize vulnerabilities is buckling under the weight of AI-accelerated code production.
The security quality of that code is the core problem. A formal verification study across seven major models found that 55.8% of AI-generated code contains at least one provably exploitable vulnerability. The Georgetown Center for Security and Emerging Technology reached a similar conclusion, finding that up to 50% of AI-generated code contains security flaws, with 10% actively exploitable. No model in the formal verification study scored above a D grade.
Supply chain integrity is eroding in tandem. A study of 576,000 code samples across 16 LLMs found that roughly 20% of AI-generated code references packages that do not exist on public registries like PyPI or npm. Attackers have begun registering these hallucinated package names as malicious libraries, a technique known as slopsquatting. For any code security audit, this means the scope must extend beyond the application’s own source code into the package ecosystem itself.
There is also the problem of pattern homogeneity. Because developers across unrelated organizations use similar prompts, AI models produce the same insecure patterns at scale. Identical input validation failures, hardcoded credential placeholders, and broken access control logic appear across thousands of codebases, creating predictable attack surfaces that adversaries can target systematically.
For auditors, the implication is clear. Velocity, unfamiliar patterns, fabricated dependencies, and replicated vulnerabilities have widened the audit surface beyond what periodic, code-focused reviews can cover.
An effective audit in this environment must go well beyond source code scanning. The following six domains define the minimum scope for a security engineer conducting a modern review.
Running an application security audit in a high-velocity AI environment requires a structured methodology. The following five steps cover scoping through remediation.
Start by drawing the audit perimeter. A narrow focus on application source code will miss large portions of the attack surface. The scope should include core applications, APIs, microservices, build systems, cloud infrastructure, and third-party integrations. Run a baseline vulnerability scan early to quantify current exposure and establish a benchmark for measuring improvement.
Inventory every AI-assisted code change from the past 90 days by reviewing commit messages, developer tool logs, and CI/CD records. Shadow AI usage is a significant blind spot. The Stack Overflow 2025 Developer Survey found that many developers use AI coding tools without formal IT approval, creating risk surfaces that security teams cannot see. Catalog which models are in use, which repositories they touch, and whether acceptable use policies exist.
Execute SAST and DAST across the full codebase and running environment. Validate scanner outputs manually, especially for AI-generated code, which often mimics secure patterns on the surface while failing in execution. Integrate scanning into DevSecOps workflows so that analysis runs continuously rather than as a one-time audit event.
Not every finding from the previous step warrants remediation. Use reachability analysis to confirm whether vulnerable code is actually accessible from a public endpoint. Chain vulnerabilities together to assess real attack paths. An insecure AI-generated SQL builder combined with missing rate limiting, for example, could enable full database exfiltration, while the same SQL flaw in an internal tool behind a VPN may be a low priority.
Translate technical findings into business risk. Categorize every finding by severity, exploitability, and business impact. Assign clear ownership for each item and set remediation timelines based on risk, not just CVSS scores. Track mean time to remediate (MTTR) as the primary metric for measuring whether the security program is keeping pace with development velocity.
Use this checklist to verify coverage across the core audit domains.
The application security audit is no longer a periodic checkpoint.
When codebases grow at machine speed, accumulate hallucinated dependencies, and replicate the same insecure patterns across thousands of repositories, the audit must become a continuous governance mechanism with full architectural visibility.
The teams that will manage this effectively are the ones that can map their entire software architecture across every material change, correlate findings from code to runtime, and prioritize remediation based on actual business risk rather than raw vulnerability counts. That requires a platform foundation that understands the architecture, enforces policy automatically, and acts as a force multiplier for security engineers who are already stretched thin.
Apiiro is the agentic application security platform built for this problem. Deep Code Analysis engine continuously maps the full software architecture, code-to-runtime correlation confirms which findings are actually reachable in production, and AI agents (AutoFix, AutoGovern, Guardian with Secure Prompts) remediate risks, enforce policy, and prevent vulnerable code from being generated at all.
Get full visibility into your software architecture, prioritize findings by reachability and business impact, and cut through the noise that makes audits slow. Schedule a demo to see how it works.
An audit is a comprehensive review of an application’s security controls, policies, and SDLC processes. A penetration test is a targeted, adversarial simulation designed to exploit specific vulnerabilities and prove business impact. Audits assess the overall security posture. Pen tests prove whether individual flaws are exploitable.
At a minimum, annually. In high-velocity environments with significant AI-generated code or regulatory requirements like PCI DSS, quarterly audits or continuous audit mechanisms are more appropriate. Any major infrastructure change, acquisition, or shift in development tooling should also trigger a review.
Traditional SAST catches surface-level syntax issues but frequently misses the complex logic flaws, business logic errors, and hallucinated dependencies common in AI-generated output. Modern audits require tools with semantic analysis and architectural context to identify the patterns that rule-based scanners overlook.
Unmanaged security debt. Without regular audits, vulnerabilities accumulate untracked across the codebase, supply chain, and runtime environment. This increases the likelihood of a breach, regulatory penalties, and loss of customer trust. In AI-heavy codebases, the accumulation rate is significantly faster than in human-only environments.
ASPM transforms audits from periodic, manual reviews into a continuous, data-driven process. It unifies findings across tools, provides reachability context to separate noise from genuine risk, automates evidence collection, and keeps organizations audit-ready without compliance fire drills. It gives auditors the runtime and architectural context that standalone scanners lack.
See for yourself how Apiiro can give you the visibility and context you need to optimize your manual processes and make the most out of your current investments.