Opsera Launches BrickForge, a Purpose-Built AI Operational Command Center for Enterprise Data Teams

Available on Databricks Marketplace

Learn More

Book a Demo

Blog /

AI, Security

I Ran an AI Security Scan on a Notoriously Vulnerable App. Here’s What Happened.

Rohit Dalvi

Published on

April 9, 2026

Evaluate your code base for modernization.

Start Forging

I’ll be honest, when I first heard “AI security agent,” I pictured something that would flag a couple of obvious issues and call it a day. I was wrong.

I used Opsera’s Security Agent to scan OWASP Juice Shop, one of the most widely used intentionally vulnerable Node.js apps in the security community. It’s a deliberately broken e-commerce app used to teach developers what not to do. In other words, it’s a perfect stress test.

Here’s what I found, what it means, and where I think this kind of tooling is going.

Setting It Up

I had the Opsera Security Agent connected via GitHub Copilot in VS Code. Three prompts in, I typed: “Run Opsera security scan for this project.”

That’s it. No config files. No manual pipeline setup. No switching tabs.

The agent kicked off 6 different security scanners simultaneously: gitleaks, npm audit, trivy, semgrep, checkov, and hadolint, and came back with a full report in minutes.

What It Found?

Risk Score: 97 / 100. That’s basically as bad as it gets.

Total findings: 176 broken down as

7 Critical
126 High
34 Medium
9 Low

Now, Juice Shop is designed to be broken, so some of this is expected. But what surprised me was the depth of what the agent caught across very different layers of the stack.

Within 2-3 minutes, I had a comprehensive vulnerability report covering 6 security domains and 176 findings, something that would’ve taken a security team hours to pull together manually.

Secrets Sitting in Plain Sight

The scan flagged a generic API key hardcoded in data/static/users.yml at line 88. It also caught a JWT token pattern buried inside a frontend test file.

These are exactly the kinds of things that get missed in manual code reviews: they’re tucked away in YAML configs and test artifacts, not in the main application logic. The fact that the agent surfaced them without being told where to look is genuinely useful.

7 Critical Dependency Vulnerabilities

npm audit came back with 7 critical CVEs across Juice Shop’s dependency graph. This is the silent killer in most real-world apps: you didn’t write the vulnerable code, some package you installed two years ago did. And unless you’re actively scanning, you’d have no idea.

SQL Injection in the Login Route

Semgrep caught a Sequelize injection pattern in routes/login.ts and routes/search.ts. Direct user input flowing into database queries without parameterization, a textbook vulnerability that’s also one of the most exploited in real breaches.

Container and IaC Issues

Checkov and hadolint flagged the Dockerfile for missing a HEALTHCHECK, an unpinned base image tag, and consecutive RUN instructions that bloat the image unnecessarily. Small things, but the kind that compound over time in production.

So What?

Here’s where I want to be direct: most security tools give you a list of problems and leave you to figure out the rest. What made this scan different was that it gave me a prioritized action plan of what to fix this sprint, what to plan next quarter, without me having to interpret raw vulnerability data.

That matters a lot in a real engineering org. A developer shouldn’t have to context-switch into a security mindset to understand why a finding is urgent. A PM shouldn’t need a security background to prioritize a fix in the backlog. The agent bridges that gap.

The business impact of what was found here:

Hardcoded credentials in source = one leaked repo away from a full account compromise
Critical npm CVEs = known exploits, publicly documented, actively targeted
SQL injection = data breach waiting to happen, and a compliance nightmare (SOC 2, GDPR, you name it)

These aren’t theoretical risks. They’re the kinds of issues that make headlines.

Where Does This Go From Here?

This is the part I’m most excited about as a product thinker.

Right now, the agent scans and reports. That’s already valuable. But the natural next evolution is:

CI/CD Gates

Block a merge if a critical finding is introduced. Security becomes a hard gate in the pipeline, not a quarterly audit. The scan I ran manually today becomes something that runs automatically on every pull request.

Auto-Remediation Suggestions

The report already includes remediation guidance per finding. The next step is the agent opening a PR with the actual fix – patching the dependency, replacing the hardcoded secret with an environment variable reference, parameterizing the query and flagging it for human review

Compliance Reporting Out of the Box

SOC 2, ISO 27001, FedRAMP – every one of these frameworks has specific security requirements around dependency management, secrets handling, and access control. The scan data is already there. Mapping it automatically to a compliance framework and generating audit-ready evidence would save teams weeks of work.

Agentic Fix Workflows

The bigger picture here is an agent that doesn’t just tell you what’s broken, it fixes it, tests the fix, verifies nothing regressed, and closes the loop. The human stays in the decision seat, but the grunt work of security remediation gets offloaded.

This is what shift left actually looks like when it’s powered by AI agents rather than just better tooling.

Final Thought

Security has historically been the thing teams deal with after something goes wrong. The Opsera Security Agent is a step toward making it something that runs quietly in the background, catches issues early, and tells you exactly what to do about them.

For a team that’s moving fast, which is every team worth working on, that’s not a nice-to-have. It’s the difference between shipping confidently and shipping with crossed fingers.

I ran one command. It found 176 issues across 6 different security domains in minutes.

That’s the bar now!

Back to Blog

Evaluate your code base for modernization.

Start Forging

Opsera Launches BrickForge, a Purpose-Built AI Operational Command Center for Enterprise Data Teams

Platform

Unified DevOps Platform

Unified Insights

Unified DevSecOps

Databricks Ops

Forge Software Factory

AppSec Agents

BrickForge

Use Cases

SaaS

SDLC

Data & Analytics

Security

Accelerating AI Maturity with LTM

Resources

DORA vs SPACE Metrics: Key Differences, Use Cases, and When to Use Each Framework

Resources

DORA vs SPACE Metrics: Key Differences, Use Cases, and When to Use Each Framework

Developers

Episode 23: Architecting for Success: Salesforce DevOps with Opsera

Company

Opsera and Cursor Partner to Embed Autonomous AI Agents Directly into AI-SDLC Workflows for Next-gen AI-Driven Development

I Ran an AI Security Scan on a Notoriously Vulnerable App. Here’s What Happened.

Setting It Up

What It Found?

Secrets Sitting in Plain Sight

7 Critical Dependency Vulnerabilities

SQL Injection in the Login Route

Container and IaC Issues

So What?

Where Does This Go From Here?

Final Thought

Evaluate your code base for modernization.

Recommended Content

Resources

Setting It Up

What It Found?

Secrets Sitting in Plain Sight

7 Critical Dependency Vulnerabilities

SQL Injection in the Login Route

Container and IaC Issues

So What?

Where Does This Go From Here?

Final Thought

Book time with our DevOps experts and start empowering your teams

Evaluate your code base for modernization.

Recommended Content