AI Agent Security
The execution and containment properties of autonomous agents operating on developer workstations.
- Autonomous execution surfaces
- Agent containment models
- Prompt injection chains
Medulla AI Security Research Institute is newly established in Silicon Valley, California. We are forming a program around AI agent security, Model Context Protocol ecosystems, and the runtime conditions under which autonomous software can be operated safely on real endpoints — and we welcome partners who want to work in the open.
Medulla AI Security Research Institute is an independent nonprofit in its founding phase, based in Silicon Valley, California, focused on the safety, transparency, and governance of autonomous AI systems and local AI runtime environments. Our aim is to publish open research, ship reference security tooling, and contribute to technical standards as the program matures — starting from a clear agenda and careful partnerships, not from a backlog of finished work we are claiming today.
We will prioritize original analysis of attack surfaces unique to autonomous coding agents, MCP servers, and on-device model orchestration runtimes.
We intend to release production-minded open source for the AI agent ecosystem, including LLMGuard — runtime protection for macOS endpoints — as engineering milestones allow.
Contributions we aim to make over time: reproducible benchmarks, threat taxonomies, and best-practice guidance for organizations adopting autonomous agents at scale.
While peer-reviewed outputs are not yet available, our work is organized around the operational reality of modern AI agents — where they execute, what they touch, and how their behavior can be observed, contained, and governed.
The execution and containment properties of autonomous agents operating on developer workstations.
Trust boundaries, protocol governance, and exposure of remote tool servers in the Model Context Protocol.
Real-time monitoring, policy enforcement, and isolation primitives for local AI execution.
CVE research, exploit chain analysis, and runtime telemetry across the agent ecosystem.
When technical notes, papers, or advisories are ready, we will publish them openly under permissive licenses with reproducible methodology and coordinated disclosure where vulnerabilities are involved. There is nothing to list yet — the institute is new and outputs are in preparation.
No research outputs have been published yet. This section will list papers, briefs, and other releases as they ship. If you are working in a related area and want to coordinate early, we would welcome a conversation.
Dates are targets, not promises. We will adjust as partners join and as our first technical work proves out — but we publish a roadmap so expectations stay grounded in reality.
Public charter and governance, disclosure channel, research agenda refinement, and early conversations with academics, security teams, and open-source maintainers. No publication obligations until the program is staffed and scoped.
Initial threat-model or landscape note on AI agent and MCP runtime risk, plus a documented disclosure workflow for coordinated findings. LLMGuard moves from concept toward an auditable preview where feasible.
First open-source milestone for LLMGuard or an adjacent tool, a small public event series (see below), and clearer standards-facing deliverables — benchmarks or taxonomy drafts — informed by partner feedback.
Deeper long-running projects as funding and collaboration allow: expanded publications, sustained disclosure coordination, and tooling maintained in public with predictable release cadence.
Most programming is still being scheduled. If you would like Medulla to participate in a workshop, panel, or closed technical exchange, use the contact channel — we prioritize small, high-signal forums.
Public Q&A on mission, roadmap, and disclosure. No recordings at first; details and registration link will be posted here when the first session is scheduled.
Invitation-only technical discussion on containment patterns for coding agents and MCP trust boundaries. A short summary may be published afterward if participants agree.
The institute just launched. Past talks and workshops will be listed here with titles, dates, and materials when available.
We consider conferences, university seminars, and community meetups that align with our principles. Include format, audience, and expected level of technical depth.
Propose an eventThe institute's open-source work will follow the same norms we aim to apply to research: permissive licensing, vendor neutrality, and software designed without mandatory telemetry, third-party runtime dependencies, or cloud coupling. The first projects are still taking shape.
LLMGuard is our planned local security layer for detecting, auditing, and containing high-risk behavior from AI agents — copilots, MCP-driven tools, and other autonomous assistants — on macOS 14+. Details and code will appear as milestones are reached; the preview below illustrates the intended experience.
Even at launch, Medulla is defined by how it intends to work. These principles will guide what we publish, how we collaborate, and which mandates we will not accept as the institute grows.
Open research and reproducible analysis. Methods, data, and decisions are documented in the public record.
No commercial AI platform affiliation. Funding, governance, and research priorities are kept separate from vendor interests.
Research focused on the safety of the broader AI ecosystem — practitioners, developers, and the public alike.
Active support for academic researchers, security teams, and open-source communities advancing AI runtime safety.
Medulla works alongside academic groups, security teams, standards bodies, and open-source maintainers. If your work intersects with AI agent or runtime safety, we would like to hear from you.
Contact the Institute