Vercel's Eve Framework Treats AI Agents as Standard Software Projects
Agent projects today collapse under their own prompt spaghetti and ad-hoc tool wiring. Eve's file-system-native approach makes agent behavior versionable, reviewable, and debuggable with the same workflows teams already use for web apps, lowering the barrier to production deployment.
Eve rethinks AI agent development by imposing a strict, file-system-native project structure. Each agent lives in its own directory, with an `instructions.md` file defining its behavior, a `tools/` folder of TypeScript files for capabilities, and reusable skill modules that load on demand. A built-in sandbox isolates code execution, and a scheduler turns agents into background automation systems that run on a timer.
The framework shifts prompts from ephemeral strings to engineering assets. Tool registration becomes automatic file detection, and debugging surfaces structured logs showing every tool call and state change. The result is an agent development experience that mirrors modern web application workflows—`dev` mode, visual observability, and modular composition.
Eve targets the fragmentation that plagues current agent projects: scattered logic, unobservable execution, and deployment friction. By standardizing the directory layout and execution model, it aims to make agents as maintainable and deployable as any other software system.
Eve's directory-as-agent model treats prompt engineering as a configuration management problem, not a model-tuning problem. The `instructions.md` file becomes a version-controlled artifact that teams can review, branch, and roll back.
Auto-registering tools from a file system removes an entire category of boilerplate that makes current agent frameworks brittle. The convention-over-configuration approach mirrors what made Rails and Next.js productive for web development.
On-demand skill loading addresses a real failure mode: monolithic system prompts that grow until they degrade model performance. Modular skills keep context lean and let agents compose behavior dynamically.
Adding a scheduler pushes agents past the chatbot boundary into cron-replacement territory. This is where agent frameworks start competing with traditional automation and ETL pipelines, not just conversational UIs.