Every week you work on the same agent — a smart assistant for a business. It starts simple and gets harder.
Week 1 — the agent answers questions about the business, searches the web for anything it doesn't know, and runs on three different LLMs so you can see the difference in quality and cost.
Week 2 — it gains memory. It knows who each client is, their preferences, their history, their interactions. Fifty simultaneous users, zero data crossover.
Week 3 — it reads the business's actual documents. Price lists, employee bios, internal policies, client contracts. It cites its sources and never makes up an answer. It also accepts a photo of a product or issue and gives a first-pass assessment.
Week 4 — you rebuild it on LangGraph with a proper state machine and connect LangSmith. Now you can trace every decision, catch every hallucination, and escalate to a human administrator when confidence is low.
Week 5 — one agent becomes three: an intake specialist, a triage agent, and a booking agent. They coordinate, pass context between them, and pause for human approval before confirming anything critical.
Week 6 — it connects to real systems. A client describes a request, attaches a photo, and without any human involvement the agent creates a lead in CRM, checks Google Calendar, books the appointment and sends a WhatsApp confirmation.
Week 7 — Docker, AWS, HTTPS, monitoring with Telegram alerts, prompt injection protection, model routing, semantic caching. Cost per request drops 40–60%. The agent is in production.
Week 8 — you design and build a second enterprise agent entirely on your own, for a real business case you choose, and present it in front of the group.