App Dev Advanced
Shut the Context Window! You're Letting the Tokens Out
AI-powered development is accelerating rapidly, but many teams are unknowingly burning through tokens due to how context is constructed and used. As organisations adopt AI tooling and agentic workflows, inefficient context usage can quietly increase cost, reduce performance, and degrade output quality.
In this session, we will break down how modern AI systems use context, including how prompts are expanded with additional data, how tokens are consumed across interactions, and why context windows should be treated as a designed resource rather than an infinite scratchpad. We will explore common patterns that lead to excessive token usage, such as large working sets, long-running interactions, and uncontrolled agent loops.
From there, we will look at practical approaches to regaining control, including focused work loops, context management strategies, and using the right model for the right task. Finally, we will explore how to introduce observability and control into AI workloads through platform-level patterns, including AI gateways, usage attribution, and policy-driven design.
Attendees will leave with a clearer understanding of how tokens are used across AI systems and how to design more efficient, observable, and cost-aware AI-powered development workflows.
Dylan McCarthy
Principal Solution Consultant · Versent, Microsoft MVP
Long Session