Welcome to Vanishing Gradients!
This issue is about what it actually takes to build with AIโdebugging agents, validating outputs, navigating uncertainty, and avoiding the trap of systems that only look good on paper. Whether youโre testing LLM apps, deploying open-weight models, or trying to make metrics mean something, thereโs a lot in here for you.
Quick links below to whatโs coming up in my data/AI life, what just dropped, and how to plug in:
๐
Live Online Events
โ [May 27] Why Data and AI Still Break โ Akshay Agrawal (Founder, marimo; ex-Google Brain, Netflix, Stanford)
โ [May 27] Building Production-Grade AI Systems โ Aman Gupta (MasterClass)
โ [May 29] Build GenAI Systems Fast: AI Studio, Gemini, and Gemma โ Ravin Kumar (DeepMind)
โ [June 2] Why Tool Calling Breaks โ Alan Nichol (Co-founder, Rasa)
โ [June 15] Evaluating AI Agents (Live Workshop) โ Ravin Kumar (DeepMind)
๐ In-Person Events
โ [June 16] Berlin Meetup: Build with AI โ Ines Montani (Explosion), Hugo Bowne-Anderson
โ [June 17] Berlin Meetup: Agents & Evals โ Alan Nichol (Rasa), Hugo Bowne-Anderson
โ [June 24โ25] VentureBeat Transform โ Workshop + Panels (San Francisco)
โ [July 7โ11] SciPy 2025 โ Tutorial + Talk (Tacoma, WA)
๐ Podcasts & Clips
โ [Fei-Fei Li] AI as a Civilizational Technology
โ [Eoin OโMahony] When Non-Determinism Is a Superpower
๐ Hands-On
โ [Gemma Livestream Recap] Building AI Agents Locally โ Ravin Kumar
โ [Lightning Lesson Replay] LLMs and Low-Hanging Fruit โ Nathan Danielsen (Carvana)
โ [June 15 Workshop] Evaluating AI Agents: From Demos to Dependability
๐ก Get Involved
โ Sign up for the upcoming Building LLM Applications for Data Scientists and Software Engineers course (timed for EU + US)
โ Subscribe to our lu.ma calendar for livestreams and workshops
โ Subscribe on YouTube for workshops, podcast livestreams, and more!
โ Iโll be in London, Paris, Berlin, and a few other European citiesโhit reply if you want to co-host a meetup or bring me in to speak with your team
๐ Reading time: 12โ15 minutes
AI as a Civilizational Technology
๐ง Fei-Fei Li on AI, Institutions, and Shared Prosperity
AI is a civilizational technologyโฆ It touches on geopolitics. It touches on productivity and shared prosperity. These are bigger societal problems that have to do with human-centered AI.
Fei-Fei Li opens our most recent episode of High Signal with the stakes: not just how AI works, but what itโs doing to labor, government, and global systems. Sheโs best known for creating ImageNet, and now leads Stanfordโs Human-Centered AI Institute. Sheโs also a former VP and Chief Scientist of AI/ML at Google, and currently co-founder of World Labs.
In this episode, Fei-Fei, Duncan, and I talk about:
โ What it means to build AI infrastructure for societyโnot just for scale
โ Why spatial intelligence could shift the foundation of AI
โ The role of public education, civic trust, and long-term thinking
๐ง Listen to the clip below and check out the full episode hereย
Two Upcoming Lightning Lessons + One Past โ 30 Minutes, Tactical, and Free
โก Build GenAI Systems Fast: AI Studio, Gemini, and Gemma
Wed, May 28 with Ravin Kumar (DeepMind)
Googleโs AI Studio is one of my favourite tools to explore and play around with the Google/DeepMind ecosystem of AI tools, including Gemini and Gemma.
In this lightning lesson, Ravin (who works directly on Gemini, Gemma, and AI Studio) and I will share how these tools are actually designed to be usedโso you can move from playing with prompts to building full systems that ship.
Youโll learn how to:
โ Prototype full GenAI systems using AI Studio
โ Build real GenAI features: retrieval, agents, and tools
โ Run locally with Gemma or scale with Gemini
โ Start building today with reusable code templates
๐ Register here
โก Why Tool Calling Breaks AI Systemsโand What to Do Instead
Mon, June 2 with Alan Nichol (Co-founder and CTO, Rasa)
Tool calling is one of the worst defaults in AI system design today.
Alan has spent over a decade helping Fortune 500s build real conversational AI. In this lightning lesson, weโll break down why most tool use patterns fall apartโand how to fix them with Process Calling: a structured way to make agents reliable, inspectable, and explainable.
Youโll learn how to:
โ Design stateful, multi-turn business logic
โ Move beyond brittle prompt chaining
โ Build agents that handle branching, memory, and follow-up
โ Reduce flakiness and accelerate iteration with modular flows
๐ Register here
โก LLMs and Low-Hanging Fruit: Finding GenAI Value Fast
With Nathan Danielsen (Builder of Great Products and Engineering Teams, Carvana)
Nathan and I recently gave a lightning lesson on all the low-hanging fruit in your organization that is already ripe for GenAI use cases. In it, youโll learn
โ How to build GenAI apps with your existing internal data
โ A framework for finding GenAI wins in your org
โ How to go from idea to prototype fastโno new stack needed
You can check it out here.
Building AI Agents with Gemma 3๐ค
Livestream with Ravin Kumar: LLM Agents, Tool Calling, and Real-World Debugging
Ravin Kumar (DeepMind) and I ran a 2+ hour workshop on building AI agents locally with Gemma 3. Over 200 people joined liveโand we pushed things further by building an MCP server/client setup from scratch.
This wasnโt a polished demo. The code broke. The tools failed. But that made it realโand way more useful. We debugged in the open and showed what it actually looks like to build with open weights, live.
Some of what we covered:
โ Building a local LLM app and exploring Gemma models
โ Logging, observability, and debugging with real tools
โ Tool calling vs. agentsโand why the distinction matters
โ MCP architecture, client/server setup, and prompt iteration
๐งต Watch the full workshop + access the repo + join the Discord
Evaluating AI Agents: From Demos to Dependability
๐งช Upcoming Workshop โ June 15 with Ravin Kumar
Iโm very excited for our 3rd live, online, and free workshop with Ravin Kumar (Deepmind). Most AI agent demos look impressiveโuntil they break in practice. In this live, hands-on session, weโll focus on what it takes to make agents dependable.
Youโll learn how to:
โ Trace tool use and model reasoning
โ Simulate real interactions and edge cases
โ Define what success actually means
โ Catch silent failures and iterate effectively
Weโll build a lightweight agent that can:
โ Query a SQL database
โ Run Python-based data analysis
โ Generate basic visualizations
And youโll evaluate whether it:
โ Chose the right toolโ Executed the right logicโ Explained the result correctly
All running locally using Gemma 3 models and Ollamaโno frameworks, no cloud dependencies.ย
This is the third session in a series focused on building real systems:
1๏ธโฃ Local LLM apps + evaluation harnesses
2๏ธโฃ Agents with tool use + dynamic behavior
3๏ธโฃ Now: making those agents reliable and testable
๐ฅ Register for free for the June 15 livestream
๐ When Non-Determinism Is a Superpower
Most teams try to make LLMs behave like deterministic APIs. But what if unpredictability is the point?
In this clip from High Signal, Eoin OโMahony (ex-Uber) shares a moment that reshaped how I think about agent behavior:
If you gave it to 10 analysts and had them spend two weeks on each of it... do you expect them all to come up with the same answer?
We talk about why determinism isnโt always the goalโespecially in agentic systems doing exploratory analysis, decision support, or reasoning under uncertainty.
The full episode touches on:
โ Analyst agents and diverse perspectives
โ When randomness reveals real signal
โ Operationalizing ML in complex systems
โ Why early experiments need impact, not just p-values
โ How network effects can flip your metrics
๐ง Watch the clip above and listen to the full episode here.
Live Online Events
๐บ Why Data and AI Still Break at Scale (and What to Do About It)
Tues, May 27 with Akshay Agrawal (Founder of marimo; ex-Google Brain, Netflix, Stanford)
Why does so much AI work fall apart when it leaves the laptop? Weโll talk about reproducibility, researchโprod gaps, and the hidden cost of bad tooling.
๐ Register here
๐บ Building Production-Grade AI Systems at MasterClass
Tue, May 27 with Aman Gupta (MasterClass)
A behind-the-scenes look at how MasterClass built LLM systems for real productsโwithout relying on off-the-shelf APIs.Weโll cover infra, post-training, evaluation, and the realities of shipping AI in production.
๐ Register here
๐บ Building Reliable Agents with Open-Weight Models
Sun, June 15 with Ravin Kumar (DeepMind)
This live session will focus on debugging, evaluation, and making agents that actually workโbuilt entirely with open-weight models and local tools.
๐ Register here
In-Person Events๐
Build with AI โ Berlin Meetup
Monday, June 16 ยท 6:00โ9:00 PM GMT+2
๐ค Hosted by Explosion AI, Native Instruments, and Vanishing Gradients
๐ Food & drinks provided
Two short talks, lightning demos, and time to connect with the Berlin AI/dev community.
Talks:
๐งพ Conquering PDFs: Document Understanding Beyond Plain Text
Ines Montani โ spaCy / Explosion
From messy formats to structured data using spaCy, Docling, OCR, and layout models.
๐งช Evaluation-Driven Development & Synthetic Data Flywheels
Hugo Bowne-Anderson โ Vanishing Gradients
How to catch failures before users doโvia synthetic data, eval harnesses, and feedback loops.
๐ Register here.
๐ Agents & Evals โ Berlin Meetup
Tuesday, June 17 ยท 6:00โ9:00 PM GMT+2
๐ค Hosted by Vanishing Gradients and Rasa
๐ป Snacks and conversation included
Two short talks and an open-floor session on building LLM systems that actually work.
Talks:
๐ Why Tool Calling Breaks Your AI Agentsโand What to Do Instead
Alan Nichol โ Co-founder & CTO, Rasa
How Process Calling helps agents handle memory, branching, and control.
๐งช Escaping POC Purgatory with Evaluation-Driven Development
Hugo Bowne-Anderson โ Vanishing Gradients
Lessons from teams that escaped demo limbo and built testable, durable systems.
๐ Register here.
๐ VentureBeat Transform โ San Francisco, June 24โ25
Iโll be at VentureBeat Transform this yearโgiving a workshop on building AI agents and hosting a few panels. Details still to come, but if youโll be in SF, would love to see you there.
๐ SciPy 2025 โ Tutorial + TalkJuly 7โ11 ยท Tacoma, WA
Iโll be presenting a workshop and a talk at SciPy, 2025, and I hope to see you there!
๐ Tutorial: Building LLM-Powered Applications for Data Scientists and Software Engineers (July 7โ8)
๐งช Talk: Escaping Proof-of-Concept Purgatory: Building Robust LLM-Powered Applications (July 9โ11)
Iโll also be in London, Paris, Berlin, and a couple of other European cities over the next month so if youโd like to do a meetup, have me give a talk at your company, or just chat with your team about what youโre building, hit reply. Iโd love to hear what youโre working on.
Want to Support Vanishing Gradients?
If youโve been enjoying Vanishing Gradients and want to support my work, here are a few ways to do so:
๐งโ๐ซ Join (or share) my AI course โ Iโm excited to be teaching Building LLM Applications for Data Scientists and Software Engineers againโthis time with sessions scheduled for both Europe and the US. If you or your team are working with LLMs and want to get hands-on, Iโd love to have you. And if you know someone who might benefit, sharing it really helps.
๐ฃ Spread the word โ If you find this newsletter valuable, share it with a friend, colleague, or your team. More thoughtful readers = better conversations.
๐
Stay in the loop โ Subscribe to the Vanishing Gradients calendar on lu.ma to get notified about livestreams, workshops, and events.
โถ๏ธ Subscribe to the YouTube channel โ Get full episodes, livestreams, and AI deep dives. Subscribe here.
๐ก Work with me โ I help teams navigate AI, data, and ML strategy. If your company needs guidance, feel free to reach out by hitting reply.
Thanks for reading Vanishing Gradients!
If youโre enjoying it, consider sharing it, dropping a comment, or giving it a likeโit helps more people find it.
Until next time โ๏ธ
Hugo