Details

Important: Registration on the event website is required for admission. (RSVP is turned off on Meetup.)

Description:
Join Google Cloud for the agentic AI bootcamp to learn how to build production-ready AI and multimodal agents.

The ‘Next’ – Capturing Innovation
The text-only chatbot is evolving. This bootcamp focuses on the leading edge of AI: multimodality. We’ll explore how to build intelligent agents that can see, hear, and respond to the world in real time, creating immersive experiences that feel more human.

What to Expect:

  • Multimodal Gemini Agents: Coordinate agents to analyze video and audio while maintaining character consistency across multi-turn image generation.
  • Intelligence Beyond RAG: Move past simple retrieval with hybrid search, context engineering, and multi-agent pipelines.
  • Real-Time Live Interaction: Build low-latency, interruptible agents that "see" and "hear" using the Gemini Live API and bidirectional streaming.

Who Should Attend?
This hands-on workshop is designed for software developers, data scientists, and AI practitioners who have some experience building applications or working with models and want to productionize them. To get the most out of the labs, you should have foundational knowledge of a programming language like Python and be comfortable using the command-line interface. Expertise is not required, but a basic understanding of cloud computing concepts, web APIs, and containerization technology like Docker will be highly beneficial.

To participate, you must bring your own laptop and power cable. The activities are intended for laptops and cannot be completed on a tablet or phone.

Venue: Waterloo, Canada (register for the full address)

