Video

5 practical Gemini API uses for developers

Google for Developers · May 22, 2025

Explore practical uses of the Gemini API for developers, including how to build innovative applications with Gemini's features. We'll cover image understanding for tasks like object recognition and scene description; multimodal interactions that combine voice, text, and images for more natural user experiences; and automating complex workflows through function calling. We'll also share how to use the long context window to enable complex reasoning and multi-step problem solving.

Resources:
Gemini API → https://ai.google.dev/gemini-api
Gemini API Cookbook on GitHub → https://goo.gle/cookbook
GenList demo → https://goo.gle/genlist-demo
Live API - Web Console → https://goo.gle/3SvUZji
Gemini 2.0 - Multi-tool with the Multimodal Live API → https://goo.gle/gemini-maps-plots
Gemini 2.0: Browser as a tool → https://goo.gle/gemini-browser-tool

Speaker: Mark McDonald

Check out all the keynote sessions from Google I/O 2025 → https://goo.gle/io25-keynote-sessions
Check out the AI session track from Google I/O 2025 → https://goo.gle/io25-ai-yt
Check out all of the sessions from Google I/O 2025 → https://goo.gle/io25-sessions-yt
Subscribe to Google for Developers → https://goo.gle/developers

Event: Google I/O 2025
Products Mentioned: AI/Machine Learning