Start building with Gemini 2.5 Flash
Gemini 2.5 Flash is in preview, offering improved reasoning capabilities through a "thinking" process that developers can control for cost and latency tradeoffs. This updated version aims to provide a cost-effective solution for complex tasks, balancin...
Three MarTech solutions putting generative AI in marketing
Three generative AI-powered MarTech solutions from Google designed to help developers streamline marketing material creation, personalize campaigns, and enhance ad performance. Discover ViGenAiR for video ad creation, Adios for image asset management a...
Gemini 2.5 Flash and Pro, Live API, and Veo 2 in the Gemini API
Updates to the Gemini API, including the production readiness of Veo 2 for video generation, the preview of the Live API for real-time interactions, and the upcoming Gemini 2.5 Flash model, alongside the existing Gemini 2.5 Pro aim to enhance developer...
Announcing the Agent2Agent Protocol (A2A)
Agent2Agent (A2A) protocol is an open standard designed to enable AI agents from different vendors and frameworks to collaborate and exchange information across enterprise platforms aiming to foster a future of seamless AI agent interoperability and en...
Simplified Dataflow Connectors with Managed I/O
Google Cloud Dataflow's Managed I/O simplifies using Apache Beam I/O connectors by automatically updating connectors to the latest versions and providing a standardized API, optimizing connectors specifically for Dataflow, ensuring efficient performanc...
The Gemini API and the Internet of Things
The Gemini API and ESP32 microcontroller simplify custom voice commands for IoT devices, leveraging speech recognition for devices to understand and react to custom commands, bridging the gap between digital and physical worlds.
Unlocking bonus worlds with Gemini for the Google I/O puzzle
The Google I/O 2025 puzzle used the Gemini API to generate dynamic riddles for bonus worlds, enhancing player engagement and scalability. Here's what our developers learned on using the Gemini API effectively, including creativity, design, and implemen...
Experiment with Gemini 2.0 Flash native image generation
The experimental native image generation feature of Gemini 2.0 Flash – allowing for the combination of text and images, conversational image editing, and leveraging real-world knowledge for contextual visuals – is now available for developers to te...
Safer and Multimodal: Responsible AI with Gemma
ShieldGemma 2, built on Gemma 3, is a 4 billion parameter model that can be used as an input filter for vision language models or an output filter for image generation systems, and is designed to respond to a wide range of diverse and nuanced imagery.
Introducing Gemma 3: The Developer Guide
Gemma 3 is a new, advanced version of the Gemma open-model family featuring multimodality, longer context windows, and improved language capabilities, with various sizes and deployment options for developers to experiment.
Gemma 3 on mobile and web with Google AI Edge
Gemma 3 1B, a new small language model for mobile and web applications via Google AI Edge, is now available, with increased efficiency, improved performance, and offline availability.
State-of-the-art text embedding via the Gemini API
A new experimental Gemini Embedding text model, now available in the Gemini API, achieves top rankings on the Massive Text Embedding Benchmark (MTEB) leaderboard and offers expanded language support and high-dimensional embeddings.