Safer and Multimodal: Responsible AI with Gemma
ShieldGemma 2, built on Gemma 3, is a 4 billion parameter model that can be used as an input filter for vision language models or an output filter for image generation systems, and is designed to respond to a wide range of diverse and nuanced imagery.
Introducing Gemma 3: The Developer Guide
Gemma 3 is a new, advanced version of the Gemma open-model family featuring multimodality, longer context windows, and improved language capabilities, with various sizes and deployment options for developers to experiment.
Gemma 3 on mobile and web with Google AI Edge
Gemma 3 1B, a new small language model for mobile and web applications via Google AI Edge, is now available, with increased efficiency, improved performance, and offline availability.
State-of-the-art text embedding via the Gemini API
A new experimental Gemini Embedding text model, now available in the Gemini API, achieves top rankings on the Massive Text Embedding Benchmark (MTEB) leaderboard and offers expanded language support and high-dimensional embeddings.
Gemini 2.0 Deep Dive: Code Execution
This blog post introduces Gemini's code execution feature, which allows the AI model to generate and run Python code for tasks like solving equations, data analysis, and creating visualizations.
CalCam: Transforming Food Tracking with the Gemini API
CalCam, a calorie-tracking app, uses the Gemini API to analyze meal photos, providing users with fast and accurate nutritional information. Polyverse, CalCam's creator, highlights Gemini API's speed, accuracy, and structured JSON output are crucial for...