Skip to content

Google's Gemini App Unveils AI Image Editor with Real-World Reasoning

Merge images and transfer details effortlessly. Edit locally with text input. Google's new AI image editor is here.

In the given image i can see a photo of a girl,card with text,artificial butterfly and few objects.
In the given image i can see a photo of a girl,card with text,artificial butterfly and few objects.

Google's Gemini App Unveils AI Image Editor with Real-World Reasoning

Google's Gemini app has introduced a groundbreaking AI image editing model, Gemini 2.5 Flash Image Generation. This new feature allows users to radically alter images while keeping key elements recognizable, demonstrating real-world reasoning capabilities.

The model, developed by Google DeepMind, is now available within the Gemini app. Users can access it by selecting the 'Flash' language model at the top left. Gemini 2.5 Flash can grasp simple causal relationships and visually represent them, showing impressive real-world reasoning.

One standout feature is the ability to merge up to three images for complex compositions. Users can transfer color, texture, or design from one object to another while retaining its shape and details. This model builds on the native Gemini language model's image generation capabilities, behaving accurately in prompt implementation, similar to GPT-4 from ChatGPT.

Gemini 2.5 Flash also enables precise, locally limited edits via text input. Users can blur backgrounds, remove spots, add colors, or delete objects without manual selection tools. A key feature is character consistency, allowing users to represent a person, object, or animal consistently across different images.

Google has integrated the new image editing model, Gemini 2.5 Flash Image Generation, into the Gemini app. It's also available as a preview version via the Gemini API, Google AI Studio, and Vertex AI, with usage costs of $30 per million output tokens, approximately $0.039 per image. This model significantly enhances the image editing experience, offering users a powerful tool for creative expression and precise manipulation.

Read also:

Latest