AI | LLM Models

Gemini (Google)

Reference Page 13 views

The Gemini Family

Gemini is Google DeepMind's flagship multimodal AI model, designed to understand and generate text, images, audio, video, and code natively.

Model Tiers

Gemini Nano: A compact model designed to run on-device (smartphones, tablets). Powers features like summarization and smart reply on Pixel phones.

Gemini Pro: The mid-tier model for general-purpose tasks. Powers Google AI Studio and many Google product integrations.

Gemini Ultra: The most capable tier, designed for highly complex reasoning, coding, and multimodal tasks.

Native Multimodality

Unlike models that bolt on image understanding as an add-on, Gemini was trained from the ground up to process multiple types of data. This means it can seamlessly reason across text, images, charts, and code in a single conversation.

Integration Across Google

Gemini is deeply integrated into Google's ecosystem: Search (AI Overviews), Gmail (email drafting), Docs (writing assistance), Sheets (formula generation), and Android. This gives it unmatched distribution to billions of users.

AI Articles