Key Highlights
- Alphabet is deploying enhanced Gemini AI capabilities across Docs, Sheets, Slides, and Drive for AI Ultra and Pro members.
- Docs users can leverage Gemini to generate content, replicate writing tones, and import formatting from existing files.
- Sheets functionality now includes building complete spreadsheets through conversational prompts with live Google Search integration.
- The tech giant introduced Gemini Embedding 2, a unified multimodal system supporting text, images, video, audio, and document formats.
- Shares of Alphabet (GOOGL) showed modest gains during Tuesday’s midday session.
Alphabet’s Google division is accelerating its integration of artificial intelligence throughout its suite of productivity applications. On Tuesday, the company revealed that Gemini AI capabilities are becoming available across Google Docs, Sheets, Slides, and Drive platforms — launching in beta exclusively for AI Ultra and Pro tier subscribers with immediate effect.
The rollout initially covers English-language users worldwide for Docs, Sheets, and Slides. Drive’s new features are currently limited to users in the United States, though the company has indicated plans for broader language support in the coming months.
Within Docs, subscribers can describe what they need and Gemini drafts a document by synthesizing information from their files, emails, and the web. The assistant can also match the writing style of another document and carry over formatting from a template file. One use case: auto-populating a travel schedule template with reservation confirmations from the user’s inbox.
Sheets receives comparable enhancements. Users can instruct Gemini to build a complete spreadsheet using natural language. Google’s example is a relocation task list that pulls vendor information from emails and tracks price estimates found in the user’s messages.
The Fill with Gemini feature goes further: it retrieves current information from Google Search to populate table cells automatically. Examples include university admission timelines or cost-of-attendance figures, filled in without manual research.
Presentation and Storage Tools Receive Enhancements
For Slides, Gemini can create individual presentation pages that align with your deck’s established visual identity and color palette. It draws information from documents, emails, and internet sources. Google acknowledged that complete presentation creation from a single instruction remains under development.
Drive’s Ask Gemini feature adds AI-generated overviews above search results, compiled from relevant files and accompanied by source attribution. Users can search across documents, email, scheduling tools, and web content through a single interface.
New Multimodal Embedding Model Debuts
In a parallel announcement, Google launched Gemini Embedding 2, a multimodal model that handles text, images, video clips, audio files, and documents within a single unified embedding framework.
The model accepts up to 8,192 input tokens of text, as many as six images per query in PNG or JPEG format, and video of up to 120 seconds in MP4 or MOV format.
It also processes audio input natively, with no transcription step required, and embeds PDF documents of up to six pages.
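For developers, the reported limits can be expressed as a simple pre-flight check. The sketch below is illustrative only: it encodes the figures quoted above, and every name in it (EmbeddingRequest, check_limits, the constants) is hypothetical rather than part of any Google SDK.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Limits as reported in this article; the constant names are hypothetical.
MAX_TEXT_TOKENS = 8192
MAX_IMAGES = 6
MAX_VIDEO_SECONDS = 120
MAX_PDF_PAGES = 6
IMAGE_FORMATS = {"png", "jpeg"}
VIDEO_FORMATS = {"mp4", "mov"}


@dataclass
class EmbeddingRequest:
    """Hypothetical description of one multimodal input bundle."""
    text_tokens: int = 0
    image_formats: Tuple[str, ...] = ()
    video_seconds: float = 0.0
    video_format: Optional[str] = None
    pdf_pages: int = 0


def check_limits(req: EmbeddingRequest) -> List[str]:
    """Return a list of human-readable problems; empty means within limits."""
    problems = []
    if req.text_tokens > MAX_TEXT_TOKENS:
        problems.append(f"text has {req.text_tokens} tokens (max {MAX_TEXT_TOKENS})")
    if len(req.image_formats) > MAX_IMAGES:
        problems.append(f"{len(req.image_formats)} images (max {MAX_IMAGES})")
    for fmt in req.image_formats:
        if fmt.lower() not in IMAGE_FORMATS:
            problems.append(f"unsupported image format: {fmt}")
    if req.video_seconds > 0:
        if req.video_seconds > MAX_VIDEO_SECONDS:
            problems.append(f"video is {req.video_seconds}s (max {MAX_VIDEO_SECONDS}s)")
        if (req.video_format or "").lower() not in VIDEO_FORMATS:
            problems.append(f"unsupported video format: {req.video_format}")
    if req.pdf_pages > MAX_PDF_PAGES:
        problems.append(f"PDF has {req.pdf_pages} pages (max {MAX_PDF_PAGES})")
    return problems


if __name__ == "__main__":
    request = EmbeddingRequest(text_tokens=9000, video_seconds=150, video_format="mp4")
    for problem in check_limits(request):
        print(problem)
```

A real integration would rely on the limits and error messages published in Google’s own documentation, which may differ from the figures reported here.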
Google stated the model functions across more than 100 languages and serves applications including semantic search operations, sentiment evaluation, and Retrieval-Augmented Generation (RAG) implementations.
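To illustrate what semantic search over embeddings involves, here is a minimal sketch that ranks items by cosine similarity. The three-dimensional vectors and file names are made up for the example and stand in for whatever an embedding model would return; no actual Gemini API call is shown.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, corpus):
    """Return corpus keys ordered from most to least similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), name) for name, vec in corpus.items()]
    return [name for _, name in sorted(scored, reverse=True)]


# Toy vectors (assumed, not real model output). In a multimodal embedding
# space, a PDF, a spreadsheet, and an image can all be compared directly.
corpus = {
    "itinerary.pdf": [0.9, 0.1, 0.0],
    "budget.xlsx": [0.1, 0.9, 0.1],
    "beach.png": [0.7, 0.2, 0.3],
}
query = [1.0, 0.0, 0.1]

if __name__ == "__main__":
    print(rank_by_similarity(query, corpus))
```

In a RAG pipeline, the top-ranked items would then be passed to a language model as context for generating an answer.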
The company claimed superior performance over competing models from Amazon and Voyage on text, image, and video benchmarks, though these results come from Google’s own internal evaluations.
Alphabet (GOOGL) stock registered slight upward movement during Tuesday’s midday trading hours.



