Google brings multimodal search to AI Mode with enhanced visual search

Google is busy upgrading its Search with new AI features to retain its audience. The tech giant has announced some improvements to AI Mode, as well as expanding access to more users.

Google's AI Mode, a feature designed to enhance search capabilities now supports visual search with the Gemini AI model. It is tailored for longer and more complex user queries, which are twice as long as regular queries. AI Mode can be used for complicated tasks like product comparisons, planning trips, analyzing how-tos, etc.

AI Mode's multimodal approach allows users to upload an image or take a photo, and ask a question about the picture. The AI provides a response that is rich, comprehensive, by using enhanced visual search capabilities in Lens with a custom version of Gemini. It is capable of understanding an image, the context of how objects are related, the scene, colors, shapes, arrangements, unique materials, etc.

Google Search AI Mode

Google says AI Mode uses a "query fan-out" technique, to issue multiple queries about the image's contents, to deliver detailed and extensive information that is contextually relevant. The user can then decide how to proceed with the options. This trend indicates a shift in user engagement, highlighting a demand for deeper and more nuanced search interactions. Google's announcement demonstrates how the AI Mode feature works.

AI Mode was originally available only to Google One AI Premium subscribers, but is now available for all users in the U.S. who opt-in to Labs. You can test AI Mode through the Google app for Android and iOS.

As Beta News notes, AI Mode could be crucial for Google to fend off the competition from ChatGPT, Copilot, Grok, and other services that seek a foothold in the search engine market.

Recent Articles

Intel's Latest Graphics Driver Enhances Assassin's Creed Shadows Experience
Intel's Latest Graphics Driver Enhances Assassin's Creed Shadows Experience
Intel has released a new graphics driver update, version 32.0.101.6647 WHQL, designed to optimize p...
Apple Watch to Feature Integrated Cameras and Advanced AI Capabilities
Apple Watch to Feature Integrated Cameras and Advanced AI Capabilities
Apple is reportedly developing future Apple Watch models equipped with integrated cameras to enable...
Google launches Gemini Live video and screen sharing features for Pixel 9 and Galaxy S25
Google launches Gemini Live video and screen sharing features for Pixel 9 and Galaxy S25
Google has announced an update to Gemini Live platform, by introducing Gemini Live Video and Screen...
Microsoft introduces
Microsoft introduces "Hey, Copilot" wake word for activating the AI on Windows 11
Microsoft has announced yet another new feature for its Copilot app for Windows. You can invoke it ...