
Video Magic: Google Lens Redefines Search

Google Lens Leaps into the Future: Search with Your Voice and Video!


Mackenzie Ferguson

Edited by Mackenzie Ferguson, AI Tools Researcher & Implementation Consultant

Google Lens has introduced a feature that lets users search with video and voice. You can now record a video of the world around you and ask a question aloud. Google's Gemini AI model analyzes the video sequence to generate a response, and the feature is available on Android and iOS.


Google Lens now accepts video as search input. The feature, which is rolling out gradually in Search Labs on Android and iOS, lets users ask questions aloud about what they are recording. It is a significant expansion of Lens, which was previously limited to still images.

The feature uses Google's Gemini AI model to process videos recorded through Google Lens, so users can get answers in real time to questions about what the camera sees. For example, a visitor at an aquarium can record a video of fish, ask “Why are they swimming together?” and receive an AI-generated answer with context. Google first introduced the capability at its I/O conference in May.


Lens captures the video as a series of image frames and applies computer vision techniques that have previously been used in Google Lens. As Rajan Patel, Google’s vice president of engineering, explained, the technology processes these frames in sequence and generates a response based on information from the web. The development reflects Google's continued work in artificial intelligence and machine learning.
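Google has not published how Lens selects or processes frames, so the following is only an illustrative sketch of the general idea of treating a video as an ordered series of frames. The function name `sample_frame_indices`, its parameters, and the one-frame-per-second default are all assumptions made for this example, not details from Google.

```python
from typing import List


def sample_frame_indices(
    total_frames: int, fps: float, samples_per_second: float = 1.0
) -> List[int]:
    """Return the indices of the frames to keep, in playback order.

    Hypothetical helper: picks evenly spaced frames so that a later
    step could analyze them one after another.
    """
    if total_frames <= 0:
        return []
    if fps <= 0 or samples_per_second <= 0:
        raise ValueError("fps and samples_per_second must be positive")
    # Distance between kept frames; never less than one frame.
    step = max(1, round(fps / samples_per_second))
    return list(range(0, total_frames, step))


# A 3-second clip recorded at 30 fps, sampled once per second.
indices = sample_frame_indices(total_frames=90, fps=30.0)
print(indices)  # [0, 30, 60]
```

Each selected index would then be passed, in order, to whatever image analysis step follows; that step is outside the scope of this sketch.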

Google Lens cannot yet identify sounds in video recordings, such as bird chirps. However, the company is exploring the possibility, which suggests that future versions could add sound analysis to video-based searches and extend search from visual to auditory recognition.

Beyond video, the update also adds voice query support to photo search in Google Lens. Users can now point their camera, capture an image, and ask their question aloud instead of typing it.

The feature has implications for businesses and consumers alike. For businesses, it offers an opportunity to develop interactive applications and services built on real-time visual analysis, which could change customer engagement and support models. For consumers, it makes information quicker and more convenient to reach, which suits an increasingly on-the-go digital lifestyle.

The rollout strengthens Google's position as a leader in AI-driven technologies. By integrating voice and video into Google Lens, the company is both improving user interaction and reaffirming its commitment to making AI available for everyday use. As the features become widely available, they should give users more intuitive and efficient ways to interact with the world around them.
