How Ask Photos in Google Photos will work

Gemini-powered Ask Photos is coming to Google Photos this summer, and the company this week shared a bit more about how it works. 

The Google Research team says Ask Photos is a “powerful example of how Gemini models can act as agents via function calling and memory capabilities.” Sample queries Google has provided outside the on-stage announcement include:

  • “Show me the best photo from each national park I’ve visited.” 
  • “What themes have we had for Lena’s birthday parties?”

Your conversational query is “passed to an agent model that uses Gemini to determine the best retrieval augmented generation (RAG) tool for the task.”

Typically, the agent model first works out the user’s intent, then formulates a search through their photos using an updated vector-based retrieval system, which extends the already powerful metadata search built into Photos.

That system is better at understanding natural language concepts, like “a person smiling while riding a bike,” than keyword search. 
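To make the idea of vector-based retrieval concrete, here is a minimal sketch. Google has not published how Ask Photos embeds or indexes images, so everything below is an assumption for illustration: the `embed`, `cosine`, and `search` functions, the `PHOTOS` captions, and the photo IDs are all hypothetical. A toy bag-of-words vector stands in for a learned embedding model, which is what would actually capture concepts rather than exact words.

```python
import math
from collections import Counter

def embed(text):
    """Toy embedding: word counts. A stand-in for a learned embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def search(query, captions, top_k=2):
    """Return the IDs of the top_k captions most similar to the query."""
    q = embed(query)
    ranked = sorted(
        captions.items(),
        key=lambda item: cosine(q, embed(item[1])),
        reverse=True,
    )
    return [photo_id for photo_id, _ in ranked[:top_k]]

# Hypothetical captions; a real system would embed the images themselves.
PHOTOS = {
    "img_001": "a person smiling while riding a bike in the park",
    "img_002": "birthday cake with candles on a table",
    "img_003": "a dog running on the beach",
}

best = search("person riding a bike", PHOTOS, top_k=1)
```

The ranking step is the same whatever produces the vectors: score every item against the query and keep the closest matches, rather than requiring an exact keyword hit.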

An answer model then looks at the photos and videos returned by search. “Gemini’s long context window and multimodal capabilities” are leveraged to find the most relevant information. Beyond the visual content and any text, it also uses dates, locations, and other metadata.

Finally, the answer model crafts a helpful response grounded in the photos and videos it has studied.
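The flow described above (an agent picks a retrieval tool, the tool fetches photos, an answer model responds) can be sketched roughly as follows. This is not Google’s implementation: `choose_tool` and `answer` are simple rule-based stand-ins for the Gemini agent and answer models, and the tool names, `LIBRARY` contents, and `ask_photos` function are hypothetical.

```python
import re
from typing import Callable, Dict, List

Photo = Dict[str, str]

# Hypothetical library; captions and metadata are made up for illustration.
LIBRARY: List[Photo] = [
    {"id": "img_001", "caption": "sunset over the valley", "location": "Yosemite"},
    {"id": "img_002", "caption": "Lena's birthday party, unicorn theme", "location": "home"},
]

def words(text: str) -> set:
    """Lowercased alphabetic tokens of a string."""
    return set(re.findall(r"[a-z]+", text.lower()))

def metadata_search(query: str) -> List[Photo]:
    """Tool 1: match photos whose location is named in the query."""
    q = query.lower()
    return [p for p in LIBRARY if p["location"].lower() in q]

def caption_search(query: str) -> List[Photo]:
    """Tool 2: word overlap with captions (stand-in for vector retrieval)."""
    q = words(query)
    return [p for p in LIBRARY if q & words(p["caption"])]

TOOLS: Dict[str, Callable[[str], List[Photo]]] = {
    "metadata_search": metadata_search,
    "caption_search": caption_search,
}

def choose_tool(query: str) -> str:
    """Stand-in for the agent model: pick a tool by a simple rule."""
    q = query.lower()
    if any(p["location"].lower() in q for p in LIBRARY):
        return "metadata_search"
    return "caption_search"

def answer(query: str, photos: List[Photo]) -> str:
    """Stand-in for the answer model: a response grounded in the results."""
    if not photos:
        return "No matching photos found."
    ids = ", ".join(p["id"] for p in photos)
    return f"Found {len(photos)} photo(s) for '{query}': {ids}"

def ask_photos(query: str) -> dict:
    """Run the three stages: choose a tool, retrieve, then answer."""
    tool = choose_tool(query)
    photos = TOOLS[tool](query)
    return {
        "tool": tool,
        "photo_ids": [p["id"] for p in photos],
        "text": answer(query, photos),
    }

result = ask_photos("photos from Yosemite")
```

Keeping the tools in a name-to-function table mirrors the function-calling pattern: the model only has to emit a tool name and arguments, and ordinary code does the dispatch.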

What’s interesting is how you can correct Ask Photos and the app will remember that information for future conversations. In this regard, it’s more than a search feature and could be used like an assistant. You will be able to “view and manage remembered details at any time.”  
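One way to picture remembered corrections is as a small store the user can inspect and edit. The sketch below is purely illustrative: Google has not described how Ask Photos stores these details, and the `Memory` class and its `remember`, `forget`, and `view` methods are hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class Memory:
    """Hypothetical store for details the user has corrected or confirmed."""
    facts: Dict[str, str] = field(default_factory=dict)

    def remember(self, key: str, value: str) -> None:
        """Save or overwrite a remembered detail."""
        self.facts[key] = value

    def forget(self, key: str) -> None:
        """Delete a remembered detail; do nothing if it is absent."""
        self.facts.pop(key, None)

    def view(self) -> List[str]:
        """List remembered details so the user can review them."""
        return sorted(f"{k}: {v}" for k, v in self.facts.items())

memory = Memory()
memory.remember("Lena", "my daughter")
memory.remember("favorite park", "Yosemite")
remembered = memory.view()
```

Later queries could consult `memory.facts` before retrieval, and `forget` covers the "manage" half of viewing and managing remembered details.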

This experimental feature, which could be related to the rumored Project Ellman, is rolling out over the coming months and more capabilities are already being teased.

