Google unveils Veo, a high-definition AI video generator that may rival Sora

liquid reality —

Google’s video-synthesis model creates minute-long 1080p videos from written prompts.

Still images taken from videos generated by Google Veo.

Enlarge / Still images taken from videos generated by Google Veo.

Google / Benj Edwards

On Tuesday at Google I/O 2024, Google announced Veo, a new AI video-synthesis model that can create HD videos from text, image, or video prompts, similar to OpenAI’s Sora. It can generate 1080p videos lasting over a minute and edit videos from written instructions, but it has not yet been released for broad use.

Veo reportedly includes the ability to edit existing videos using text commands, maintain visual consistency across frames, and generate video sequences lasting up to and beyond 60 seconds from a single prompt or a series of prompts that form a narrative. The company says it can generate detailed scenes and apply cinematic effects such as time-lapses, aerial shots, and various visual styles

Since the launch of DALL-E 2 in April 2022, we’ve seen a parade of new image synthesis and video synthesis models that aim to allow anyone who can type a written description to create a detailed image or video. While neither technology has been fully refined, both AI image and video generators have been steadily growing more capable.

In February, we covered a preview of OpenAI’s Sora video generator, which many at the time believed represented the best AI video synthesis the industry could offer. It impressed Tyler Perry enough that he put his film studio expansions on hold. However, so far, OpenAI has not provided general access to the tool—instead, it has limited its use to a select group of testers.

Now, Google’s Veo appears at first glance to be capable of video-generation feats similar to Sora. We have not tried it ourselves, so we can only go by the cherry-picked demonstration videos the company has provided on its website. That means anyone viewing them should take Google’s claims with a huge grain of salt, because the generation results may not be typical.

Veo’s example videos include a cowboy riding a horse, a fast-tracking shot down a suburban street, kebabs roasting on a grill, a time-lapse of a sunflower opening, and more. Conspicuously absent are any detailed depictions of humans, which have historically been tricky for AI image and video models to generate without obvious deformations.

Google says that Veo builds upon the company’s previous video-generation models, including Generative Query Network (GQN), DVD-GAN, Imagen-Video, Phenaki, WALT, VideoPoet, and Lumiere. To enhance quality and efficiency, Veo’s training data includes more detailed video captions, and it utilizes compressed “latent” video representations. To improve Veo’s video-generation quality, Google included more detailed captions for the videos used to train Veo, allowing the AI to interpret prompts more accurately.

Veo also seems notable in that it supports filmmaking commands: “When given both an input video and editing command, like adding kayaks to an aerial shot of a coastline, Veo can apply this command to the initial video and create a new, edited video,” the company says.

While the demos seem impressive at first glance (especially compared to Will Smith eating spaghetti), Google acknowledges AI video-generation is difficult. “Maintaining visual consistency can be a challenge for video generation models,” the company writes. “Characters, objects, or even entire scenes can flicker, jump, or morph unexpectedly between frames, disrupting the viewing experience.”

Google has tried to mitigate those drawbacks with “cutting-edge latent diffusion transformers,” which is basically meaningless marketing talk without specifics. But the company is confident enough in the model that it is working with actor Donald Glover and his studio, Gilga, to create an AI-generated demonstration film that will debut soon.

Initially, Veo will be accessible to select creators through VideoFX, a new experimental tool available on Google’s AI Test Kitchen website, labs.google. Creators can join a waitlist for VideoFX to potentially gain access to Veo’s features in the coming weeks. Google plans to integrate some of Veo’s capabilities into YouTube Shorts and other products in the future.

There’s no word yet about where Google got the training data for Veo (if we had to guess, YouTube was likely involved). But Google states that it is taking a “responsible” approach with Veo. According to the company, “Videos created by Veo are watermarked using SynthID, our cutting-edge tool for watermarking and identifying AI-generated content, and passed through safety filters and memorization checking processes that help mitigate privacy, copyright, and bias risks.”

Note: This article have been indexed to our site. We do not claim legitimacy, ownership or copyright of any of the content above. To see the article at original source Click Here

Related Posts
European Parliament approves initial proposal to ban some targeted ads thumbnail

European Parliament approves initial proposal to ban some targeted ads

On Thursday, the European Parliament voted to approve the initial draft of a bill that aims to curb Big Tech’s invasive advertising practices (via Bloomberg). The Parliament adopted the draft with 530 votes of approval, 78 against, and 80 absentations. The Digital Services Act, which was first introduced in 2020, will prevent platforms, like Google,…
Read More
Samsung Galaxy S22 Ultra: Δεν είναι απλά όμορφο, αλλά και τέρας αντοχής (βίντεο) thumbnail

Samsung Galaxy S22 Ultra: Δεν είναι απλά όμορφο, αλλά και τέρας αντοχής (βίντεο)

Οι πολυαναμενόμενες σειρές Galaxy S22 και Galaxy Tab S8 της Samsung κυκλοφόρησαν επιτέλους και οι χρήστες δεν φαίνεται να χορταίνουν τις συσκευές, κρίνοντας από τη συντριπτική ζήτηση που αντιμετωπίζει η εταιρεία, ειδικά για τα Samsung Galaxy S22 Ultra και Tab S8 Ultra. Δεν είναι μόνο οι χρήστες, ωστόσο, καθώς οι reviewers κάνουν ήδη τη δουλειά…
Read More

Samsung starts removing ads from its One UI Android Apps

As promised earlier in the year, Samsung is removing ads from its first-party mobile apps. As of today, you won’t see the company advertise things to you in Samsung Pay, Weather, Theme and Health. Reports of the change first started to filter out on Samsung’s Community Forum in South Korea, with 9to5Google and TizenHelp later spotting…
Read More
Singapore telcos to let subscribers block international calls in new anti-scam measure thumbnail

Singapore telcos to let subscribers block international calls in new anti-scam measure

d3sign/Getty ImagesMobile subscribers in Singapore can now instruct their carrier to block all incoming calls from international numbers, as part of the government's efforts to curb the growing volume of online scams targeting the local population. The option is available to customers of the country's four mobile operators: M1, StarHub, Singtel, and Simba, formerly called TPG
Read More
13 Netflix-Serien, die nach nur einer Staffel eingestellt wurden thumbnail

13 Netflix-Serien, die nach nur einer Staffel eingestellt wurden

Netflix-Serien wie “Squid Game”, “The Umbrella Academy”, oder “Stranger Things” sind zu echten Kulturphänomenen geworden. Schon wenige Tage nach ihrer Veröffentlichung forderten begeisterte Zuschauer eine Fortsetzung der spannenden Geschichten. Aber nicht immer gelingt es Eigenproduktionen des Streamers, die Abonnenten von sich zu überzeugen - viele Serien werden bereits nach einer Staffel eingestellt. (Lesen Sie auch:…
Read More
Index Of News
Total
0
Share