Discussion about this post

James

I suspect that the vast difference between Sora and Veo 2 comes from the video library used for pretraining. Google's unfettered access to YouTube probably has much to do with it. Also, given that OpenAI should now have access to iPhone cameras, its library should grow by leaps and bounds and significantly improve Sora. This suggests that the models' grasp of physics is wrapped up in pre- and post-training on video content. On a different note, now that people are desensitized to AI videos on social media, they will become more skeptical of, and less enthralled with, what they see, and hopefully disengage. This should be very damaging to TikTok and Instagram.

Jason Baldridge

Thanks for the thoughtful write-up! I’m on the team that built Imagen 3 and Veo, and this new Veo 2 model is the most exciting new model I’ve had the opportunity to explore and evaluate since we built the Parti image generation model a couple of years back.

An important component of our release of these models is that every image and video is tagged with SynthID so that it can be verified as AI-generated.

https://deepmind.google/technologies/synthid/

We are also part of C2PA, a consortium that adds metadata to generated content.

https://c2pa.org/

These are part of a broader approach to how the benefits of these technologies can be brought to the world while mitigating some of the risks you mention.
