https://www.tiktok.com/@dreamrelicc
Before AI, each video on this channel would have taken a large team with a Hollywood budget to create. In a few more years, one person may be able to turn their creative vision into a full-length movie.
Whether said fun is "worth" the social and economic costs is a separate issue.
Trust in media? Soaring! Why believe your eyes or ears when you can doubt everything equally?
Journalism? Thriving! Reporters now get to spend their days playing forensic video detective instead of, you know, reporting news.
Social harmony? Better than ever! Nothing brings people together like shared paranoia and the collective shrug of “I guess truth is dead now.”
Honestly, what could possibly go wrong?
Obviously this will get used for a lot of evil or bad as well
What are the benefits of what you do? Does anyone know?
- Scamming people at scale
- Nonconsensual pornography
- Juicing engagement metrics for fading social media sites
- The ongoing destruction of truth as a concept in our increasingly atomized and divided world
It seems like OpenAI is trying to turn Sora into a social network - TikTok but AI.
The webapp is heavily geared towards consumption, with a feed as the entry point, liking and commenting for posts, and user profiles having a prominent role.
The creation aspect seems about as important as on Instagram, TikTok etc - easily available, but not the primary focus.
Generated videos are very short, with minimal controls. The only selectable option is picking between landscape and portrait mode.
There is no mention or attempt to move towards long form videos, storylines, advanced editing/controls/etc, like others in this space (eg Google Flow).
Seems like they want to turn this into AITok.
Edit: regarding accurate physics ... check out these two videos below...
To be fair, Veo fails miserably with those prompts also.
https://sora.chatgpt.com/p/s_68dc32c7ddb081919e0f38d8e006163...
https://sora.chatgpt.com/p/s_68dc3339c26881918e45f61d9312e95...
Veo:
https://veo-balldrop.wasmer.app/ballroll.mp4
https://veo-balldrop.wasmer.app/balldrop.mp4
Couldn't help but mock them a little, here is a bit of fun... the prompt adherence is pretty good, at least.
NOTE: there are plenty of quite impressive videos being posted, and a lot of horrible ones also.
OpenAI did not stealthily release Sora 2 to the image and video ELO ranking leaderboards ahead of time as is now somewhat tradition.
This model is probably designed to run fast and cheap as a social play. Emphasis on putting you and your friends into popular franchises and IPs.
OpenAI probably has a totally different model for their Hollywood-grade VFX. One that's too expensive to offer $20/mo consumers.
- - - - -
EDIT:
Oh my god, OpenAI literally just disrupted TikTok:
https://x.com/GabrielPeterss4/status/1973071380842229781
https://x.com/GabrielPeterss4/status/1973122324984693113
https://x.com/GabrielPeterss4/status/1973121891926942103
https://x.com/GabrielPeterss4/status/1973120058907041902 (potentially dangerous ... )
https://x.com/GabrielPeterss4/status/1973111654524264763
https://x.com/GabrielPeterss4/status/1973090475486879818
https://x.com/GabrielPeterss4/status/1973110596825653720 (is this the same model? It doesn't look like it.)
https://x.com/GabrielPeterss4/status/1973096194508251321
https://x.com/GabrielPeterss4/status/1973086729281347650
https://x.com/GabrielPeterss4/status/1973088038851932522 (this is truly something only kids will love)
https://x.com/GabrielPeterss4/status/1973087595967201449
https://x.com/GabrielPeterss4/status/1973077105903620504
Holy shit!
This is 100% the future of what kids will do. This is incredible for short form vertical video.
It doesn't need to look good, it just needs to let you tell incredible stories with people and things you care about.
This is way better than Meta's social video app.
But clearly we also see some major downsides. We already have an epidemic of social media rotting people's minds, and everything about this capability is set to supercharge these trends. OpenAI addresses some of these concerns, but there's absolutely no reason to think that OpenAI will do anything other than what they perceive as whatever makes them the most money.
An analogy would be a company coming up with a way to synthesize and distribute infinite high-fructose corn syrup. There are positive aspects to cheaply making sweet tasting food, but we can also expect some very adverse effects on nutritional health. Sora looks like the equivalent for the mind.
There's an optimistic take on this fantastic new technology making the world a better place for all of us in the long run, after society and culture have adapted to it. It's going to be a bumpy ride before we get there.
@qoez
> The first entirely AI generated film (with Sora or other AI video tools) to win an Oscar will be less than 5 years away.
https://news.ycombinator.com/item?id=42368951
This prediction of mine was only 10 months ago.
Imagine when we and if we get to 5 years.
The worst part is we are already seeing bad actors saying 'I didn't say that' or 'I didn't do that, it was a deep fake'. Now you will be able to say anything in real life and use AI for plausible deniability.
Today's Sora can produce something that resembles reality from a distance, but if you look closely, especially if there's another perspective or the scene is atypical, the flaws are obvious.
Perhaps tomorrow's Sora will overcome the the "final 10%" and maintain undetectable consistency of objects in 2 perspectives. But that would require a spatial awareness and consistency that models still have a lot of trouble with.
It's possible to produce some video or image that looks real, cherry-picked for a demo, but not possible to produce any arbitrary one you want that will end up passable.
I'm optimistic here.
Look at 1900s tech like social security number/card, and paper birth certificates. Our world is changing and new systems of verification will be needed.
I see this as either terribly dystopian - or - a possibility for the mass expansion of cryptography and encrypted/signed communication. Ideally in privacy preserving ways because nothing else will make as much sense when it comes to the verification that countries will need to give each other even if they want backdoor registry BS for the common man.
Breaking changes get fixes.
I doubt it will be for the better. The ubiquity of AI deepfakes just reenforces entrenchment around "If the message reinforces my preconceived notion, I believe it and think anyone who calls it fake is stupid/my enemy/pushing an agenda. If the message contradicts my preconceived notion, it's obviously fake and anyone who believes it is stupid/my enemy/pushing an agenda.". People don't even take the time to think "is this even plausible", much less do the intellectual work to verify.
If they got the generation "live" enough, imagine walking past a mirror in a department store and seeing yourself in different clothes.
Wild times.
But Sora /VEO will probably also revolutionize movies and tv content
I love this AI video technology.
Here are some of the films my friends and I have been making with AI. These are not "prompted", but instead use a lot of hand animation, rotoscoping, and human voice acting in addition to AI assistance:
https://www.youtube.com/watch?v=H4NFXGMuwpY
https://www.youtube.com/watch?v=tAAiiKteM-U
https://www.youtube.com/watch?v=7x7IZkHiGD8
https://www.youtube.com/watch?v=Tii9uF0nAx4
Here are films from other industry folks. One of them writes for a TV show you probably watch:
https://www.youtube.com/watch?v=FAQWRBCt_5E
https://www.youtube.com/watch?v=t_SgA6ymPuc
https://www.youtube.com/watch?v=OCZC6XmEmK0
I see several incredibly good things happening with this tech:
- More people being able to visually articulate themselves, including "lay" people who typically do not use editing software.
- Creative talent at the bottom rungs being able to reach high with their ambition and pitch grand ideas. With enough effort, they don't even need studio capital anymore. (Think about the tens of thousands of students that go to film school that never get to direct their dream film. That was a lot of us!)
- Smaller studios can start to compete with big studios. A ten person studio in France can now make a well-crafted animation that has more heart and soul than recent by-the-formula Pixar films. It's going to start looking like indie games. Silksong and Undertale and Stardew Valley, but for movies, shows, and shorts. Makoto Shinkai did this once by himself with "Voices of a Distant Star", but it hasn't been oft repeated. Now that is becoming possible.
You can't just "prompt" this stuff. It takes work. (Each of the shorts above took days of effort - something you probably wouldn't know unless you're in the trenches trying to use the tech!)
For people that know how to do a little VFX and editing, and that know the basic rules of storytelling, these tools are remarkable assets that compliment an existing skill set. But every shot, every location, every scene is still work. And you have to weave that all into a compelling story with good hooks and visuals. It's multi-layered and complex. Not unlike code.
And another code analogy: think of these models like Claude Code for the creative. An exoskeleton, but not the core driving engineer or vision that draws it all together. You can't prompt a code base, and similarly, you can't prompt a movie. At least not anytime soon.
"The studios and creators who thrive in this new landscape will be those who can effectively harness AI’s capabilities while maintaining the human creativity and vision that ultimately drives the art of cinema."
It is in many ways thrilling to see this come to life, and I couldn't agree with you more.
1/ 0m23s: The moon polo players begin with the red coat rider putting on a pair of gloves, but they are not wearing gloves in the left-vs-right charge-down.
2/ 1m05s: The dragon flies up the coast with the cliffs on one side, but then the close-up has the direction of flight reversed. Also, the person speaking seemingly has their back to the direction of flight. (And a stripy instead of plain shirt and a harness that wasn’t visible before.)
3/ 1m45s: The ducks aren't taking the right hand corner into the straightaway. They are heading into the wall.
I do wonder what the workflow will be for fixing any more challenging continuity errors.
State of the things with doom scrolling was already bad, add to it layoffs and replacing people with AI (just admit it, interns are struggling competing with Claude Code, Cursor and Codex)
What's coming next? Bunch of people, with lots of free time watching non-sense AI generated content?
I am genuinely curious, because I was and still excited about AI, until I saw how doom scrolling is getting worse
They are not. This is false, zirp ended, this is the problem. Not LLMs.
Wasn't this always the outcome of the post labor economy?
For this discussion lets just say that AI+Robots could replace most human labor and thinking. What do people do? Entertainment is going to be the number one time consumer.
For example, I saw a lot of people criticizing "Wish" (2023, Disney) for being a good movie in the first half, and totally dropping the ball in the last half. I haven't seen it yet, but I'm wondering if fans will be able to evolve the source material in the future to get the best possible version of it.
Maybe we will even get a good closure for Lost (2004)!
(I'm ignoring copyright aspects, of course, because those are too boring :D)
100% sure we will see people re-doing movie parts. Also see https://en.wikipedia.org/wiki/The_Phantom_Edit
Much more mundane, but useful!
You must understand that infinite copyright is the author's right, and AI companies must be sued for 50 trillion dollars.
About 6 months ago I asked a few different AIs if they could translate a song for me as a learning experience, meaning not a simple translation, but more a word by word explanation of what each word meant, how it was conjugated, any more musical/lyrical only uses that aren't common outside of songs, and so on. I was consistently refused on copyright grounds, despite this seeming a fair use given the educational nature. If I pasted a line of the lyrics at a time, it would work initially, but eventually I would need to start a new chat because the AI determined I translated too much at once.
So in this one, if I wanted to ask it to create a video of the moment in Final Fantasy 6 when the bad guy wins, or a video of the main characters of Final Fantasy 7 and 8 having a sword duel, would it outright refuse for copyright reasons?
It sounds like it would block me, which makes me lose a bit of interest in the technology. I could try to get around it, but at what point might that lead to my account being flagged as a trouble maker trying to bypass 'safety' features. I'm hoping in a few years the copyright fights on AI dies down and we get more fair use allowance instead of the tighter limitations to try to prevent calls for tighter regulation.
I watch videos for two reasons. To see real things, or to consume interesting stories. These videos are not real, and the storytelling is still very limited.
So, for the same reason you'd go to a local art gallery
> new AI feature/model comes out
> "it's going to replace people in this field! they better start looking for a new job!!!"
why is this a good thing?
Pretty much the same problem we all work on every day in $DAY_JOB.
I feel like this is the ultimate extension of "it feels like my feed is just the artificial version of what's happening my friends and doesn't really tell me anything about how they're actually faring."
I have to imagine there will be a rebellion against all of this at some point, when people simply can’t take the false realities anymore. What is the alternative? Ready Player One? The Matrix? Wall-E?
The technology itself is super impressive, but a social media app of AI slop doesn't feel like the best use of it. I'm old enough to not really be interested in social media in general anymore, so maybe I'm just out of touch, but I just can't see this catching on. It feels like the type of thing that people will download, use a few times until the novelty wears off and then never open again.
I am bullish on this, albeit with major concerns in many domains. It was fun and addictive as hell with images. With video it will be wild.
I don't expect Sora2 to be SOTA. The Chinese models are further ahead in video/image gen
AI could be helpful here, but it's not clear that it is required or an improvement.
The music industry already went through this with AutoTune and we know how that turned out.
I agree. At best, short videos can be entertainment that destroys your attention span. Anything more is impossible. Even if there were no bad actors producing the content, you can't condense valuable information into this format.
If you never expected Altman to be the figurehead of principled philosophy, none of this should surprise you. Of course the startup alumni guy is going to project maligned expectations in the hopes of being a multi-trillion dollar company. The shareholders love that shit, Altman is applying the same lessons he learned at Worldcoin to a more successful business.
There was never any question why Altman was removed, in my mind. OpenAI outgrew it's need for grifters, but the grifter hadn't yet outgrown his need for OpenAI.
I can't find the link now, but I saw a continuous shot video of a grocery store from the perspective of a fly. It was shot in the 90s music video style and looked so damn good.
Some of the stuff being done by these guys is also a whole lot of fun (slightly NSFW and political content), and it fits the music video theme:
Personally, I feel mixed feelings. I'm impressed, but I'm not looking forward to the new "movies" that are going to litter YouTube et al generated from this.
The point is that sora2 demo videos seemed impressive but I just didn't feel any real excitement. I am not sure who this is really helping.
Sora 2 itself as a video model doesn't seem better than Veo 3/Kling 2.5/Wan 2.2, and the primary touted feature of having a consistent character can be sufficiently emulated in those models with an input image.
These LLMs might make content that looks initially impressive but they are absolutely not performing physically based rendering or have any awareness of the lighting arrangement in these scenes. There are a lot of things they get right, but you only have to screw up one small element to throw the whole thing off.
I am willing to bet that Unreal Engine 5 will continue to produce more realistic human faces than OAI ever can with these types of models. You cannot beat the effects of actually running raytracing in a PBR pipeline.
CGI for fantasy stuff is unavoidable, but when it's stuff that could have been done by actors but is instead AI, then to me it just feels cheap and nasty - fake.
How much are they (and providers of similar tools) going to be able to keep anyone from putting anyone else in a video, shown doing and saying whatever the tool user wants?
Will some only protect politicians and celebrities? Will the less-famous/less-powerful of us be harassed, defamed, exploited, scammed, etc.?
"Consent-based likeness. Our goal is to place you in control of your likeness end-to-end with Sora. We have guardrails intended to ensure that your audio and image likeness are used with your consent, via cameos. Only you decide who can use your cameo, and you can revoke access at any time. We also take measures to block depictions of public figures (except those using the cameos feature, of course). Videos that include your cameo—including drafts created by other users—are always visible to you. This lets you easily review and delete (and, if needed, report) any videos featuring your cameo. We also apply extra safety guardrails to any video with a cameo, and you can even set preferences for how your cameo behaves—for example, requesting that it always wears a fedora."
I love the casual reminds that we're second-class citizens each time a new technology gets released. Available in the US but always excluding Puerto Rico.
I imagine it won’t necessarily be used in long scenes with subtle body language, etc involved. But maybe it’ll be used in other types of scenes?
Like you have an exterior shot of a cabin, the surrounding environment, etc — all generated. Then you jump inside which can be shot on a traditional set in a studio.
Getting that establishing shot in real life might cost $30K to find a location, get the crew there, etc. Huge boon to indie films on a budget, but being able to endlessly tweak the shot is valuable even for productions that could afford to do it IRL.
Sora 2 represents significant progress towards [AGI]. In keeping with OpenAI’s mission, it is important that humanity benefits from these models as they are developed.
This seems like a good time to remind ourselves of the original OpenAI charter: https://web.archive.org/web/20230714043611/https://openai.co...I wonder how exactly they reconcile the quote above with "We are concerned about late-stage AGI development becoming a competitive race without time for adequate safety precautions"...
However, personalization (teleporting yourself into a video scene) is boring to me. At its core, it doesn't generate new experience to me. My experience is not defined by photos / videos I took on a trip.
however as they hint at a little in the announcement, if video generation becomes good enough at simulating physics and environments realistically, that's very interesting for robotics.
> I don't have the privilege to think everything ain't political
I think feeling like you need to use that in marketing copy is a pretty good clue in itself both that its not, and that you don’t believe it is so much as desperately wish it would be.
Sora 2 itself looks and sounds a little poorer than Google Veo 3. (Which is itself not currently ranked as the top video model. The Chinese models are dominating.)
I think Google, with their massive YouTube data set, is ultimately going to win this game. They have all the data and infrastructure in the world to build best-in-class video models, and they're just getting started.
The social battle will be something completely different, though. And that's something that I think OpenAI stands a good chance at winning.
Edit: Most companies that are confident of their image or video models stealthily launch it on the Model Arena a week ahead of the public model release. OpenAI did not arrange to do that for Sora 2.
Nano Banana, Seedream/Seedance, Kling, and several other models have followed this pattern of "stealth ELO ranking, then reveal pole position".
https://artificialanalysis.ai/text-to-video/arena?tab=leader...
The fact that this model is about "friends" and "social" implies that this is an underpowered model. You probably saw a cherry picked highlight reel with a large VRAM context, but the actual consumer product will be engineered for efficiency. Built to sustain a high volume of cheap generations, not expensive high quality ones. A product built to face off against Meta. That model compete on the basis of putting you into videos with Pikachu, Mario, and Goku.
I expect the "cameo" feature is an attempt at capturing that viral magic a second time.
However, I still don't see how OpenAI beats Google in video generation. As this was likely a data innovation, Google can replicate and improve this with their ownership of YouTube. I'd be surprised if they didn't already have something like this internally.
This is something I would not like to see, I prefer product videos to be real, I am taking a risk with my money. If the product has hallucinated or unrealistic depiction it would be a kind of fraud.
If anything, it looks a lot worse than a lot of AI-generated videos I've seen in the past, despite being a tech demo with carefully curated shots. Veo 3 just blows this out of the water for example.
The recent Google Veo 3 paper "Video models are zero-shot learners and reasoners" made a fascinating argument for video generation models as multi-purpose computer vision tools in the same way that LLMs are multi-purpose NLP tools. https://video-zero-shot.github.io/
It includes a bunch of interesting prompting examples in the appendix, it would be interesting to see how those work against Sora 2.
I wrote some notes on that paper here: https://simonwillison.net/2025/Sep/27/video-models-are-zero-...
The worst part is we are already seeing bad actors saying 'I didn't say that' or 'I didn't do that, it was a deep fake'. Now you will be able to say anything in real life and use AI for plausible deniability.
I predict a re-resurgence in life performances. Live music and live theater. People are going to get tired of video content when everything is fake.
It's not that I disagree with the criticism; it's rather that when you live on the moving edge it's easy to lose track of the fact that things like this are miraculous and I know not a single person who thought we would get results "even" like this, this quickly.
This is a forum frequented by people making a living on the edge—get it. But still, remember to enjoy a little that you are living in a time of miracles. I hope we have leave to enjoy that.
> A lot of problems with other apps stem from the monetization model incentivizing decisions that are at odds with user wellbeing. Transparently, our only current plan is to eventually give users the option to pay some amount to generate an extra video if there’s too much demand relative to available compute. As the app evolves, we will openly communicate any changes in our approach here, while continuing to keep user wellbeing as our main goal.
Like, it should be preferable to keep all the slop in the same trough. But it's like they can't come up with even one legitimate use case, and so the best product they can build around the technology is to try to create an addictive loop of consuming nothing but auto-generated "empty-calories" content.
Basically proper working persistence of the scene.
For example, I'm working with a walking and talking character at this time using multiple AI video models and systems. Generated clips any length longer than 8 seconds risk rapid quality loss, but sometimes you can get up to 12-19 seconds without the generation breaking down. That means one needs to simulate a multiple camera shoot on a stage, so you can cut around the character(s) and create a longer sequence. But now you need to have multiple views of the same location to place your character(s) into - and current AI models can't reliably give you a "different angled views" of an environment. We just got consistent different views of characters, and it'll be another period until environments can be generally examined from any view. BUT, that's if people realize this is not in the models yet, and so far people are so fascinated by the fantasy violence and sexual content they can make nobody realizes you cannot simply "look left and right" in any of these models and that even works with consistency or reliability. There are workarounds, like creating one's entire set and environments in 3D models, for use as the backgrounds and starting frames, but that's now 3D media production + AI, and none of the AI tools generate media that even has alpha channels, and a lot of similar incompatibilities like that.
Even moreso than Facebook tags, the person being cast can cause the deletion of the source video at any time.
There's still something off about the movements, faces and eyes. Gollum features.
I kid.
Art should require effort. And by that I mean effort on the part of the artist. Not environmental damage. I am SO tired of non tech friends SWOONING me with some song they made in 0.3 seconds. I tell them, sarcastically, that I am indeed very impressed with their endeavors.
I know many people will disagree with me here, but I would be heart broken if it turned out someone like Nick Cave was AI generated.
And of course this goes into a philosophical debate. What does it matter if it was generated by AI?
And that's where we are heading. But for me I feel effort is required, where we are going means close to 0 effort required. Someone here said that just raises the bar for good movies. I say that mostly means we will get 1 billion movies. Most are "free" to produce and displaces the 0.0001% human made/good stuff. I dunno. Whoever had the PR machine on point got the blockbuster. Not weird, since the studio tried 300 000 000 of them at the same time.
Who the fuck wants that?
I feel like that ship in Wall-E. Let's invest in slurpies.
Anyway; AI is here and all of that, we are all embracing it. Will be interesting to see how all this ends once the fallout lands.
Sorry for a comment that feels all over the place; on the tram :)
I know, I know. Most people don't care. How exciting.
Anyone, literally anyone, can use it (eventually) to generate incredible scenes. Imagine the person who comes up with a short film about an epic battle between griffins and aliens...Or a simple story of a boy walking in the woods with their dog...Or a story of a first kiss. Previously people were limited to what they had at hand. They couldn't produce a video because it was too costly. Now they can craft a video to meet their vision.
I do find it exciting.
So much visual power, yet so little soul power. We are dying.
>Every AI video demonstration is always about funny stuff and fancy situations.
The thing about AI slop is that by its very nature, unless it's heavily reined in by a human, it's invariably lowest common denominator garbage. It very likely will generate something you yourself could think of within the first five seconds of hearing the prompt, not some very clever take on it, so it can only work as a placeholder (AI as a replacement of stock images is great, for example) or to add background detail where it won't call attention to itself and its genericity.
>imagine building a video about the moment Jesus was born
Given there are multiple paintings on the subject, I very much doubt no one has generated something like that already.
Edit: looks like this post was actually first, so maybe we'll reverse the merge
If I start a new chat it works.
I'm a Plus subscriber and didn't hit rate limits.
This video gen tool will probably be even more useless.
My boss sends me complete AI Workslop made with these tools and he goes "Look how wild this is! This is the future" or sends me a youtube video with less than a thousand views of a guy who created UGC with Telegram and point and click tools.
I don't ever think he ever takes a beat, looks at the end product, and asks himself, "who is this for? Who even wants this?", and that's aside from the fact that I still think there are so many obvious tells with this content that make you know right away that it is AI.
Tangentially related: it's wild to me that people heading such consequential projects have so little life experience. It's all exuberance and shiny things, zero consideration of the impacts and consequences. First Meta with "Vibes", now this.
1: https://www.gurufocus.com/news/3124829/openai-plans-to-launc...
What am I looking at that's super technically impressive here? The clips look nice, but from one cut to the next there's a lot of obvious differences (usually in the background, sometimes in the foreground).
I think OpenAI is actually doing a great job at easing people into these new technologies. It's not such a huge leap in capabilities that it's shocking, and it helps people acclimate for what's coming. This version is still limited but you can tell that in another generation or two it's going to break through some major capabilities threshold.
To give a comparison: in the LLM model space, the big capabilities threshold event for me came with the release of Gemini 2.5 Pro. The models before that were good in various ways, but that was the first model that felt truly magical.
From a creative perspective, it would be ideal if you could first generate a fixed set of assets, locations, and objects, which are then combined and used to bring multiple scenes to life while providing stronger continuity guarantees.
I saw some promnise with the Segment Anything model but I haven't seen anyone yet turn it into a motion solver. In fact I'm not sure if can do that at all. It may be that we need to use an AI algorithm to translate the video into a more simple rendition (colored dots representing the original motion) that can then be tracked more traditionally.
> How the FUCK does Sora 2 have such a perfect memory of this Cyberpunk side mission that it knows the map location, biome/terrain, vehicle design, voices, and even the name of the gang you're fighting for, all without being prompted for any of those specifics??
> Sora basically got two details wrong, which is that the Basilisk tank doesn't have wheels (it hovers) and Panam is inside the tank rather than on the turret. I suppose there's a fair amount of video tutorials for this mission scattered around the internet, but still––it's a SIDE mission!
Everyone already assumed that Sora was trained on YouTube, but "generate gameplay of Cyberpunk 2077 with the Basilisk Tank and Panam" would have generated incoherent slop in most other image/video models, not verbatim gameplay footage that is consistent.
For reference, this is what you get when you give the same prompt to Veo 3 Fast (trained by the company that owns YouTube): https://x.com/minimaxir/status/1973192357559542169
Here's to hoping that the industry will adapt to have it aid animators for in-betweening and other things that supplement production. Anime studios are infamously terrible with overworking their employees, so I legitimately see benefits coming from this tool if devs can get it to function as proper frame interpolation (where animators do the keyframes themselves and the model in-betweens).
> We are giving users the tools and optionality to be in control of what they see on the feed. Using OpenAI's existing large language models, we have developed a new class of recommender algorithms that can be instructed through natural language. We also have built-in mechanisms to periodically poll users on their wellbeing and proactively give them the option to adjust their feed.
So, nothing? I can see this being generated and then reposted to TikTok, Meta, etc for likes and engagement.
Ever since the launch of Veo, there's already so much AI slop videos on YouTube that it becomes hard to find real videos sometimes.
I'm tired, boss.
Biggest problem OpenAI has is not having an immense data backbone like Meta/Google/MSFT has. I think this is step in that direction -- create a data moat which in turn will help them make better models.
Can it do Will Smith eating spaghetti? (I can't get access in UK)
Also I find it neat that they still include an iOSMath bundle (in chatGPT too), makes me wonder how good their models really are at math.
Let me guess, the ultimate market will be teenagers "creating" a Skibidi Toilet and cheap TikTok propaganda videos which promote Gazan ocean front properties.
There would for sure be large swathes of people who would just lie about what they're doing and use AI to make it seem like they're skateboarding, or skiing or whatever at a pro or semi-pro level and have a lot of people watch it.
Impressive that THAT was one of the issues to find, given where we were at the start of the year.
click
takes me to the iPhone app store...
edit: as per usual it's not yet...
Their ultimate goal is physical AGI, although it wouldn’t hurt them if the social network takes off as well.
Sam looks weirdly like Cillian Murphy in Oppenheimer in some shots. I wonder whether there was dataset bleedover from that.
One use case I'm really excited about is simply making animated sprites and rotational transformations of artwork using these videogen models, but unlike with local open models, they never seem to expose things like depth estimation output heads, aspect ratio alteration, or other things that would actually make these useful tools beyond shortform content generation.
Multiple sci-fi-fantasy tales have been written about technology getting so out of control, either through its own doing or by abuse by a malevolent controller, that society must sever itself from that technology very intentionally and permanently.
I think the idea of AGI and transhumanism is that moment for society. I think it's hard to put the genie back in the bottle because multiple adversarial powers are racing to be more powerful than the rest, but maybe the best thing for society would be if every tensor chip disintegrated the moment they came into existence.
I don't see how society is better when everyone can run their own gooner simulation and share it with videos made of their high school classmates. Or how we'll benefit from being unable to trust any photo or video we see without trusting who sends it to you, and even then doubting its veracity. Not being able to hear your spouse's voice on the phone without checking the post-quantum digital signature of their transmission for authenticity.
Society is heading to a less stable, less certain moment than any point in its history, and it is happening within our lifetime.
Going back to sleep. Wake me up when it's available to me.
Brave new internet, where humans are not needed for any "social" media anymore, AI will generate slop for bots without any human interaction in an endless cycle.
I guess copyright is pretty much dead now that the economy relies on violating it. Too bad those of us not invested into AI still won't be able to freely trade data as we please....
it doesn't spark optimism or joy about the future of engaging with the internet & content which was already at a low point.
old is gold, even more so
Absolutely cooked.
After the disaster that was chatGPT4.001, study mode and now this: an impossibly expensive to maintain AI video slop copyright violater, their releases are uninspired and bland, and smelling of desperation.
Making me giddy for their imminent collapse.