Step 2: Complain about how the OSS/Chinese/whatever models are doing releases without approval
Step 3: Prohibit, because "safety" and "financial risks"(?)
So this is the door-shutting Altman et al have been pushing for eh?
https://deepmind.google/models/gemma/gemma-4/
https://developer.nvidia.com/ai-models#:~:text=NVIDIA%20Nemo...
https://www.microsoft.com/en-us/research/blog/phi-4-reasonin...
I'm 99% sure it was one-and-done, box ticked, and now they can be mentioned in comments like this.
For AI, the most profitable part of the value chain is selling inference. None of the big American companies want to release a leading edge model as open source because this would drive the price of inference to $0. Meanwhile, open source AI models are a huge strategic initiative for China. Having commodity Chinese models that are as good as the leading edge American models from 6 months ago forces the American companies to keep paying more and more money to train better and better models since the amount of time they can collect rent on a model they've previously trained is limited to 6 months.
"The world doesn't go round. It flips over!"
scraping CoT won't stop the advance of Chinese models. neither will a US "ban" on using such models. at this point I'm cheering for DeepSeek or Qwen to catch up to Anthropic. I support anyone who releases open weights.
He who controls the porn controls the universe. - Baron Amodei
But it's harder thanks to US actions in the last few years, and especially in countries which can bite back.
How specifically does that review work? I want to give federal agency Opus 4.8 now, while 4.7 has been out for a while (leaving Mythos aside for now). They have 30 days to figure out whether it poses a threat.
How do you do that? Is there an eval for this and if there is why can't they just make it public? What is the agencies objective (but proprietary?) analysis here?
> develop and maintain a classified benchmarking process to assess the advanced cyber capabilities of AI models and determine the threshold at which an AI model should be designated a “covered frontier model” for the purposes of this order
This old post goes into lots of detail about what they do to red team and why: https://www.aisi.gov.uk/blog/early-lessons-from-evaluating-f...
NIST's similar unit in the US is now called CAISI https://www.nist.gov/caisi - interesting that the most recent post is an evaluation of DeepSeek capabilities, which sound more like watching China. But presumably this executive order alters the emphasis?
Train it dumb on "systems:, user:" prompt pairs.
Unleash on "system:, user:" prompt pairs.
Guess which you're providing for evaluation.
For the same reason the CIA doesn't publish the Windows exploits it finds?
Section 1 doesn't say anything
Section 2 seems to boil down to: "improve cyber security and maybe use AI if we can find funding for it"
Section 3 proposes building a benchmark for evaluating cyber security performance of models that developers can choose to benchmark against. This seems like a good idea, I know Jack Clark has been a huge advocate for government's getting in with benchmarking.
Section 4 says to prioritize prosecuting cyber crimes. Not sure why they wouldn't already be prosecuted.
Section 5 doesn't say anything
Not a whole lot of federal prosecutors. They're very selective about what gets pursued or not.
If they can't reliably build cases with a >90% success rate, it doesn't get prioritized. There's like <500 (federal) convictions per year on this whole area.
We hear about a few big famous ones in the news here, but most of it goes completely unenforced.
And lately they seem to spend most of their time in courts trying to argue that immigrants don't deserve due process
So much for "Hacker" "News".
This Executive Order is just an expansion of the existing censorship framework.
https://www.federalregister.gov/documents/2025/07/28/2025-14...
is this a review of "wokeness" in models and rejecting them if they don't align with the party views - this should not be allowed.
A politically neutral committee that decides what the review entails is what would happen in a true democracy and not a puppet oligarchy like we have today.
You left out the part containing the “barrels of money” incentive.
China, obviously.
For an all powerful AGI to exist, it has to basically beat the computationally irreducable processes within nature - i.e it has to simulate reality faster than reality, with a high degree of accuracy, which would imply that NP=P amongst other things.
And thats assuming that anyone has any idea to build an AI that can automatically build necessary simulations to make decisions in the first place. Such an AI is won't need data center with massive training data to be built. The "genesis" code will be something that is capable of figuring out how to go on the internet, and train itself. How do I know this? Because in order to figure out how to solve complex problems (like how to make humans give you control of the nuclear arsenal), is exactly equivalent to a problem of being able to write/read bytes to a file (assuming that file is a socket in Linux) and figuring out how to talk http to get a particular piece of data, without ever being trained on anything internet.
Even more so, there is a fundamental question of whether this genesis code is a P or NP problem in itself - i.e can we generate this code using a training data set, or can it only get created through simulated evolution, much like human brains and capacity for reasoning did IRL.
So as long as everyone keeps talking about number of parameters, transformers, attention, and benchmarks, I promise you we are safe against all powerfull AI.
\s
> An earlier draft of the order had called for a voluntary review as much as 90 days in advance, a provision that some AI industry officials had called too onerous, POLITICO reported last month.
A 90 days delay on the release of new models would have been insane. I guess I'm glad it's been revised at least on this specific point.
What would have made it "insane" exactly? The only argument I can imagine is that it gives non-US models (e.g. DeepSeek) a potential edge in the market during that time. But this potentially seems to be mitigated it being banned in the US anyway [0].
Given society seems to have developed just fine prior to the release of LLMs, I don't understand what the rush for more powerful and - potentially - more dangerous iterations of this technology is. If there is a legitimate reason that 90 days is somehow catastrophic, can someone ELI5?
[0] https://statetechmagazine.com/article/2025/04/these-states-h...
1. Any company which doesn’t is banned from federal contracts
2. Any company which doesn’t is declared a supply chain risk and federal contractors will be prohibited from doing business with them (e.g. AWS/Azure/GCP can’t offer them to customers without risking their unrelated contracts). That was the big risk Anthropic was worried about.
3. Federal prosecutors and regulators will be told to prioritize going after non-compliant companies on unrelated issues as leverage.
4. These companies will not be granted the same exceptions in tariff negotiations as competitors, which means things like data center buildouts get pricier.
Even if they can successfully fight something in court, that’s expensive and uncertain so it puts a lot of pressure on companies. There are likely also cases where politicized tax or H1-B enforcement could be hard to fight in court because much of the industry is gambling on lax enforcement.
IMO this isn't much more egregious than the "stop woke AI" executive order he signed in July 2025 which explicitly regulated the "ideology" of LLMs
https://www.paulhastings.com/insights/client-alerts/presiden...
https://www.bis.gov/press-release/biden-harris-administratio...
More regulated rather than unregulated (or very lightly regulated).
Most people would probably say that’s a good thing, if I read the tea leaves correctly.
I'm very pessimistic that this is about AI safety. I think it's probably more about giving the Trump administration leverage over AI companies. It will be able to coerce them into e.g. propagandizing or surveilling or similar or else they will risk the same kind of "regulatory oversight" that caused television networks to fire comedians who made jokes the regime didn't like.
Were we not pursuing criminal cases against these individuals previously? Or have we only just decided to make crimes be against the law now?
Edit: let's all remember, by the way, this "review" period does nothing for security. It exists to allow members of the government to trade on insider knowledge.
As other commenters have mentioned, it leaves the question what on earth this EO is actually good for. And still the AI industry is complaining about how onerous it is.
Absolutely no one would be okay with authors being 'encouraged' to submit their works to a 'voluntary' review by the feds to ascertain if their ideas are threatening. AI models are NO different.
(probably a good thing, in this particular case)