So many comments here missing the big picture, and just gleefully pointing out that Anthropic got what they deserved, or that this is the natural culmination of some kind of marketing stunt.
The real story here is that this may be the beginning of governments restricting the availability of strong LLMs to the public, to you. Fable was the strongest model on the market, and the US government has told you you can't use it (technically, only if you're not a US citizen, but in practice, even if you are). If you think the solution here is going to be open source Chinese models and / or running on your own hardware, think again. Do you think China is going to allow the strongest LLMs from companies within its borders to be open source a year from now when they have Mythos capabilities, if the US government is keeping the strongest American models back? Unlikely. These are heading in the direction of being powerful cybersecurity weapons and it will be in the interest of nation states to restrict and control them. In 2 years time, I would be surprised if the strongest LLMs are available for general use at all.
Will we be the poorer for that, or will we be safer? I think poorer, because I hate being told what technology I can and can't use, but I'm not certain. Maybe you think the government should restrict strong LLMs. Maybe you don't. But either way, this is big news and a rubicon has been crossed and a precedent set. That's true even if the motivation for this is just the government settling scores with Anthropic.
Hi Mike, I’m @bfontaine on GitHub (I helped maintain Homebrew in ~2014-2016). I’m always impressed at your longevity as a maintainer; it’s been like what, 16+ years you’ve been maintaining Homebrew and you’re still here, still shipping new features! Thank you for everything!
A very prolific coworker who fully embraced claude has inflicted the team with a flood of AI-generated PRs. About six months later, it is his frequent bemoaning at the standup that their PR don't get reviewed, languishing in inattention. I don't think anyone - including myself - _intentionally_ avoid his PRs. It's just that he doesn't make it easy for the team to look at.
This single headline perfectly captures what I have been thinking. It's not that I reject AI content, but it takes _effort_ to review and weed out any mistakes. When your thoughtful reviews that take an hour(because the PR is typically large, and you want to be _right_ when you're pointing out a hallucination) gets an AI-generated response with AI-generated amendments, It doesn't feel _nice_. I feel dismissed and it has continuously trained me to subconsciously avoid his PRs. After all, the team is fully onboarded with AI, so it's not like there is a lack of PRs to review.
It looks like the sentiment isn't just isolated for me.
The ending is a really powerful point. Most people apparently agree on two things:
1. AI is a great boon for all tasks and specialties we don’t have the skills to do ourselves. Understandable, since (A) we’re ill equipped to see the flaws in its output because it isn’t our area of expertise, and (B) it often can unlock great gains because if we trust it, we then don’t have to pay and wait for humans to do that thing.
2. AI is a terrible replacement for me - my skills are at such a high level that it’s almost theoretical that it’ll ever be good enough to replace me for 90% of what I get paid to do. It’s a tool at best.
This is why I use AI for all my medical questions and doctors use AI to write software, and we both smirk at the quality the other person is getting from it.
It seems more likely that the logical conclusion is the executive branch is mad at Anthropic, and lashing out at them with any convenient tool that they have.
I suspect if OpenAI or Grok was operating at the same level they wouldn’t find themselves on the sharp end of the government stick
Obviously their statements are insincere, because they are building the bloody things. If they were sincere that AI is like nuclear weapons, then they would be devoting all their cash and energy into lobbying the government to nationalize them and treat AI like nuclear weapons. They would not be attempting to IPO and they for sure would not sell their weapon-like thing to the general public.
Anthropic pretending Mythos 5 is so capable it's going to destroy everything, but will release it anyway with "safeguards" (when does this ever work?).
US Gov't using this fake hype as an excuse to handicap Anthropic simply because they have a vendetta.
When you spend a lot of time telling people how dangerous your products are, people who have the power to keep dangerous products off the market might listen.
Especially if those people aren't presently very bright, and are already mad at you for not helping them achieve their unrelated authoritarian goals.
I do not think this is somehow a 3D chess move by Anthropic. They are not masterminds, even if they'd really like to be. People who actually interact with their products know that Fable and Mythos are incremental improvements, not doomsday devices. I think this is a punitive move by an administration that loves being punitive, which they have unknowingly bolstered with their own dumb rhetoric.
I've been in those companies where "struggling departments" ended up getting all the praises and raise in budgets the following quarter because of the heroic saves they did, and raising awareness on how important they are... For stuff they totally caused on themselves.
Meanwhile, my perfectly purring department was struggling to keep the lights on.
It's a serious problem in this industry due to the disconnect between non-technical management (who understands how to double click) and engineering (who holds the company standing).
<insert IBM story about IT department cost cuts>
I'm not sure how we solve this, other than having management come from engineering.
Listen - that's the sound of millions of companies and users doubling down on Chinese models.
It might be a national security problem for other nations to have access to these models. But it's equally now a national security problem for any other nation to depend on them. Or US tech in general.
"Don't expend more effort than they are" has actually long been a good principle to have internalized. Someone done only cursory research before asking a question on a mailing list? Give a cursory answer. Someone obviously spent hours trying to figure things out on their own? Give them a good chunk of your time. Someone on HN responding to you with single-sentence responses? Either don't respond, or respond in kind. Someone obviously engaging with your ideas and taking time to explain their position? Take time to engage with their ideas too.
This has dampened my opinion on Anthropic quite a bit. It's difficult to take their marketing for AI as an empowering technology seriously when they are quite clear in their new deployments that they do not mean empowering for you, but empowering for them and organizations that are in their (or the US government's, despite Anthropics performative disagreements with the administration) good graces. You are allowed to vibe code some dashboards, a web app or let it drive Excel, but anything more interesting than that is forbidden.
If it was just plain monetary concerns and sabotage of competitors I'd almost be fine with it, but it seems they actively want to monopolize most of human progress in their enlightened hands, lest the mob does something undesirable with these powers.
The way I see it, a government led by an adult toddler and his sycophants has decided to punish a firm that refused to cooperate with it's military when it was embarrassed by a militarily weak adversary. The model strength spin strikes me as motivated reasoning.
The rubicon being crossed here is Republicans/the red tribe losing their comparative advantage of being opposed to overregulating a rapidly advancing technology.
But claiming that google lost it's "moral compass" just now is a claim only rich people can make because they retire, not quit.
Google is literally the largest, most organized, tracking and profiling company in the world. Which they tend to grow even larger with the rise of LLMs.
Turning a blind eye of that for the opportunity or whatever, and than claim that _just now_ they lost their moral compass, is being a hypocrite.
> But on the other hand... this is a robust reminder that coding agents can do anything you can do by typing commands into a terminal—and frontier models know every trick in the book and evidently a few that nobody has ever written down before.
> Running coding agents outside of a sandbox has always been a bad idea
I'm continually bemused and astonished by the number of people who clearly acknowledge that it's reckless to give agents full access to your machine, and keep doing it anyway.
It's like posting a video of yourself in the passenger seat of a car, with your feet up on the dashboard, and saying: "Remember, if you're doing this and you get in a crash, the airbags are likely to break your legs or worse! Boy, I sure am glad that didn't happen to me!"
If one is thinking about not getting a degree and trying to go straight to work, as someone who did so (albiet out of poverty rather than choice) but didn't end up like Zuck, please heed my warning:
Social capital matters more than just about anyone who has a degree can understand and tell you or mentor you about, because the majority of them have always had it, and they tend not even to interact with people without it.
It is a signal about your wealth (and your families ability to deploy it for you), from which follows your stability, your intelligence, your taste, your willingness to play the game, and your belonging in the club. These matter more than EVER in the business world - I've never seen a time when tech is less about engineering than right now.
So isn’t the only logical conclusion that we have reached the max of model capabilities that the US allows to be made available to the public? Why invest in smarter models with this precedent?
And potentially more importantly: if a model like Mythos, which at best is an incremental improvement over Opus, is getting this treatment, how are all the AI investments that are based on the expectation of ASI / AGI / significantly better models going to be recouped?
This to me reads like a poignant commentary on the catastrophic loss of human agency, with the actual commit being highly revealing [0].
Author wants to hide a horizontal scrollbar. Any junior frontend dev worth their salt will be asking right away "where do I stick `overflow-x: hidden;`?" A complete solution will then require hitting "Inspect element" in the browser to find the CSS class and running (rip)grep to find where it is in code, to then add a single line to.
An actual proactive programmer might start asking more pointed questions like what content does an empty textbox have that it overflows? And why do I need to insert this workaround that treats the symptom and not the root cause in two different places? Isn't it better to style `textarea` once? Etc, etc.
The replies here arguing we should publish it all are wild in the worst kind of first-order thinking way.
It’s a census: it just asks questions.
If you start publishing and weaponizing the data against people with various attributes, they’ll just lie or not answer. And then you are left with worse than nothing: bad data people try to act on.
I think we should see this as simply silly behavior by a government.
Export control is not an effective tool for controlling a consumer facing technology developers everywhere want to use (see:VPNs) so there was no good faith policy justification for imposing an export control.
This is an administration that seems to be keeping track of who its friends are and aren't, and likes to be the center of every story. They also seem to like extracting concessions and reciprocal favors. We saw some of this behavior in the last administration too. US voters deserve better.
You're correct about CRISPR Cas9. The off-target affects are difficult to manage.
The paper describes Cas12a2. This is a different mechanism with discovery origins in - of all things - agriculture. It does not attempt in any way to reprogram cells. It uses a guide protein to locate a specific mutation with exacting precision and, when it activates, unleashes total destruction of the cell.
The implications of Cas12a2 on undruggable conditions that exhibit known driver mutation profiles is profound.
Source: I have personally funded novel research based on Cas12a2 for an undruggable condition I have. I have personally seen my condition "cured" in vitro using this technology and it left all of my WT cells unharmed. Some of the researchers I've funded are co-authors in the paper linked. I am a layperson in this field (I'm a SWE, not in biotech), but I am happy to answer questions.
I'm hearing a lot of this kind of thing. "Oh if only it was a different government". I'm sorry, but when you cry out for government involvement, it's not always going to be coming from the government you personally wanted. This is the whole problem with government involvement! I don't think that message is getting through, but it's the real lesson that should be learned here.
Last time I suggested on a similar story that there's a disproportionate number of firms in Israel with an explicit focus on subversion, manipulation, spying and malware, seemingly because a large portion of the Israeli population gain a certain expertise in these fields as part of serving in the IDF and working to suppress Palestinians, I got accused of bias because apparently there's many more Israeli startups working on medical research, green technology and world peace.
If there are, they certainly would do no harm in being more vocal, firms like BlackCore is unfortunately what Israel is becoming known for around the world.
We recently had some behavior issues with our kids - they didn't want to do activities outside the house, they hated reading, they hated anything that required even the slightest discomfort or effort.
We decided to cut device usage way down - they get 1 hour in the morning to play whatever games they want on computer, tablet, console. Then they get 1 hour before bed to watch TV. The rest of the day, no devices. We are homeschooled so this is a LOT of free time.
After a few weeks, they're now: blasting through books daily (to the point where they forgot their own TV time, which used to be sacred), playing board games with us more frequently, asking to do things outside like learning to ride bikes (which they've previously shied away from), writing their own comic books and board games on paper, and overall just being creative through the day and entertaining themselves.
It's such a huge difference. It is the devices. It's 100% the devices.
All of my stock has finally vested, and I am independently wealthy enough to signal that I'm quitting purely based on my morals, since there's no way anyone could have known Google wasn't some ethical bastion of hope in 2017.
This is ignoring the fact that the main reason retired phones are e-waste is proprietary firmware blobs and locked-down systems preventing users from maintaining their phone with security updates, and very limited support length from OEM's leads to VERY insecure devices after they drop out of support.
You should not be connecting these old devices to an internet accessible network.
Google notably does well here with 7 years of support, but others such as Sony are 4 years, and Xiaomi on non-flagship devices are similar, or Samsung on their lowest budget models...
The real story here is that this may be the beginning of governments restricting the availability of strong LLMs to the public, to you. Fable was the strongest model on the market, and the US government has told you you can't use it (technically, only if you're not a US citizen, but in practice, even if you are). If you think the solution here is going to be open source Chinese models and / or running on your own hardware, think again. Do you think China is going to allow the strongest LLMs from companies within its borders to be open source a year from now when they have Mythos capabilities, if the US government is keeping the strongest American models back? Unlikely. These are heading in the direction of being powerful cybersecurity weapons and it will be in the interest of nation states to restrict and control them. In 2 years time, I would be surprised if the strongest LLMs are available for general use at all.
Will we be the poorer for that, or will we be safer? I think poorer, because I hate being told what technology I can and can't use, but I'm not certain. Maybe you think the government should restrict strong LLMs. Maybe you don't. But either way, this is big news and a rubicon has been crossed and a precedent set. That's true even if the motivation for this is just the government settling scores with Anthropic.