I think that's the datacenter with the gas turbine generators that operate without permits because they're "portable." Data centers have tremendous externalities but colossus is a particularly nasty offender, and not just due its size.
Have you considered that the march of progress requires human blood to grease the gears and mulched skulls to pave the (highly efficient) road? Really, when you take into account all of the future lives this will improve and save it's difficult to claim any cost now is too high. Would you stand in their way and delay the day that Mythos cures cancer?
I'm not saying human blood and mulched skulls are a renewable source of power, I'm just saying. Or maybe they can partner with SoulCycle to power computation with 24/7 spin classes?
What's the tremendous externality of gas generators? People heat their own homes with natural gas and it's no big deal. How can a datacenter that is miles away be worse than that?
This sort of reasoning genuinely just ruins my day.
Colossus 2 is building towards 1 GW of capacity using Solar Titan-350s. They have a thermal efficiency of about 40%, which probably goes down to 35% during real world use. So, that's 3 GW or so of fuel, or 90 PJ/yr, 90 PJ/yr / 1 GJ/MMBtu ~ 90 Million MMBtu/yr.
Memphis probably consumes around 10 M MMBtu/year... so that's a factor of 9 already. Now a residential furnace probably emits 0.02–0.04 lbs of NOx/MMBtu, and those gas turbines emit 0.1-0.3 lbs of NOx/MMBtu. So we're looking at 50x the NOx emitted by the ENTIRE city of Memphis from burning natural gas.
ALSO, Memphis is 300 sq miles. Colossus is emitting this much NOx right along a fenceline next to, you know, preschools.
Here's a probably stupid question - if someone were unbounded by ethics and conceivably had enough power and connections to power to shield themselves from many consequences of their actions - and that person owned these DCs, could they in theory observe all the streams of tokens coming in and out of these models, and even exfiltrate copies of these models wholesale to have their own teams do what they will with them in the pursuit of building their own competitive models?
Or is there something fundamental in the way these models get deployed (encryption or something or than legal contracts?) at this scale that prohibits the owners of the infra from gaining this level of insight / access?
1) The situation you described would be covered under the contract between Anthropic and xAI, and that any violation of that would be subject to financial penalties and legal proceedings. The US has a robust corporate legal system, and disputes do get resolved through the court system, although in a slow and costly manner.
The contract can stipulate a penalty at a high enough amount to discourage this behavior.
2) Output from models & intra-datacenter communications can be encrypted if customers truly cared.
3) There is no reason do this, because there are far better ways to exfiltrate data from Anthropic models. Chinese companies are already doing this at an industrial scale where they are reselling Claude tokens for 10-20% of the cost while retaining the data to train their own models. https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...
If we look at Deepseek V4-pro, created by Deepseek who Anthropic formally accused of harvesting Claude tokens at scale, it performs the same as Claude did 6 months prior.
There's accusations that the Chinese labs have done essentially that to OpenAI and Anthropic and exfiltrated their models without having DC access, so if you had DC access, yes, you could do that. If you had DC access though you could just copy the model onto an SSD.
More signs that xAI might be giving up on the AGI race. xAI let Cursor train a model on Colossus 2, gave the entire Colossus1 to Anthropic, and is now giving compute in Colossus2 to Anthropic as well.
Elon lost his lawsuit with openAI and knows xAI isn't on the same trajectory. Might as well try to win the bet and flip off Sam by supporting the best competition. Also they are getting a head start on AI as a commodity. I'm sure there's plenty of money to be made for those that can leverage their capital to essentially rent capacity right now. If he's not making enough off of grok, might as well cover their expenses.
Bad read on the situation. xAI has too much compute and not enough customers using it. They have around half a million GPUs, some of which are stolen from Tesla, running at 11% utilization. xAI predicted more people would be using Grok, but Grok is not a SOTA model & users primarily want to use SOTA models. They have excess capacity and it makes sense to rent out GPUs to other customers while they improve their models.
Why is xAI giving up their advantage? Is this a signal that their frontier model improvements are plateauing and decided there is no value in hoarding all their compute?
What advantage? Has there ever been any indication they’re leading in any segment? Sure Elon has thrown a bunch of money at hardware, but to what end?
And frankly as bad as Altman is from a: if AI is really going to disrupt humanity do I want this guy in charge? Elon is 10x worse. So why would the best and the brightest ever work for him?
yes, but it's only advantage if one is compute-strained and the other isn't. if they both have lots then there's no advantage. if one doesn't fully utilise their compute then it's not an advantage either
Well, other than your ability to turn that into cash by renting it out for the highest price to someone who needs it, you can promise prospective employees that are supposed to use that infra to train models that they won’t be compute starved.
You can kick off more model training runs and experiments than your competitors.
You can kick off a $1-2t IPO claiming you are going to capture a large portion of the largest TAM the world has ever seen.
They have neither the most resources nor the best models. They are mediocre at everything except the CSAM generation market, they've got that one cornered
What do you mean by etching? Google does also it's own chip design with TPUs, data centers, and models but afaik only TSMC Intel and Samsung do the actual semiconductor fabrication
I don't see a scenario where it really makes sense to be a frontier lab long term. Eventually model quality will plateau then you distill and get 90% for 10% or less cost.
General AI is that scenario. The investor dream is that their horse hits general AI first, patents it (or otherwise somehow stops the competition from hitting it), and then reaps the massive benefits.
I'm not saying it's a likely scenario, but I genuinely believe a big percentage of AI investment revolves around that (or similar) scenarios.
Fake __AI__ revenue. Maybe Hetzner should build Colossus4, rent it out and book it as AI revenue instead of hardware rental revenue and get a P/E of 100.
Anthropic this quarter will have revenue of $10.9B, up from $4.8B last quarter[0]. They're paying SpaceX $1.25B per month for compute[1] - which is more than what SpaceX earn on space. SpaceX spent about $30-40B in capex on Colossus 1 & 2.
This is all real revenue, real spend, real usage.
Hetzner just aren't at this scale. Not even close. If they wanted to get into this business - first, they're late. Second, it's at a scale of ~10x of their total lifetime datacenter buildout. Third, they'd need to change their business to being one that is debt fronted.
xAI have proven out that being able to deploy compute is a very viable business (and difficult to pull off)
At some point AI cynicism clashes with reality, it must be exhausting maintaining it.
Jesus Christ, the Hetzner example is obviously an example of booking revenue as AI revenue (where investors assume it is generated by Grok subscriptions) vs. hardware rental revenue, which traditionally not valued as highly.
Nowhere does the hypothetical state that Hetzner, an example for hardware rental, has the funding or the capabilities to execute the sarcastic example.
But ok, now hardware rentals have a P/E of 100 or more.
Investors aren’t dumb. These numbers are being reported and the fact that the data centers are being rented out is publicly disclosed everywhere. Investors know full well that the revenue is from the data center rental. No (non-retail) investor is going to see the jump in revenue and think “I better buy up because grok must be kicking ass!”
And yes, if hetzner built a massive AI hyper scale datacenter and rented it out for billions, with the expectation that they would keep building more, they would also see massive PE ratios because it’s expected that their revenue would be going up.
Edit: They did it with Colossus and now they're doing the exact same thing with Colossus2. https://www.selc.org/news/xai-built-an-illegal-power-plant-t...
https://poiesic.com/posts/pattern-recognition
Some wild things happening with those, and infrasound. Colossus is shown 4 mins in
But hey, number must go up, right?
This is a joke. Read it in a mocking tone.
Colossus 2 is building towards 1 GW of capacity using Solar Titan-350s. They have a thermal efficiency of about 40%, which probably goes down to 35% during real world use. So, that's 3 GW or so of fuel, or 90 PJ/yr, 90 PJ/yr / 1 GJ/MMBtu ~ 90 Million MMBtu/yr.
Memphis probably consumes around 10 M MMBtu/year... so that's a factor of 9 already. Now a residential furnace probably emits 0.02–0.04 lbs of NOx/MMBtu, and those gas turbines emit 0.1-0.3 lbs of NOx/MMBtu. So we're looking at 50x the NOx emitted by the ENTIRE city of Memphis from burning natural gas.
ALSO, Memphis is 300 sq miles. Colossus is emitting this much NOx right along a fenceline next to, you know, preschools.
Or is there something fundamental in the way these models get deployed (encryption or something or than legal contracts?) at this scale that prohibits the owners of the infra from gaining this level of insight / access?
The contract can stipulate a penalty at a high enough amount to discourage this behavior.
2) Output from models & intra-datacenter communications can be encrypted if customers truly cared.
3) There is no reason do this, because there are far better ways to exfiltrate data from Anthropic models. Chinese companies are already doing this at an industrial scale where they are reselling Claude tokens for 10-20% of the cost while retaining the data to train their own models. https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...
If we look at Deepseek V4-pro, created by Deepseek who Anthropic formally accused of harvesting Claude tokens at scale, it performs the same as Claude did 6 months prior.
And I'm sure it's a bonus point for Musk that it goes to OpenAI's most relevant competitor
And frankly as bad as Altman is from a: if AI is really going to disrupt humanity do I want this guy in charge? Elon is 10x worse. So why would the best and the brightest ever work for him?
You can kick off more model training runs and experiments than your competitors.
You can kick off a $1-2t IPO claiming you are going to capture a large portion of the largest TAM the world has ever seen.
Also possible he sees infra as the future of xAI if he really believes in the value of space compute.
Hard to see this any of this as anything other than a bearish sign for Grok though.
I'm not saying it's a likely scenario, but I genuinely believe a big percentage of AI investment revolves around that (or similar) scenarios.
But booking outrageous rental fees as fake AI revenue ahead of the SpaceX IPO apparently takes precedence.
This is all real revenue, real spend, real usage.
Hetzner just aren't at this scale. Not even close. If they wanted to get into this business - first, they're late. Second, it's at a scale of ~10x of their total lifetime datacenter buildout. Third, they'd need to change their business to being one that is debt fronted.
xAI have proven out that being able to deploy compute is a very viable business (and difficult to pull off)
At some point AI cynicism clashes with reality, it must be exhausting maintaining it.
[0] https://www.wsj.com/tech/ai/mind-blowing-growth-is-about-to-...
[1] https://www.wired.com/story/spacex-ipo-anthropic-compute-fin...
Nowhere does the hypothetical state that Hetzner, an example for hardware rental, has the funding or the capabilities to execute the sarcastic example.
But ok, now hardware rentals have a P/E of 100 or more.
And yes, if hetzner built a massive AI hyper scale datacenter and rented it out for billions, with the expectation that they would keep building more, they would also see massive PE ratios because it’s expected that their revenue would be going up.