I have a similar claude story (much less money though), with the IRS R&D tax credit. The auditing firm initially said we qualify for $0. But then I had claude analyze past R&D reports and our expenses and it found the problem. The auditor had miscategorized our company.
So claude drafted an email even pointing to the right Internal Revenue Code (IRS Law), and specify why we fall under a specific category. The auditor got back to me two days later admitting their mistake and said our company now qualifies for $8k in tax credits. And a few months ago, it identified items in our AWS that saved us $250 a month (paying for itself).
So now I joke that even if I have a claude max plan, I've still come out ahead financially.
By my estimation (guess) you won't actually need to spend that much because the models are already getting a point where they don't need to get a whole lot better to be extremely helpful across many domains.
And it looks like those very helpful capabilities will continue to transfer to smaller models as well, as architectures and training regimes continue to refine.
I can fairly easily imagine a world where the only people needing to spend a lot of money on models are those that are using them to solve truly novel problems. The rest of us will get plenty of use at reasonable costs for the typical day-to-day helpful stuff.
In my anecdotal experience there is a huge gap between GPT-5-mini which hallucinates relentlessly and Claude Opus or the latest GPTs which are fairly reliable. I'm hoping that gap can be closed with improved approaches for small models and that good reliability is achievable for LLMs without requiring absolutely mammoth computing resources.
For what it's worth, I also used GPT-5.2 (via duck.ai) this year for questions about taxes and it was helpful — which makes sense because there's so much material about taxes out there to be synthesized, so a text predictor trained in that domain should do well.
There seems to be a mass delusion about how capable SOTA models actually are. That's my only explanation for how poorly I find them performing in basic knowledge tasks compared to how others describe their prowess.
I am eagerly awaiting being able to run a strong local model. I'd hand Apple $5k right now for a Claude in a box. I know the cost might not be there now, just saying that is around my ideal price point.
$10k might even be worth it - but i'm assuming that the more expensive it is the beefier it is too, which also means more electricity.. and i already run ~6 computers/servers in my house. If a power surge happens i'm going to go live in the woods lol.
I would do the same but my issue is that the models are changing so fast, so I don't want to be left out of the next model cuz it only runs on an even newer GPU or something like that.
But maybe my limited understanding is thinking of this wrong.
[sci-fi “AGI” scenario] What if those with elite model access philosophize in a way us mere mortals can’t understand, so the elites have to prechew the ideas for us to bring them to our level, and they control the narrative?
In reality now, curious about social implications generally. Does this go beyond problem solving? Maybe the intelligence per token you get via your free library card/membership is insufficient to compete with peers in dating/employment/etc. markets, thus puts you at disadvantage.
That isn't really philosophy, but rather doom and gloom theories.
Control the narrative on what exactly, how I write a bootstrap script for my servers? Or what type of flower is in this photo. Not everything is politics luckily.
To be fair regarding taxes it should be that you get all exemptions by default and the other side telling/justifying you why you don't qualify, instead of you not getting the things you should because you're not sure how to interpret the law or don't know this or that rule exists. Taxes shouldn't be that difficult, and the US version of it seems to be behind even the one I have here in europe (where my taxes are done "for me" for anything non business owner related). If the government is calculating your taxes anyway, they should just give the number to you instead of asking you a number and you better have the same as us or you're guilty of something.
Feels like a system that is deliberatly made to be more punishing for those who can't afford the help or the education to figure things out.
I would agree with you on most situations (like 1040 personal income taxes especially).
But in this particular tax credit, there's no way for the gov to know automatically what percentage of payroll was spent in qualified R&D expenses, since it's day to day business operations. Which is why we are _forced_ to hire an outside firm and pay them thousands of dollars (when Claude did an even better job), just to analyze how much of our time qualified as R&D expenses.
The problem I have is that I am forced to have to find a firm to do this, and most firms won't even work with companies as small as ours. So then we're stuck and losing out on years of R&D tax credits at the moment, when I really don't need them anymore, to be honest.
Kind of. I first of all did the entire return with it. So we went step by step and yes I fed the forms one at a time. So I filled in 1040 as best I could. Then just asked it what to do next at each step. It helps I've done it before so most of the steps it returned were ones I've done before. However, it did mention several things that I had not heard of, and also some new taxes that I had to file due to some exceptional events last year. So all in all, a solid use case. This year I have an accountant, but it saved my butt this last year, and I will absolutely run through my accountants decisions with it. It has an encyclopedic knowledge and an immense capability to understand without getting tired.
Claude Code is really good at stuff like this. The other day I tried to recover some images from an SD card that had gone bad. I used GetDataBack to recover files, but they appeared to be malformed and didn't open in image viewers.
I tasked Claude to analyze the files and figure out what's going on, and eventually we figured out that each file had a custom metadata header + thumbnail + actual image concatenated. I had it write a python script and was able to recover all the images with their metadata. It's nothing a human couldn't have figured out, but it was definitely WAY faster than doing it myself.
I've also used Claude in the past to figure out how to break into routers with locked down firmware. It's great at suggesting and trying different approaches.
I have a friend that just picked up a new consulting job resurrecting an ancient Windows desktop application. No source control, no tests. And it's spread out over a dozen different folders with names like "_old", "_new" and "dates". Claude's doing a tremendous job in getting him to grips with what is actually happening in the application, what's relevant, what's not, what's different. I think it's literally saving him days and days at work.
if your friend has access to the binary and can pull it out to different box, they might get a lot out of a ghidra mcp -> https://github.com/LaurieWired/GhidraMCP
I'm not well versed at reverse engineering binaries or interpreting C/assembly so ghidra MCP has been an absolute gamechanger for helping me write tools. Once my project is complete, I plan to learn how to do the analysis myself manually and have cc guide me along the way.
> I have a friend that just picked up a new consulting job resurrecting an ancient Windows desktop application. No source control, no tests. And it's spread out over a dozen different folders with names like "_old", "_new" and "dates".
That doesn't sound very impressive. Not being tracked with a version control system is fixed instantly with a git init, git add ., git commit .no AI required.
Covering the app with tests is also something that requires no AI. At most, coding agents can generate characterization tests in broad sweeps, but we are talking about a delta between hand rolling and vibe-coding of a couple of days.
Where LLM shines is helping developers build up an understanding of what is in place. Running /explain on a codebase can quickly provide you with a high level summary of what's in place.
The relevancy here is that he's denied the git history, versioning, branches, implicit documentation that even bad source control practices would have given him.
That's what the comment is saying. In normal repositories, version control acts as a record of the momentum of the direction the product was taking. If it's just "_old" and "_new," the developer has to read and understand both, which I think is going to be far more time consuming than your estimation.
I'm sure data recovery companies are pretty pissed that slightly esoteric data recovery abilities are becoming more accessible for average software devs. They were charging an arm and a leg to remote in and run scripts.
They still have two important moats: (1) expensive hardware tools (even stuff like SATA write blockers are kind of expensive for what they are), spare hard drive collections to swap failed PCBs, etc and (2) the "nobody got fired for hiring us" edge similar to how everyone calls in Crowdstrike/Mandiant after an incident. If a suit-level manager finds out customer data was lost, they are going to want to call in an expert so they can immediately tell the customer they did, not have the same internal team try to figure it out.
As an aside to #1: The cool thing is in modern times the hardware tools have come down stupidly cheap in price. Even SD card recovery is (vaguely) in the right skilled hands in a pseudo-professional home lab these days.
I did EXACTLY that last night. Was doing by hand for about an hour and got to a point where I didn’t feel competent anymore and asked Claude to take from where I was.
5 minutes later I had almost 3 hours of important footage recovered.
A lot of "Claude Code is best at X" claims are probably user-selection bias.
The people saying it are often exclusively Claude Code users, not people who are actively benchmarking Claude Code against Gemini CLI, OpenAI Codex, GitHub Copilot, and other agent harnesses on the same tasks.
The claim may still be true for certain scenarios, but the evidence is usually anecdotal, not comparative.
When I hear "claude code one-shotted X" and X is a novel problem, I mentally substituted "the agentic harness that I tried one-shotted X," since that's what they're saying.
Getting any smart model to take a look at the task is the sort of lift that the speaker is usually pointing to.
> Bitcoin trader recovers $400,000 using Claude AI after getting 'stoned' and losing wallet password 11 years ago — bot tried 3.5 trillion passwords before decrypting an old wallet backup
Man. I wish I had a lost wallet worth a quarter of that even, technically didn't need Claude for this, just needed any password cracking software.
Explaining your life to an llm, then having it generate permutations of passwords to try does sound like it would work a decent percentage of the time.
A large percentage of passwords aren't a random string of characters but a memorable word + memorable number. There's existing projects that basically do the same, and 3.5 trillion doesn't really make it clear if one of those wouldn't have worked as well, but I can see it having an above random chance to guess a password.
The idea that someone (the NSA?) is training models on all of our collected info, and using that to predict all of our hidden information, is horrifying.
The best time to start using a password manager was 10 years ago. The second best time is now.
I have a lost wallet with about 300 Bitcoin sitting in a landfill somewhere. I tried out Bitcoin really early on and mined those over a few weeks. But they were worthless back then and I was burning electricity for "nothing" so I stopped. This was before that 10k Bitcoin pizza purchase happened. I have some regrets lol.
I had a high school friend that died about 10 years ago from an over dose. He was always tech forward and had talked in the past about getting drugs from the dark web to sell locally.
I wasn't particularly close with him after high school, but he was an only child, and I can only imagine his (older) parents just tossed his computer. I wouldn't be surprised if he had had a few hundred BTC on there.
Someone gifted me 87 bitcoins back when they were worth ~0. They are still in some wallet somewhere I guess, and I saved the password on a harddrive I threw out around the same time
Way back in the day when Bitcoin first came about, I once idly contemplated spending some time and money on it just because it was a very cool technology. At the time it was a bit of a hassle because you had to mine your own.
Then I was especially tempted years later after running into the MtGox booth at CES, and seeing how convenient it had become. I remember asking a guy at the booth if Satoshi was really still anonymous or if any insiders knew about him, and he said "no" but was bit surprised I knew about Satoshi. I guess Bitcoin was still quite niche then even amongst a technical crowd.
I considered buying a few bucks worth of bitcoin then for lulz, but I thought that money was better spent on beer lol.
I've never really regretted spending that money on beer rather than bitcoin, because I knew that even if I did, it would 100% have been on MtGox and I would have lost it in the hack anyway, which would have been even more bitterly frustrating.
A friend lost £2000 worth of BTC in MtGox which is probably worth a fortune at today's prices. The last time I spoke with him he said there was some sort of lawsuit for victim compensation. How did you recover your funds?
Yup. I was really close to buying $1000 (at $1) worth of BTC ages ago. I don't dwell because the stress of managing that through time would have eaten me away lol.
With that said, i do regret not at least mining/etc. Back then i could have mined in many ways, and getting into it as a hobby probably would have meant holding larger amounts of BTC in the long run.
I remember thinking about buying $100 (at $10). And then realizing I didn't actually know how to do it and didn't feel like looking it up or going through whatever steps to do that kind of transaction online, or worrying about getting scammed....
I was making a long edit in a crappy wiki UI and my browser froze. It would have taken a long time to redo, hours.
I didn't want to take the chance of force closing and losing everything. I used claude code to extract my text out of the browser internals and filesystem objects.
Many crypto wallets use a key derivation function (KDF) to add an amount of computation (and memory usage) per password tried - to mitigate brute force of weak passwords.
The increase in compute (decrease in brute-force cost) combined with price increases in many crypto tokens means brute-forcing old wallets can become worth it years after passwords were forgotten.
And of course even smaller, local AI models can now easily write optimized scripts to brute-force any given KDF function.
The compute power needed use to be of the order of 5s per password try.
So it effectively mitigate brute force back them, you need a absurd compute power to crack them.
Moore law did its thing, now you can do it with a lot less computer power.
Kind of cool to hear... I had a couple computers running miners at home towards the end of my second marriage (which ended in 2010-2011), and after I had a few coins when realizing I couldn't actually spend them (IIRC $0.25 at the time), I just deleted the wallets. I had no foresight or faith that they'd be worth around $100k each in 15 years. I'm curious how many did the same, how many coins out there were just deleted altogether.
i distinctly remember a coworker many years ago offering to just give me like 10 bitcoin to see them and how it worked. i said "no, thanks". granted, hindsight is 20/20
A wallet is just private keys of some specific public keys on the blockchain that have unspent output (UTXO). None of what’s described in this article involves the blockchain, only the storage and protection of the private keys on a local computer.
You can imagine that in your example, you didn’t change the locks on a house, but rather you put the house keys in a secure lock box and you changed the locks on this box.
Changing the locks on a house in this case means transferring from an old wallet to a new wallet and then abandoning the old wallet. That’s exactly what the OP is trying to do. It’s just that you need the original key to do it.
> bot tried 3.5 trillion passwords before decrypting an old wallet backup
> After finding a mnemonic that actually turned out to be their old password a few weeks ago, the user dumped their entire college computer files in Claude in a last-gasp effort. The bot uncovered an old backup wallet file that it successfully decrypted, while also uncovering a bug in the password configuration that was preventing recovery up to that point.
So it switched from brute-force searching passwords against a file, to brute-force searching files against a password?
The other day, I asked Claude to track down the leaked Claude Code source so I could study it. It refused, saying “given who made me, I’ll pass.” It gave me some pointers on how to find it myself, which worked.
There isn’t that much of a difference between “help me crack this bitcoin wallet” and “help me crack this executable.”
I don’t exactly have a solid point, just some general observations. First, I think we’ll see AI more and more simply refuse to do any kind of forensics, as forensics becomes more powerful. Second, that implies local models will become more valuable, since they’re the only ones willing to do that kind of work.
I once got myself banned from Claude by researching barbiturates, since they’re connected with suicide. So my third observation is that we’ll see an uptick in people getting punished for trying to do things with AI that people don’t usually do. (Luckily the unban form worked.)
Someone downthread asked “how’d he convince Claude the coins weren’t stolen?” Which is an interesting question, because presumably some people trying to crack a wallet have stolen it. So I guess the fourth observation is that the exact framing you approach an AI with will become more important. There was the classic “do this or I’ll cut off my arm,” which worked a year ago. But in the future it will be more like “hopefully the AI believes my story, or else I’ll get into trouble.”
It’s good there are multiple AI vendors, or else it’d get real dystopian real fast when the de facto AI’s policy becomes something you have no way of working around.
I spent a couple of days mining many years ago and got 2 bitcoins. At the time, they weren't worth the electricity they cost to mine and over time I lost the wallet and all information related to it.
You are better than me. Back when you could mine BTC with the CPU, i had about 2 coins. I found it useless and silly and deleted my wallet at some point :)
>they weren't worth the electricity they cost to mine...I'd love to mine a bitcoin a day on my PC now...
you can!... but they wouldn't be worth the electricity now either. the cost of mining (amortization of hardware costs plus electricty) is the value of bitcoin. if bitcoins are a bargain to mine, more people will mine them thereby reducing rewards.
should you have mined more back then if you had magical perfect knowledge of the future? no: they weren't worth the electricity.
instead you should have bought more of them back then.
Feels less like "ai cracked crypto" and more like having an insanely patient technical friend sitting next to you for 12 hours doing digital archaeology.
So you’re telling me he used a computer to search all the files and found the old backup? computer file systems are truly proving to be revolutionary technology!
Claude found a file on the computer that the wallet owner had not found. Claude didn't crack a password or do anything magic, it just searched for a file that the wallet owner had not thought to search for before.
So, where the wallet owner had previously only tried to access /Users/example/wallet.dat, Claude thought, "why don't I check if there is another wallet.dat file elsewhere on the system?" which it did.
The outcome is the same, it is great that Claude tried something that the wallet owner hadn't tried, but this is more an example of how dumb humans can be rather than how smart Claude is.
The trillions of passwords are a red herring and unrelated to the solve.
“ Out of frustration, cprkrn then dumped their whole college computer into Claude. This was when the AI discovered an older backup file of the wallet from December 2019 hidden in cprkrn's data.”
… this dweeb had a file containing their seed in their backup, claude just searched through the files
Yesterday I had ChatGPT walk me through fixing my single-node k3s cluster. It required rebuilding the sqlite database (while skipping a few corrupted records), then clearing the containerd cache, and then finally deleting a somehow-corrupted Secret record, and then recreating it.
Without it I would have given up way earlier, but the infinite patience to keep slurping in error messages and continue to troubleshoot really worked out.
Yeah Claude is really, really good there. You tell it the distro and the problem and it will solve it. Saved me a lot of pain when it came to swapping out an encrypted boot drive and was good about emphasizing the order of operations required for what I would consider a higher risk/complexity situation.
Claude found an old wallet and then ran btcrecover on that. The question is why the user could not find an old wallet with any numbers of Unix tools himself.
Since we are dealing with Anthropic, the entire story could be staged of course.
Claude ran a ctrl+f on his file system. Groundbreaking. Insane the dude hadn't figured this out for himself considering a few years salary was just sitting there.
> The bot uncovered an old backup wallet file that it successfully decrypted, while also uncovering a bug in the password configuration that was preventing recovery up to that point.
I know that we're all experts in archaic backup mechanisms and the encryption systems they used, but I think this qualifies as doing more than Ctrl+F
"the user dumped their entire college computer files in Claude in a last-gasp effort."
Claude has limits that would make this simple statement be much more complicated-
Via Claude "So the chat upload file size limit is actually 500MB per file (not 30MB as many third-party sources claim - those appear to be outdated). The 20-file-per-chat cap and the 30MB-per-file limit in Projects remain consistent across plans.
The real constraint at any subscription level remains the context window - how many tokens Claude can hold in memory at once during a conversation. "
On a purely technical level - cool. But I still cannot get over the impression that even in this case LLMs show us how they are mainly useful to grifters. I mean, 5 Bitcoins worth 400K USD. Why? What intrinsic value does Bitcoin deliver? It's like trading for monopoly money.
I've tried Claude Code with another LLM, it's very good at doing tasks and figuring things out. So this made me wonder, even though we know how good Claude models is, maybe the true value is in the harness now?
So claude drafted an email even pointing to the right Internal Revenue Code (IRS Law), and specify why we fall under a specific category. The auditor got back to me two days later admitting their mistake and said our company now qualifies for $8k in tax credits. And a few months ago, it identified items in our AWS that saved us $250 a month (paying for itself).
So now I joke that even if I have a claude max plan, I've still come out ahead financially.
This is no joke; for better or worse, I see a day when I'm paying a lot more for this and it will be a bargain.
And it looks like those very helpful capabilities will continue to transfer to smaller models as well, as architectures and training regimes continue to refine.
I can fairly easily imagine a world where the only people needing to spend a lot of money on models are those that are using them to solve truly novel problems. The rest of us will get plenty of use at reasonable costs for the typical day-to-day helpful stuff.
For what it's worth, I also used GPT-5.2 (via duck.ai) this year for questions about taxes and it was helpful — which makes sense because there's so much material about taxes out there to be synthesized, so a text predictor trained in that domain should do well.
$10k might even be worth it - but i'm assuming that the more expensive it is the beefier it is too, which also means more electricity.. and i already run ~6 computers/servers in my house. If a power surge happens i'm going to go live in the woods lol.
But maybe my limited understanding is thinking of this wrong.
like, yesterday.
In reality now, curious about social implications generally. Does this go beyond problem solving? Maybe the intelligence per token you get via your free library card/membership is insufficient to compete with peers in dating/employment/etc. markets, thus puts you at disadvantage.
Feels like a system that is deliberatly made to be more punishing for those who can't afford the help or the education to figure things out.
But in this particular tax credit, there's no way for the gov to know automatically what percentage of payroll was spent in qualified R&D expenses, since it's day to day business operations. Which is why we are _forced_ to hire an outside firm and pay them thousands of dollars (when Claude did an even better job), just to analyze how much of our time qualified as R&D expenses.
The problem I have is that I am forced to have to find a firm to do this, and most firms won't even work with companies as small as ours. So then we're stuck and losing out on years of R&D tax credits at the moment, when I really don't need them anymore, to be honest.
IRS> Pay your taxes!
me> ok how much?
IRS> idk you have to figure it out
me> ...ok
IRS> if you get it wrong you goto jail
me> so you don’t know how much I owe?
IRS> no, we do…
me> ...ok
IRS> if you get it wrong you goto jail
I tasked Claude to analyze the files and figure out what's going on, and eventually we figured out that each file had a custom metadata header + thumbnail + actual image concatenated. I had it write a python script and was able to recover all the images with their metadata. It's nothing a human couldn't have figured out, but it was definitely WAY faster than doing it myself.
I've also used Claude in the past to figure out how to break into routers with locked down firmware. It's great at suggesting and trying different approaches.
That doesn't sound very impressive. Not being tracked with a version control system is fixed instantly with a git init, git add ., git commit .no AI required.
Covering the app with tests is also something that requires no AI. At most, coding agents can generate characterization tests in broad sweeps, but we are talking about a delta between hand rolling and vibe-coding of a couple of days.
Where LLM shines is helping developers build up an understanding of what is in place. Running /explain on a codebase can quickly provide you with a high level summary of what's in place.
https://blog.acelab.eu.com/pc-3000-flash-spider-board-adapte...
5 minutes later I had almost 3 hours of important footage recovered.
A lot of "Claude Code is best at X" claims are probably user-selection bias.
The people saying it are often exclusively Claude Code users, not people who are actively benchmarking Claude Code against Gemini CLI, OpenAI Codex, GitHub Copilot, and other agent harnesses on the same tasks.
The claim may still be true for certain scenarios, but the evidence is usually anecdotal, not comparative.
Getting any smart model to take a look at the task is the sort of lift that the speaker is usually pointing to.
TBF the real breakthrough was finding this, though no doubt they couldn't have recovered without Claude
They are really underestimating their audiance here.
Man. I wish I had a lost wallet worth a quarter of that even, technically didn't need Claude for this, just needed any password cracking software.
A large percentage of passwords aren't a random string of characters but a memorable word + memorable number. There's existing projects that basically do the same, and 3.5 trillion doesn't really make it clear if one of those wouldn't have worked as well, but I can see it having an above random chance to guess a password.
The best time to start using a password manager was 10 years ago. The second best time is now.
I wasn't particularly close with him after high school, but he was an only child, and I can only imagine his (older) parents just tossed his computer. I wouldn't be surprised if he had had a few hundred BTC on there.
I had to laugh: the most Bitcoin story ever.
Thank you MtGox.
Then I was especially tempted years later after running into the MtGox booth at CES, and seeing how convenient it had become. I remember asking a guy at the booth if Satoshi was really still anonymous or if any insiders knew about him, and he said "no" but was bit surprised I knew about Satoshi. I guess Bitcoin was still quite niche then even amongst a technical crowd.
I considered buying a few bucks worth of bitcoin then for lulz, but I thought that money was better spent on beer lol.
I've never really regretted spending that money on beer rather than bitcoin, because I knew that even if I did, it would 100% have been on MtGox and I would have lost it in the hack anyway, which would have been even more bitterly frustrating.
A few of pints of beer >> years of regret.
Whew, that brings me back!
I still think about the Bitcoin my buddy paid me for his half of a pizza ~15 years ago... worth 6 figures now haha.
Better not to dwell on such things.
With that said, i do regret not at least mining/etc. Back then i could have mined in many ways, and getting into it as a hobby probably would have meant holding larger amounts of BTC in the long run.
Everyone who had coin in Mt.Gox lost it during a hack. A portion of that was returned to the users who had a loss about a year ago.
The first pizza anybody bought that way cost 10,000 bitcoin, over $billion.
BTCUSD has been over 100k, but is not currently.
I didn't want to take the chance of force closing and losing everything. I used claude code to extract my text out of the browser internals and filesystem objects.
The increase in compute (decrease in brute-force cost) combined with price increases in many crypto tokens means brute-forcing old wallets can become worth it years after passwords were forgotten.
And of course even smaller, local AI models can now easily write optimized scripts to brute-force any given KDF function.
Moore law did its thing, now you can do it with a lot less computer power.
In the physical world, I can't imagine too many people being happy that old keys to your house still work even after you've changed the locks.
Can someone more informed, help me understand how this worked and why it's ok.
I'm genuinely wanting to become more informed & better understand.
You can imagine that in your example, you didn’t change the locks on a house, but rather you put the house keys in a secure lock box and you changed the locks on this box.
Changing the locks on a house in this case means transferring from an old wallet to a new wallet and then abandoning the old wallet. That’s exactly what the OP is trying to do. It’s just that you need the original key to do it.
By getting stoned he was forced to hold until AI could solve his problem at a crypto high.
> After finding a mnemonic that actually turned out to be their old password a few weeks ago, the user dumped their entire college computer files in Claude in a last-gasp effort. The bot uncovered an old backup wallet file that it successfully decrypted, while also uncovering a bug in the password configuration that was preventing recovery up to that point.
So it switched from brute-force searching passwords against a file, to brute-force searching files against a password?
The other day, I asked Claude to track down the leaked Claude Code source so I could study it. It refused, saying “given who made me, I’ll pass.” It gave me some pointers on how to find it myself, which worked.
There isn’t that much of a difference between “help me crack this bitcoin wallet” and “help me crack this executable.”
I don’t exactly have a solid point, just some general observations. First, I think we’ll see AI more and more simply refuse to do any kind of forensics, as forensics becomes more powerful. Second, that implies local models will become more valuable, since they’re the only ones willing to do that kind of work.
I once got myself banned from Claude by researching barbiturates, since they’re connected with suicide. So my third observation is that we’ll see an uptick in people getting punished for trying to do things with AI that people don’t usually do. (Luckily the unban form worked.)
Someone downthread asked “how’d he convince Claude the coins weren’t stolen?” Which is an interesting question, because presumably some people trying to crack a wallet have stolen it. So I guess the fourth observation is that the exact framing you approach an AI with will become more important. There was the classic “do this or I’ll cut off my arm,” which worked a year ago. But in the future it will be more like “hopefully the AI believes my story, or else I’ll get into trouble.”
It’s good there are multiple AI vendors, or else it’d get real dystopian real fast when the de facto AI’s policy becomes something you have no way of working around.
I'd love to mine a bitcoin a day on my PC now...
But lets be honest - when BTC hit 100 bucks, we would have cashed it out thinking we were geniuses.
you can!... but they wouldn't be worth the electricity now either. the cost of mining (amortization of hardware costs plus electricty) is the value of bitcoin. if bitcoins are a bargain to mine, more people will mine them thereby reducing rewards.
should you have mined more back then if you had magical perfect knowledge of the future? no: they weren't worth the electricity.
instead you should have bought more of them back then.
https://www.coinbase.com/learn/crypto-glossary/what-is-bitco...
Claude found a file on the computer that the wallet owner had not found. Claude didn't crack a password or do anything magic, it just searched for a file that the wallet owner had not thought to search for before.
So, where the wallet owner had previously only tried to access /Users/example/wallet.dat, Claude thought, "why don't I check if there is another wallet.dat file elsewhere on the system?" which it did.
The outcome is the same, it is great that Claude tried something that the wallet owner hadn't tried, but this is more an example of how dumb humans can be rather than how smart Claude is.
The trillions of passwords are a red herring and unrelated to the solve.
… this dweeb had a file containing their seed in their backup, claude just searched through the files
Without it I would have given up way earlier, but the infinite patience to keep slurping in error messages and continue to troubleshoot really worked out.
Since we are dealing with Anthropic, the entire story could be staged of course.
Claude-powered AI coding agent deletes entire company database in 9 seconds <https://www.tomshardware.com/tech-industry/artificial-intell...>
Claude Code deletes developers' production setup, including its database and snapshots <https://www.tomshardware.com/tech-industry/artificial-intell...>
I know that we're all experts in archaic backup mechanisms and the encryption systems they used, but I think this qualifies as doing more than Ctrl+F
Also, it is right there in the article.
So I guess he might be glad he didn’t figure it out earlier.
Claude has limits that would make this simple statement be much more complicated-
Via Claude "So the chat upload file size limit is actually 500MB per file (not 30MB as many third-party sources claim - those appear to be outdated). The 20-file-per-chat cap and the 30MB-per-file limit in Projects remain consistent across plans. The real constraint at any subscription level remains the context window - how many tokens Claude can hold in memory at once during a conversation. "
I guess the user simply pointed Claude Code at a local folder containing all the backups and files, and Code went through them via find/ls/etc