I've been driving glm-5.2 for a day or two now. It feels like a mature, seasoned colleague.
It could be luck, but I don't know -- it keeps one-shotting relatively hard stuff. And taking initiative to think about what potential regressions it should look out for, and choosing to do strategic refactoring when it should do. It is not confidently incorrect hardly at all, doesn't tell me that it's fresh risky pile of changes is ready for production without having exercised all the code paths and writing a bunch of tests, etc.
Anthropic and OpenAI window for a successful IPO is reducing day by day. All that pressure from their debts, compute costs, infrastructure investments, training costs, and open weight models continuing to improve. I know the stock market is all about hype and isn’t rational, but there will be a point where the hype will fade away, and they have no moat that will differentiate them from the rest.
Good for consumers, it’s competition at its best, we get cheaper, better services. But I would be pretty concerned integrating an AI lab products into my business without having a good abstraction that makes it easy to swap between vendor.
It could be luck, but I don't know -- it keeps one-shotting relatively hard stuff. And taking initiative to think about what potential regressions it should look out for, and choosing to do strategic refactoring when it should do. It is not confidently incorrect hardly at all, doesn't tell me that it's fresh risky pile of changes is ready for production without having exercised all the code paths and writing a bunch of tests, etc.
We might be reaching the next level here...
Good for consumers, it’s competition at its best, we get cheaper, better services. But I would be pretty concerned integrating an AI lab products into my business without having a good abstraction that makes it easy to swap between vendor.