FR version is available. Content is displayed in original English for accuracy.
Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
82% Positive
Analyzed from 1987 words in the discussion.
Trending Topics
#claude#codex#down#models#more#code#anthropic#better#value#don

Discussion (92 Comments)Read Original on HackerNews
I switched a few days ago and work has been much less frustrating. Feels like CC did back in February before they started playing games.
It also doesn't eat nearly as many tokens, so it's saving me $100/mo.
For coding its fine, I havent experimented too much with Amazon Bedrock myself, but I just might soon to check for any limitations.
I say this as someone working for a tech company who does not have to foot the bill (in the >$1k per month bracket)
I also experienced and accept the 1990s levels of unreliability, which is my “internet generation”. My first access was lifting a handset and placing on a speaker/mic cradle.
Programmers these days are fucking spoiled. If it’s $220 worth of value for $200 - I get it. But I’m getting $100k of value for $10k and so I’ll put up with some shit.
Wrong comparison. If a competitor gives you $230 of value for $200, of course you shouldn't pick the $220 one
I am an API user, and while it being down is super annoying, it isn't really as big of a hit to my overall usage as I can just prepare a bunch of stuff to run in parallel when it does come back up.
Is this just the API and I'm too much of luddite to actually use the API?
Say five eights of reliability. Maybe six.
It's starting to feel like a lot of comments on here and other social media outlets that are anecdotal about their experience with x model and y tool are astroturfing. They add almost zero value to the conversation.
These is a multi-billion dollar market and battleground, so im skeptical of anyone telling me that this isn't happening at a decent clip. I think moderators on the site should definitely consider how to approach this because it's devaluating this space as a place for actual discourse.
My mind also considers that this being one of Altman's old stomping grounds, he may place a higher value in winning here than elsewhere.
[0]I say December, because that's around the time the models got good enough that non-AI folks started to notice.
[1]https://en.wikipedia.org/wiki/Eternal_September
I don't really blame Anthropic here.
We can now shop around easily. They almost all do the same thing now. The models are "Just Enough".
[unknown] missing EndStreamResponse
There's a live Claude status board in the corner so you know when it's time to get back to work.
https://github.com/anthropics/claude-code/issues/54497
*But* I don't work with the defaults -- I work with my own prompt framework based off of superpowers.
Given sufficient prompt scaffolding, I've found the models relatively interchangeable -- _I might_ be getting some of this for free by basing my own system off of superpowers which is used across various harnesses -- In other words achieving this kind of portability may be a lot harder than it looks and I'm benefiting from other people's work.
It's fine for Claude to be unavailable when there is no work at these hours. However, the problem is Claude gave no notice.
At this rate, Claude being unavailable every day is no better than a human on a 9 - 5 working day job.
I worked with 4.6 and found some improvements for better planning and sustained us, but agree some posters 4.7 is slower, overthinking.
What I expect is frontier models to get bigger and more expensive (especially fast mode like on Cerberus). And most of his get much smaller distillations for the more generous subscription tiers.
https://status.claude.com/
99.02 % uptime
Ouch.
Normally I'd just have it write out what it's doing to a file, if I need to transfer context, but if it goes down mid-session that's a no-go.
I think people have built tools for this, and of course you could reasonably vibe one yourself, but I don't really trust something like that to work reliably or in an ongoing manner.
Maybe it should just be a skill.
Anthropic have blocked usage of your subscription however with third party harnesses.
This is the main reason I use different harnesses, but I also expect (could be wrong) codex is better with codex harness (due to training on it's specific tools) than with other harnesses. I use opencode for everything that's not claude/codex.
Still, it's pretty crazy that Claude is down to 1 nine.
However, when there is an incident it is immediately "human error", not Claude.
> Can’t they prompt Mythos to give them better uptime?
Anthropic is currently "vibe coding" the situation right now.
It's impossible to tell these days whether 4.7 is stuck because it's thinking and Anthropic suppressed all output (seriously, 4.7 will just start making changes without explaining any reasoning - how is that an upgrade?) or because the underlying infrastructure is having issues.
4.5 -> 4.7 feels like going from working with a coach-able, junior engineer that does well with clear guidance to working with a cocky mid-level that will spend too long on pointless tangents and make confidently incorrect changes without any discussion.
Many such cases with humans (given that we continue to compare LLMs to humans these days which you cannot)