Discussion (55 Comments) | Read Original on HackerNews
Getting off the GitHub Actions dependency is a feature, not a bug
At the very least, explore and prepare for alternatives. Map out the dependencies that are not trivial to replace. There are probably fewer of those than you think.
API Requests with 4 nines of availability??
Issues with 99.96% uptime?
PR with 99.61% uptime last 90 days??
https://mrshu.github.io/github-statuses/ marks PRs at 95.89% in the same period as an example.
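For a sense of scale, the uptime percentages quoted above can be converted into hours of downtime over a 90-day window. A minimal sketch: the function name `downtime_hours` is made up for illustration, and the only inputs are the figures quoted in this thread.

```python
def downtime_hours(uptime_percent: float, window_days: int = 90) -> float:
    """Hours of downtime implied by an uptime percentage over a window."""
    return (1 - uptime_percent / 100) * window_days * 24


# Percentages as quoted in the comments above.
quoted = [
    ("API Requests, 4 nines", 99.99),
    ("Issues", 99.96),
    ("PRs, official status page", 99.61),
    ("PRs, third-party tracker", 95.89),
]

for label, pct in quoted:
    print(f"{label}: {pct}% uptime = {downtime_hours(pct):.1f} h down in 90 days")
```

Over 90 days that works out to roughly 0.2 h, 0.9 h, 8.4 h and 88.8 h respectively, so the gap between the two PR figures is about 80 hours of disputed downtime.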
> users typically don’t notice the difference between high reliability and extreme reliability in a service, because the user experience is dominated by less reliable components like the cellular network or the device they are working with. Put simply, a user on a 99% reliable smartphone cannot tell the difference between 99.99% and 99.999% service reliability!
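The quoted claim can be checked with a quick calculation. This sketch assumes the device and the service fail independently and that both must be up for a request to succeed; that model is an assumption made here, not something the quote spells out, and `end_to_end` is a hypothetical helper name.

```python
def end_to_end(*availabilities: float) -> float:
    """Availability of a chain where every component must be up.

    Assumes independent failures, so the availabilities multiply.
    """
    result = 1.0
    for a in availabilities:
        result *= a
    return result


device = 0.99  # the "99% reliable smartphone" from the quote

four_nines = end_to_end(device, 0.9999)
five_nines = end_to_end(device, 0.99999)

print(f"99.99% service behind the device:  {four_nines * 100:.4f}%")
print(f"99.999% service behind the device: {five_nines * 100:.4f}%")
print(f"difference: {(five_nines - four_nines) * 100:.4f} percentage points")
```

Under those assumptions the user sees about 98.990% versus about 98.999%, a difference of under 0.01 percentage points, which is dwarfed by the 1% lost to the device itself.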
I've had a shaky relationship with my ISP of late. What brought me to this thread today is that I couldn't push to GitHub. Notably, this isn't covered by their downtime report, so going by the available facts it's _probably_ not GitHub's fault I couldn't push; I've also just been on my daily stand-up call and got disconnected repeatedly.
But looking beyond today's available facts, odds are there's a bigger problem GH is not mentioning on their status page. They say the current incident has to do with "unauthorized users", and I wonder if pushing a commit from my IDE client counts as an operation from an "unauthorized user", as I still have to authorize with my SSH key.
It's just insane that I can't decide which of GitHub and German o2 should be the more reliable service!
I think there are 3 big themes with this, though not
1. LLM tools have added considerable load.
2. LLMs used by developers to increase velocity seem to be leading to more outages. This calls the increased velocity into question.
3. Roadmaps are focused on pushing things that don't address reliability problems, e.g. GitHub moving to Azure, or adding AI features.
All these same problems happen to orgs with other fads that aren't AI. Following fads is not good engineering.
Absolutely not. Google has reliability practices so deeply ingrained in their company they’re like an involuntary reflex.
This is a management issue.
If you take on load (this is 100% by choice) beyond capacity, then obviously the system collapses.
At a guess, I could imagine some sort of failure of cached pages, which can be cached for signed-out users but probably not for signed-in users (as the rendered HTML would need to include user context like their avatar, etc.).
Sure they can. If Google loads and GitHub doesn't, then it's clearly GitHub being down, not the mobile network.
Also, not everyone uses a phone. My desktop and fibre internet have way better than 99% reliability.
Honestly it's pretty mad to see, especially without a crisp failover.
I know for a fact that ANY other platform would fail faster than GitHub if they had the same volume of HTTP requests.
Why do I keep seeing GitHub-down news on HN?
https://www.githubstatus.com/history
https://isgithubcooked.com/
Could not have happened on a worse day (Monday) and you can see how unreliable GitHub has been.
Better off self-hosting.
[0] https://news.ycombinator.com/item?id=48418183