Back to News
Advertisement
Advertisement

⚡ Community Insights

Discussion Sentiment

48% Positive

Analyzed from 5100 words in the discussion.

Trending Topics

#data#census#privacy#should#don#government#more#why#https#differential

Discussion (169 Comments)Read Original on HackerNews

asoloveabout 3 hours ago
The replies here arguing we should publish it all are wild in the worst kind of first-order thinking way.

It’s a census: it just asks questions.

If you start publishing and weaponizing the data against people with various attributes, they’ll just lie or not answer. And then you are left with worse than nothing: bad data people try to act on.

throwawayffffas3 minutes ago
The easy solution is to just reduce the resolution and scope of the data to the degree it is absolutely necessary. The census exists to inform representation decisions. All other concerns are addons. You can have all the data on the county or voting district level and strip data as you increase your resolution, to the point you only keep population number at the neighborhood, block level.

Knowing the racial, ethnic and socioeconomic background of the residents of a single building block is only useful to discriminate against them.

bagelsabout 2 hours ago
The US Government is the entity that weaponizes the data. The most obvious example is the Census Bureau compiling lists of people of Japanese descent to imprison during WWII. That's just the most obvious one that I know of without looking up more.

The real push for this now is to form lists of people to disenfranchise.

newZWhoDisabout 2 hours ago
Does anyone actually believe this crap?

You think the census is what the government would use to mass identify and imprison people, not the NSA database(s)?

You think homeland security, or the FBI, or any other alphabet agency doesn't already have access to a giant list of people?

Think about what meta knows about everyone, or Google. You do realize that the US gov has read access to their core databases right?

"The census" has absolutely no bearing on any of that which you're worried about.

It's just shocking the level of ignorance that gets upvoted in the comments here now.

falsemyrmidonabout 1 hour ago
You think they wouldn't use every tool available to then, including the census data?
kgwxd16 minutes ago
They haven't done a single thing without malicious intent. Go back and find whatever else you've defended in the past, and look at the results instead of the stated reason/goal for doing them. They won't match. They'll be opposites. You'll rationalize or shift blame, of course. But maybe this time, something decent will get through.
awesomeMilouabout 2 hours ago
Yeah okay fair, I was about to post a knee jerk reaction, but it's well known that the US government can obtain higher quality data by just simply buying it from the public market.
esseph9 minutes ago
[delayed]
willmaddenabout 1 hour ago
I'm not sure why your comment is grayed out.

Cell tower data, credit bureau integration, social media scraping, palantir, smart home device surveillance, DNA database exploitation, facial recognition networks, tax, payroll, passport, visa, medicare/medicaid, immigrations and customs databases and many more...

The census is a historical relic used to jerrymander congressional seats, and that's about it.

FrustratedMonkyabout 2 hours ago
I have to agree. I'd like Census data to be private, but the cat is out of the bag.

I'm all for keeping all of this data private. But to think it isn't already available is a bit 'head in sand'. Maybe put laws in place for 'general' privacy across all data, before getting too inflamed about Census in particular.

smrtinsertabout 2 hours ago
Even at a quick glance this doesn't make any sense. The census is literally how they get the data. Where else would it come from? Drones? Every computer being hacked Michael Bay style?
conceptionabout 2 hours ago
Remember “leftist “ and transgender activists are terrorists now.

First they came for…

whattheheckheckabout 2 hours ago
What's the actual antidote to this? 5calls.org?
bilbo0sabout 2 hours ago
I'm pretty sure they didn't think this through in a comprehensive fashion.

Because making it esy to find all the rich people just seems like a very bad idea given the direction things are going.

When it was broad, the only thing you could do was locate, say, large minority groups. Blacks and latinos for instance. And even that led to problems. I can't imagine what will happen when we can drill down and tease out immigrants from citizens. Gay from straight. Rich from well to do. And so on.

ShinyLeftPadabout 2 hours ago
You first gather the data while people don't know or care. Then you weaponize it later. It happened at least once not long ago in another country, seems not overreaction to be concerned about it
comex33 minutes ago
It happened a year ago in this country, with IRS sharing data with ICE (breaking a longstanding policy of keeping taxpayer data private within the government).
kajmanabout 1 hour ago
If this is a Nazi reference, Census data was used to send people to concentration camps here during the same era. Less awful than death camps, at least.
HumblyTossedabout 1 hour ago
This administration does ... not ... care ... about ... facts.
tbrownawabout 2 hours ago
Any use to identify where government resources are best used, will have people thinking they should have gotten more and would have if they'd answered differently. Ie, that their answers were "weaponized" against them.
glensteinabout 2 hours ago
I guess the way to optimize is to find an equilibrium between an extreme of specificity and an extreme of vagueness that's still actionable from a high-level policy perspective.

Something about this conversation is fundamentally broken if there's no space to iterate towards optimization and instead it's just swinging between maximalist extremes.

webnrrd2k34 minutes ago
There's a pretty good chance the Elon Musk, plus Russia and China have had more-orless unrestricted access to American's data since the DOGE dismantling of US government. Plus, by intentionally removing security and accountability mechanisms it makes it impossible to accurately determine how bad the damage actually was.
cyanydeezabout 2 hours ago
have you not been paying attention for 10 years? At the top of the rotting snakehead they know all this, they arn't arguing in good faith.
mc32about 2 hours ago
You can’t completely trust what people say anyway. There are stated preferences and observed preferences in economics but it applies to other areas of life.
tokaiabout 2 hours ago
>It’s a census: it just asks questions.

Thats what dutch and french bureaucrats thought until 1940.

derektankabout 3 hours ago
The entity most capable of weaponizing demographic data is the government itself. If people weren’t previously providing false information to the census, I’m skeptical that this change is what will push people over the edge.
notfromhereabout 2 hours ago
Congress passed laws that blocked the federal government from fusing data across departments for this specific reason. the admin decided to ignore those, and a friendly congress is deciding to not act on that.

You really, really don't want a government who can build a unified profile on you in that way.

r14cabout 2 hours ago
Isn't the issue here the lack of accountability? Maybe an unpopular opinion, but I don't think its a foregone conclusion that governments are fundamentally corrupt. Ours certainly is and we have a very weak constitution which makes it worse, but that's the US. I think better constitutions are possible, but we have to stop treating it like a sacred document and be practical.
kajmanabout 1 hour ago
I "enumerated" for the last census. Trust in my community was already not high* and I had lots of interesting encounters. I really believed the rather invasive data I was collecting with a friendly face would be used and handled responsibly. I feel for the poor souls that'll sign up to go door to door for 2030 now that the firewalls against weaponizing and monetizing all of our sensitive government data has been torn down, and even more for those that will volunteer information that can hurt them.

The comments that this rather expensive endeavour should just be about getting a head count are also amusing to me. The data collected was such an important baseline of common understanding, and this will not be a good thing for its future quality. I've grown very jaded now seeing all the things taken for granted in this country and lost or degraded recently with a whimper.

*: To be fair, they sent me specifically to places that didn't respond, so I was naturally led to believe that everyone in my region hated the government, ignored bizzarrely threatening fliers, or had recently moved and had no knowledge of the inhabitants (if any) during the census period.

Kim_Bruningabout 2 hours ago
Coming from a certain european country, you never know what answer on the census might get you into trouble.

"What is your religious affiliation". Seems perfectly innocuous, but turned out to be retroactively fatal if your answer could be attributed to you by a certain foreign occupier in the 1940s .

Bratmonabout 2 hours ago
Surely any such foreign occupier would just demand the unredacted data?
iso16315 minutes ago
That's where you hope people like Rene Carmille are around. S
gambiting28 minutes ago
Yes, which is why the government shouldn't have this data at all in the first place.
monitorlizard17 minutes ago
Say you get your way, and, for fear of Mark Carney rolling the tanks in and taking over North America, the US stops collecting any data on its citizens. How is the IRS supposed to know how much tax to expect from you? How is SNAP supposed to determine your eligibility? How is unemployment supposed to know if you're ripping them off or not? Data privacy is a real concern, but you need PII to run government services effectively. Running a state without collecting PII is like running a hospital without collecting any.
well_ackshuallyabout 1 hour ago
France used to make plenty of lists. We loved lists. Lists are good. Jews lists? Sure, it's maybe useful one day when we want to do something.

Boy were the Germans happy to find these.

The American obsession with asking for people their perceived origins (AAPI, AA, Latino, ...) is more than weird: it's downright dangerous. Don't fucking ask these questions, and never, ever write it down, especially not with names.

Thankfully, now they can just buy it from data brokers and let Palantir target, so that makes life easier for them

Rygianabout 2 hours ago
"What is your religious affiliation" makes absolutely no sense in a census exercise. IMO.
twoodfinabout 2 hours ago
The U.S. Census Bureau collects tons of data unrelated to the decennial counting for Congressional apportionment.

https://www.census.gov/programs-surveys.html

The American Community Survey is the most well-known, as it replaced the “long form” sampling that had been an extension to the Census.

yoyohello13about 2 hours ago
Unless you’re a government explicitly and openly aligned with Christian nationalists.
talon8635about 1 hour ago
The point might be going over my head… why does it make no sense?
iririririr7 minutes ago
why it makes sense? please try to answer. what action of the gov would change based on that data?

then, make it so your answer is more valid than if they asked what you usually have for breakfast.

i guarantee you more gov actions can be positively impacted by the breakfast question than the religion one.

the ONLY use for religious data is to get it for free for campaigns.

mschuster91about 1 hour ago
It actually does. Religious affinity can absolutely be useful for longer trend studies, and census data is usually of much, much higher quality than other random sample studies.
WillPostForFoodabout 1 hour ago
Asking about your religion on the census is against the law in the US:

no person shall be compelled to disclose information relative to his religious beliefs or to membership in a religious body.

https://www.congress.gov/94/statute/STATUTE-90/STATUTE-90-Pg...

petilonabout 1 hour ago
Religion is just an example. Don't dwell on it.
swsieberabout 1 hour ago
> compelled

Doesn't that mean they can ask that question with an option for "rather not disclose"?

MinimalActionabout 2 hours ago
Whatever you do, there is a level of trust that is assumed when census takes place. The trust that this data is then not identified in a way that could be targeted for scams, frauds, and other such evils. But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.

Differential privacy is absolutely necessary, and the social scientists being unable to reconstruct the data at an individual level is intended. A macroscopic description is rather enough for most purposes, and anything more is asking for a surveillance state.

mschuster91about 1 hour ago
> But in NY, house sale records are made public but much to the detriment, many mortgage companies fake a bill for payment.

That frankly sounds more like a failure of enforcement, on top of a failure of the construction of the financial system. Here in Germany, it is absolutely not a common thing that mortgages or the banks holding them get sold like a hot potato towards some other sucker, and thus such a letter would cause immediate suspicion.

arjieabout 2 hours ago
Pretty sad, in my opinion. In my ideal the state should have visibility into the shape of the people present so that we can make good decisions about our combined organization. I think we’re making a mistake we will come to regret by intentionally damaging our data collection infrastructure.

I think a large amount of the US’s success is the result of good institutions handling granular data. Policies can be adjusted to match outcomes more rapidly than otherwise.

I understand why people decide to diminish all state capacity - they feel that governments are populated by their opponents who will use state capacity against them. But as our relative strength wanes, our ability to overcome these forces of inertia does as well. And then our governments become less capable and eventually life starts getting worse.

We don’t need house-level data immediately (except perhaps in order to place census blocks within their appropriate congressional district etc). But there are aggregation units above which we should be using as good information as we possibly could be.

tempodoxabout 1 hour ago
> I think we’re making a mistake we will come to regret by intentionally damaging our data collection infrastructure.

Intentionally damaging infrastructure is the recurring theme of this administration.

bee_riderabout 1 hour ago
I’d be more interested in giving my state detailed info, letting them run programs. The country can have aggregate data.
Bratmonabout 2 hours ago
But this article is about a decision to damage the census less. If you value an accurate census, you should be celebrating!
swiftcoderabout 1 hour ago
TFA lays out why things don't work that way. If you erode trust in the privacy of census responses, an awful lot of folks will have to start lying on their census
jmoleabout 3 hours ago
Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.

I don't know what the political undertones are here, but at some level you need to have actual ground truth, including "this person/household declined".

Publishing raw data though? That seems like shooting yourself in the foot from a national security perspective, not to mention all the other reasons not to do it.

glitchcabout 2 hours ago
> Ban it from the dataset, add it to the analysis. You can choose your own flavor of noise.

It is introduced in the public data, not the secret data.

tbrownawabout 3 hours ago
> Differential privacy makes this trade-off explicit, and thus impossible to ignore. Maybe banning it is a way of pretending that the problem doesn't exist, in the hope that it will go away?

Or it's saying that one of these conflicting goals is more valuable than the other, and so shouldn't be sacrificed for it.

sherburt318 minutes ago
So "differential privacy" pretty much sounds like someone gets to modify the results of a census and how it gets modified is entirely up to their discretion.

Seems like something that could be abused to achieve political objectives.

iugtmkbdfil834about 2 hours ago
Can anyone explain to me the previous state and why it was desirable? I admittedly do not understand why people are getting riled up. I am not being difficult. I really don't understand the original state and the changed state here.
layer8about 1 hour ago
foolfoolzabout 2 hours ago
i have such a hard time reconciling stuff like this:

> The census bureau decided to adopt differential privacy for the 2020 Census

and:

> The consequences will be dire for utility or for privacy, and possibly both. It's hard to understate this point: future statistical releases will either be useless compared to past ones, or they will be incredibly unsafe

so we took the census for centuries before this point, and it was “ok.” and for the last census only we added some privacy items. but if we remove just one of those filters, we are in “dire” circumstances? but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.

this makes it feel like an emotional overblown problem

vlovich123about 2 hours ago
Believe it or not, mathematical techniques and computational power have increased in the past hundreds of years, not to mention the digitization of everything.

Privacy issues that weren’t possible before due to cost are now pennies to exploit. Also keep in mind as it points out people were using census data to drive gerrymandering efforts, so these attacks are real and have been going on for a long time.

antasvaraabout 1 hour ago
> but there were no privacy features before. so we’re actually still much better off than we were for hundreds of years before this.

One notable thing we have today that we didn't have 100 years ago is a computer. Before, you could reasonably assume that recreating individual records wasn't feasible, at least not on a large scale. You can't assume that now. A 4 digit password was safe for hundreds of years, but it would be a security lability today for the same reason.

hristov36 minutes ago
As the article clearly states, privacy features have been in the census since 1990. It is just that the previously used privacy feature was not very strong and could be defeated. So it was replaced by a stronger feature in 1920. Before 1990 the census. 1990 was when personal computers were being popularized and the computing power available to individuals exploded and so then it was possible to use computers to separate out individual information from the data the census publishes. So the issue came up then.

No it is not an overblown problem.

baqabout 1 hour ago
For decades we were encrypting our communications with rsa, surely nothing is wrong with it?
gkbrk38 minutes ago
There is nothing wrong with it, and RSA is still commonly used. In fact, RSA is better against quantum computers compared to ECC.
LPisGoodabout 2 hours ago
The concerns here, like most concerns about privacy, are hyperbolic hypothetical hypochondria, until they’re not.
declan_roberts13 minutes ago
Census data is extremely powerful. It's why some states lost house seats and why some gained house seats.

It must therefore be maximally transparent. Do you want president Trump or palantir to decide on the "noise infusion" algorithm?

Advertisement
thih9about 2 hours ago
I guess this could be implemented externally.

Eg via some app that instructs respondents to enter a specific answer in a pseudorandomly chosen question.

Of course security would be another question.

ProllyInfamousabout 2 hours ago
The fines for non-compliance are low enough to remain silent.

Do. The American Census Survey (randomly-selected long-form questionairre) is dangerously overinvasive.

watersbabout 3 hours ago
The better to sell the data, all your privates are belong to us.
lokarabout 2 hours ago
Can anyone share how other countries handle this?
simonwabout 2 hours ago
A lot of countries are really bad at running their census. https://asteriskmag.com/issues/11/why-governments-cant-count
ghaffabout 1 hour ago
And a lot of countries have things like national IDs that, rightly or wrongly, given things like RealID and passports, that a lot of Americans just don't like on principle.
peloratabout 1 hour ago
Sure, in Europe we don't because we already have databases of all citizens, also recording attributes like race, skin color, religious affiliation or political leaning in a database is highly illegal, both for the government and for private use.
throw-the-towel17 minutes ago
Wait, are you saying Europe doesn't have censuses?
ThePhysicistabout 2 hours ago
I think it should be noted that there was a lot of dissatisfaction from users of the census data as far as I know. So it's not been banned just for politicals sake or because they hate privacy... Some people I talked to in the privacy field even called the whole thing a total disaster and weren't shy to put blame on John Abowd who apparently pushed this through despite a lot of internal opposition and concerns. Not sure if that's true, but what is definitely true is that the way the data was released produced serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values. Differential privacy was applied in a way such that many invariants that data users cared about weren't preserved, which was expected as it's not possible as you can't preserve all invariants and at the same time add meaningful noise to the data. The thing is, with such a differentially private data release you need to adapt all of the downstream analyses to take into account the exact mechanism the data was altered in. And since the census bureau used a very intricate mechanism that didn't just add Laplace noise to data values but instead relied on a multi-stage process that preserved some invariants but not others it was very difficult to even write routines to account for the changes being made to the data. They essentially asked of every data user to rewrite their whole analysis pipeline based on the exact disclosure mechanism that contained a large number of bespoke choices regarding which data invariants to preserve and basically produced a mix of noisy, synthesized data that was just really hard to reason about. I don't even know if there even would've been a way to do this better, but the fact is that not every small county or school district has top-tier statisticians at hand that can just read a whole monograph on differentially private synthesized census data and then hotpatch their existing analysis systems to work with that data.

I was a big fan of differential privacy but now I think it might be doing more harm than good, as I haven't seen a single case where it was applied successfully in a problem where it actually mattered, and it contributed strongly to discrediting and preventing a lot of work on other anonymization techniques as it was deemed the only way to preserve privacy by the research community, so showing up with enhancements to k-anonymity or any other noise mechanism not rooted in it was a sure way to get ridiculed and ignored. And it's just not a practical mechanism, even when it works for a single disclosure you always end up having to blow up the privacy budget to a ridiculous amount in order to keep disclosing statistics as otherwise you would for almost all real-world data run out of budget after a few publications.

So, for me it's a technique that works in the areas where it doesn't really matter (publishing highly aggregated statistics that pose almost zero privacy risk even without differential privacy) and doesn't work in other areas where it would actually matter (publishing fine-grained data about individuals or small groups). There are some niche use cases but in my view the privacy community has really overblown the importance of differential privacy by portraying it as the only way to reliably anonymize data.

BTW the German census bureau has an interesting approach to anonymization which they use for several decades already and so far I haven't heard of any cases of successful de-anonymization of the data, maybe the US bureau should have a look at that for their own needs.

hristov30 minutes ago
Of course there will be dissatisfaction from users of the data. Anyone that wants to use census data will prefer less privacy in the data. And anytime privacy is enforced the data becomes less useful. It would be certainly very convenient for both advertisers and gerrymandering political consultants to have detailed data on every citizen.

As the article says anytime you want to enforce privacy, the data becomes somewhat less useful, there is just no way around that.

The point of rights is that we have them and that they should not be trampled upon when they become slightly inconvenient to someone in power.

swiftcoderabout 1 hour ago
> serious issues downstream as most researchers and statisticians that ingested the data weren't prepared for receiving noisy data values

They weren't prepared for data that was obviously noisy. The data has always been inherently inaccurate, and folks just chose to ignore that previously

ThePhysicistabout 1 hour ago
No, there are dozens of articles discussing the mechanism and explaining the impact it had in different areas e.g. [1,2,3]. And the release mechanism wasn't just "add noise", far from it, you may read the original paper [4] to see how intricate it was, anyone wanting to make real use the resulting data would have needed to understand that approach in detail to work with the resulting data. The report of the national academies [3] is probably the most comprehensive analysis of the mechanism and the complications it introduced, so writing "it has always been inherently inaccurate" is just wrong, this new mechanism was way worse than just introducing unbiased sampling noise.

1: https://www.aeaweb.org/articles?id=10.1257%2Fpandp.20191107&... 2: https://www.science.org/doi/10.1126/sciadv.abk3283?utm_sourc... 3: https://www.nationalacademies.org/read/27150/chapter/14

4: https://hdsr.mitpress.mit.edu/pub/7evz361i/release/2

mikelitorisabout 1 hour ago
But why?? Differential privacy works? It's not even "woke" or whatever these people perceive. It's just math man...
delichonabout 3 hours ago
The dueling political demands of accuracy and privacy are simply incompatible at some level. After reading this, maybe Hanlon's Razor isn't the right standard. Besides malice and stupidity, there is impossibility. Some problems just aren't solvable under certain constraints. I don't envy the statisticians tasked with finding a politically palatable solution to a math problem.
Sol-about 3 hours ago
But the strength of differential privacy is that you can now make this tradeoff explicit and quantify it. I always liked it because it offers a mathematical solution to a policy problem, but then of course it's up to us to decide what parameters and tradeoff to choose. Also, some data might just not get published at all if the privacy implications are too problematic, so differential privacy might buy you more signal!
thatfrenchguyabout 2 hours ago
Yeah, the main issue with differential privacy is that you need competent government officials making decisions who understand math beyond a high school level.
tbrownawabout 2 hours ago
It offers a mathematical description of a policy tradeoff, and the policy makers are apparently setting one of the parameters to zero.
ghaffabout 3 hours ago
There's a ton of information in the US that is accessible to various degrees--especially through the the deep web much less background investigations. Unless you're a wealthy person who can set up various levels of trusts you can't really hide them.

You can of course disagree about what what should actually be part of a transparent public record. (Though I suspect a lot of people post-date what was generally available in a "phone book.")

SpicyLemonZestabout 1 hour ago
I really have to take the anti-noise side here. I get why it's a hard problem, and I get why the Census Bureau thought this was a neat solution. But I'm imagining an accountant stepping through a similar chain of logic:

* I want to accurately report the finances of our company to the best of my ability.

* But that report would allow people to reconstruct private data about the terms of our contracts with various counterparties. I'd really like to avoid that, there's no rule that says we're supposed to release that data. In fact some of those contracts probably came with nondisclosure agreements!

* So here's what I'm going to do. I'm going to calculate our results to the best of my ability, and then I'm going to add random values to them and report only the randomized ones. Any reconstruction people try to do will be wrong because of the randomness.

* If the SEC says "no, you need to report your actual numbers", I will explain to them that there's no such thing as an actual number because all data is noisy.

I can't get behind it.

wnc3141about 3 hours ago
Stalin's demographic researchers kept disappearing until they came up with the numbers he wanted.
Advertisement
ck2about 2 hours ago
if you want to keep your sanity, I suggest silently adding the phrase

     "...for the next 950 days" 
every time you read some politically spiteful news like this

because the next two years are going to become insanely miserable

layer8about 1 hour ago
It’s highly uncertain what will happen in 950 days.
zkzk_gamalabout 2 hours ago
i think they will use ai as a leverage card to other country to order them
yegortkabout 1 hour ago
Data shall set you free... or not
xenophonfabout 3 hours ago
This is a gift to reactionary gerrymandering and voting restriction efforts, along with things like yesterday's FBI raid of an Ohio voting rights organization.

https://www.statenews.org/government-politics/2026-06-12/ohi...

Representative Joyce Beatty is from Ohio and was instrumental in stopping Trump from illegally renaming the Kennedy Center.

https://www.theatlantic.com/culture/2026/06/kennedy-center-b...

thepryzabout 2 hours ago
Representative Beatty serves her own interests and her involvement Kennedy Center naming was just more of the same performative politics she routinely engages in. She's on the verge of being an octogenarian and missed a number of key votes, like the bill that cut funding to NPR, PBS, and other govt. programs. Kudos to her for working to remove Trump's name from the Kennedy Center but she needs to go.
smrtinsertabout 2 hours ago
The removal of his name is not performative since we're in the thick of a cult of personality president (at a bare minimum).
Pragmataabout 3 hours ago
Frankly i see no reason to keep this data private. They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...

Fundamentally this is public data. If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.

There are very few things that the state has data on that should not be made public. Census data is simply not one of those things.

publishing should be the default for any data, and to keep it unpublished should require substantially good reasons that impact the country as a whole. Frankly, if it isn't detailed national defence plans, i struggle to see any data that should not be public.

simonwabout 3 hours ago
How hard have you thought about this?

The biggest challenge with running a census is getting people to trust you enough to answer your questions.

A lot of census questions are sensitive. The ACS covers topics like citizenship status, disabilities, income, SNAP assistance, languages spoken at home.

If you want accurate information about the people who live in your country you need the census process to feel as safe for people to respond to as possible.

Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.

wpollockabout 2 hours ago
> Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.

I'll say that. The state representatives should provide congress and the president any data needed to inform policy decisions about the people they represent. And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.

Except for gerrymandering purposes, I fail to see why income, party affiliations, etc., is useful for the purpose the census was created for.

simonw44 minutes ago
The census doesn't collect party affiliations.

https://www.census.gov/topics/public-sector/voting/about/faq...

> the CPS Voting and Registration Supplement does not ask any questions of a partisan nature.

nojitoabout 2 hours ago
>And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.

There are laws in place forbidding government agencies from merging together datasets.

The last thing people should support is creating of profiles of individuals by combining data from different government agencies. This is why the census is so important as a data collection mechanism.

jonhohleabout 3 hours ago
This seems’s like an issue created by congress. the constitution only requires a headcount by state. Maybe they should use another mechanism to collect demographic data. Since the concern is not about representation, but allocation, tax returns seem like an obvious alternative and they are already private and collected at a much more granular level.
simonwabout 2 hours ago
I don't think the question "Has this person given birth to any children in the past 12 months?" would look good on a tax return.
SoftTalkerabout 3 hours ago
The census isn't for helping the country make any decisions other than determining the number of representatives and apportionment of taxes. It should not be collecting any data that isn't necessary for that.
simonwabout 2 hours ago
https://constitution.congress.gov/browse/article-1/section-2...

> The actual Enumeration shall be made within three Years after the first Meeting of the Congress of the United States, and within every subsequent Term of ten Years, in such Manner as they shall by Law direct.

The key thing you're missing is "in such Manner as they shall by Law direct".

Congress has passed a whole bunch of laws that attach additional responsibilities to the census for the purpose of supporting government decisions.

The Permanent Census Office Act of 1902 for example, which established the census office and tacked on "an annual survey of cotton production, and other economic censuses" https://www.census.gov/about/history/historical-censuses-and...

pstuartabout 2 hours ago
That's not true, they also wanted to get an understanding of who they were governing.
Telemakhosabout 2 hours ago
I'd like to know when they stopped publishing census data. I have used it for genealogical purposes to track ancestors: you can see exactly who was living in which house, how they are related, and what their ages are (I found that women in my family often reported, both on the census and marriage documents, being younger than they actually were). I don't think I've seen data from after 1950, though.

I don't understand why the census would include SNAP data or income: surely the government already has that information. I have never doubted that the IRS knows my income better than I do. Maybe better use of existing datasets could restrict the census to less invasive questions.

Polizeiposauneabout 2 hours ago
They haven't stopped but they don't happen immediately.

Detailed census records are published 72 years after they were collected; the last release (of 1950 census data) came out in 2022; the next one should be published in 2032.

See: https://www.archives.gov/research/census

philipkglassabout 2 hours ago
They didn't stop publishing census data. Its publication is delayed for approximately one human lifetime, to avoid affecting the living:

https://prologue.blogs.archives.gov/2022/01/20/census-record...

personofinteresabout 2 hours ago
The Census Bureau is a lot more than the 10-year Census, and it already makes very extensive use of IRS data and other administrative sources. Virtually everything that is published using these sources uses either differential privacy or other privacy protection methods that are prohibited by the order. I'm guessing that a lot of those pieces of data are just going to be put on hold until the order is reversed or weakened. A number of things might have to go away permanently, as there's almost certainly no way to protect privacy in them without some kind of noise infusion.

TBH I don't think the people who wrote this knew how much collateral impact it would have.

mobeetsabout 3 hours ago
Thank you for writing a much more thoughtful reply to this comment than I was drafting
bpt3about 3 hours ago
Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.

I don't trust the Census Bureau with my data, so if this is as "dangerous" as the author and some people here seem to think, they shouldn't be collecting it in the first place.

tbrownawabout 2 hours ago
> Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.

This works by the same principle as how nobody ever drives faster than the speed limit.

CAP_NET_ADMINabout 3 hours ago
1. People give the information to the government under the expectation that this data is to be kept private or used in such a way that individual targeting is made impossible, you break that expectation and people will lie or won't give you this data.

2. Without noise injection it's rather simple to do statistical attacks to reverse engineer individual entities.

3. This data is and has already been used in the past to undermine democratic systems by targeting and disenfranchising minorities, as well as gerrymandering the US to hell.

4. "Too dangerous to make public, too dangerous to collect" - this is a false dichotomy. To govern effectively you need sensitive data, but it should be collected and used in a way that's safe for the individuals.

5. Macro level aggregates don't need individual exposure, that's why noise, anonymization and statistical functions are fine.

lokarabout 2 hours ago
Re point 1, not just an expectation, and explicit legal requirement.
toast0about 3 hours ago
> They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...

They do. After a substantial delay. Pretty handy for geneological research, while protecting privacy for the living.

halJordanabout 3 hours ago
That's a good default position, and I think should be our starting point.

But the devil is in the details. If we don't want advertisers constructing semi-complete profiles from simple web interactions then why would we publish 330 million census questionnaires for their use?

fwipsy43 minutes ago
So do you believe that individual income should be public? Or do you believe that the government should not take income into account for taxation or distribution of benefits?
righthandabout 3 hours ago
Then dox yourself right now with your previous census answers and PII. There are several obvious reasons to keep the data private, all you have to do is use your brain.
anonymarsabout 2 hours ago
I've never met a "privacy is irrelevant" advocate that doesn't close the door when they go to the toilet
UqWBcuFx6NV4rabout 3 hours ago
Don’t quit your day job. One guess as to what gender, sexual orientation, and skin colour you have.
SoftTalkerabout 3 hours ago
But why is the census asking about those attrbutes at all. The Constitution requires a count. That's it. A number. We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
glitchcabout 2 hours ago
> We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.

But we do. A detailed census is essential for making good policy. For example, knowing the age and distribution of children across the country helps local and state governments decide where to put the next school or children's hospital. The federal govt. allocates funds for education and daycare accordingly.

The census is the best and most important measure of govt. policy. Taking it away would leave everyone worse off.

righthandabout 3 hours ago
The census is already voluntary LOL. So we’d have two censuses?
whatever1about 3 hours ago
We can make them more accurate by leveraging ICE going door to door.