AI Accountability

Benefits by Algorithm: The Welfare State's Black Box Problem

When AI decides who gets food, housing, or healthcare, the question of explainability stops being academic.

By Jakub Jirák Jan 17, 2027 6 min read

ai-accountabilitywelfare-policyalgorithmic-decision-makingpublic-servicesai-governance

Consider what it means to be told no by a machine. Not the minor no of a rejected password or a failed card transaction. The no that means your rent assistance has been terminated. The no that means your disability support has been reduced by 40%. The no that comes with a reference number, a PDF generated by a system whose name you don’t know, and a phone number that leads to a call center where the person on the other end has no more access to the underlying logic than you do.

This is not a hypothetical. It is the operational reality of benefits administration in 2027 across most of the developed world. Governments that spent decades struggling with the cost and inconsistency of human case management have found, in machine learning, a solution that is fast, cheap, and scalable. The fact that it is also frequently wrong, and that the wrongness is often invisible until catastrophe strikes, has not yet registered as a political emergency. It will.

The seduction of administrative efficiency

The appeal of algorithmic benefits administration is straightforward and not entirely dishonest. Human case management is expensive. It is inconsistent — the same application assessed by two different caseworkers in the same department on the same day can produce different outcomes, which means the system’s outputs depend partly on who happens to review your file. Human caseworkers carry biases, have bad days, have caseloads that leave no time for genuine engagement with complex situations. Replacing some of that with algorithmic triage seems like a sensible use of technology.

The Dutch government made this argument when it deployed SyRI (Systeem Risico Indicatie), a system that cross-referenced 17 government datasets to identify households at high risk of committing welfare fraud. The system flagged people based on combinations of factors — living in certain postal codes, having a foreign-sounding surname, owning a car above a certain value — without disclosing the weighting. A Rotterdam district court ruled it illegal in 2020, finding that it violated the European Convention on Human Rights because citizens had no meaningful way to challenge conclusions they couldn’t see. The Netherlands had been running it since 2014.

New Zealand deployed a predictive risk model in 2014 to score children at birth for likelihood of future involvement with child protection services. The scores were derived from parents’ welfare history, criminal records, and demographic data. The system was shut down before full deployment after a public outcry, but not before the data had been collected. Australia’s “Robodebt” scheme — which automatically raised debt notices against welfare recipients by averaging income data in ways that bore no relationship to actual debt — ran from 2016 to 2019, issued 470,000 notices, was declared unlawful, and resulted in a class action settlement of AU$1.8 billion. The algorithm was wrong. It ran for three years before anyone with authority stopped it.

The systematic errors are not random

What’s telling about these cases is that the errors are not distributed uniformly across populations. Algorithmic benefits systems fail in patterned ways that track demographic fault lines. This is not surprising — machine learning models trained on historical data reproduce historical patterns, and historical welfare administration in most countries was shaped by explicit and implicit discrimination. Train a model on 50 years of biased outcomes and it learns the bias.

The US Department of Housing and Urban Development documented in 2022 that AI-assisted homelessness services prioritization tools consistently ranked Black and Indigenous applicants lower than comparably situated white applicants, because the training data came from a period when housing services were administered discriminatorily. The model had learned which populations historically received assistance and was predicting future assistance allocation on that basis, creating a self-reinforcing loop.

The Allegheny County Family Screening Tool in Pennsylvania, used since 2016 to assess child abuse and neglect risk, has been studied extensively. Researchers found in 2023 that it over-predicted risk for low-income Black families and under-predicted it for affluent white families — not because it was designed to discriminate, but because it used prior involvement with child protective services as a predictor variable, and prior involvement correlated with poverty, which correlated with race. The system was accurately predicting who would be surveilled, not who was at risk.

This is a general problem with using historical administrative contact as a predictor. The populations that have been subject to more administrative scrutiny will score higher on any model that uses scrutiny as a proxy for risk. The model then generates more scrutiny, more data, higher scores — the cycle compounds. Poverty surveillance becomes a proxy for risk in ways that embed and amplify historical patterns.

The explainability problem in practice

European law, specifically the GDPR’s Article 22, nominally gives citizens a right not to be subject to solely automated decision-making, and a right to an explanation when automated decisions do occur. The UK’s retained data protection framework contains similar provisions. These rights look robust on paper and turn out to be largely hollow in practice.

“Explanation” in administrative AI typically means: the system generated a score of 73, which exceeded the threshold of 70, so the application was denied. The recipient learns the threshold but not the weight of the variables. They cannot determine which aspect of their situation drove the score. They cannot know whether the model would have reached the same conclusion if a single fact had been different. They cannot reproduce the calculation. The explanation is procedurally present and practically useless.

The harder problem is that in many cases the vendors don’t want to explain the model because the explanation is proprietary, and the government agencies don’t want to explain it because the explanation might reveal decisions they’d rather not defend publicly. The Michigan unemployment fraud system that generated 40,000 wrong determinations ran for years partly because neither the state nor the vendor was enthusiastic about transparency into why the system was doing what it was doing.

When the UK Home Office eventually disclosed details about its visa streaming algorithm under legal challenge in 2020, the explanation made things worse rather than better. It revealed that nationality was being used as a determinative factor in ways that a human decision-maker could not legally have applied. The algorithm was doing things that would have been illegal if a caseworker had done them consciously. The automation had provided cover for discrimination that was structurally invisible until someone forced disclosure.

What accountability actually requires

Fixing this is not primarily a technical problem, though technical choices matter. Explainability methods — SHAP values, LIME, counterfactual explanations — exist and are mature. The question is whether governments require their use, fund their implementation, and create institutional channels for the explanations to be acted on.

What actually works, in the cases where it works, looks like this: mandatory logging of every automated decision and the variables that drove it; a legal right for applicants to receive that log in a readable form; an independent review body with the technical capacity to audit model performance systematically; clear statutory liability when the model produces discriminatory outcomes; and a public reporting requirement that makes aggregate error rates visible before class action litigation forces the disclosure.

Several EU member states are moving toward this under pressure from the AI Act, which came into force in 2024. The high-risk AI system classification includes benefits determination, which means systems operating in that space need human oversight mechanisms, conformity assessments, and post-market monitoring. The enforcement infrastructure is still being built. Whether the requirements translate into genuine accountability or become a compliance exercise will depend on choices that haven’t been made yet.

The United States has no equivalent federal framework. Individual states are the locus of action, and state-level reforms have been driven almost entirely by litigation rather than proactive policy. The UK post-Brexit is developing its own framework with less prescriptive requirements than the EU — which sounds like regulatory flexibility but functions in practice as a lower floor on accountability.

The uncomfortable conclusion is that the welfare state’s black box problem is a political choice, not a technical inevitability. Every government deploying AI in benefits administration is choosing, by default, the level of accountability it accepts. Most are choosing very little. The people absorbing the consequences of that choice are the ones who needed the benefits in the first place — which is why the political pressure to fix it remains less than the harm would suggest it should be.