Group attribution bias is a cognitive bias in which people assume that the characteristics or behaviors of an individual are representative of the entire group to which that individual belongs, or that a group's collective decision reflects the personal attitudes of every member within it. In the context of artificial intelligence and machine learning, group attribution bias refers to the tendency of algorithms and models to systematically generalize traits, outcomes, or behaviors observed in some members of a demographic group to all members of that group. This can produce stereotyping, unfair predictions, and discriminatory outputs that disproportionately affect marginalized populations.
Group attribution bias is one of several human cognitive biases that can become embedded in training data, learned by models during training, and amplified through deployment at scale. Understanding this bias is central to the fields of algorithmic fairness, responsible AI, and AI ethics.
Imagine you meet one kid from a school across town, and that kid is really loud. If you then think "all the kids at that school must be loud," you have made a group attribution error. You judged the whole group based on just one person.
Computers can make the same kind of mistake. When a computer learns from data, it might notice that some people in a certain group did something and then start treating everyone in that group the same way. For example, if a computer learns from old job applications where mostly men got hired, it might start thinking men are better for the job, even though that is not true. The computer is not being mean on purpose; it just learned a pattern that was unfair to begin with.
Group attribution bias has its roots in social psychology research on how people explain the behavior of others. The concept draws on two related but distinct phenomena: generalizing from an individual to a group, and generalizing from a group decision to individual members.
The first form of group attribution error occurs when people observe the behavior of a single group member and generalize that behavior to the entire group. A foundational study by Ruth Hamill, Timothy Wilson, and Richard Nisbett in 1980 demonstrated this effect. In their experiment, participants read a vivid account of a welfare recipient. Some participants were told the individual was typical of welfare recipients; others were told the individual was highly atypical. Regardless of the typicality information provided, participants who read the vivid story formed more negative attitudes toward welfare recipients as a whole than participants who did not read it. This showed that people generalize from individual cases to entire groups even when explicitly told the case is unrepresentative.
The second form was formally identified by Scott T. Allison and David M. Messick in their 1985 paper "The Group Attribution Error," published in the Journal of Experimental Social Psychology. Allison and Messick found that people tend to assume a group's decision reflects the personal preferences of each member in that group. In their experiments, participants were presented with policy decisions made at the national, state, and local levels under varying conditions: decisions made by a single leader with no vote, decisions made by a popular vote of roughly 50% of the population, and decisions made by a vote of over 90%. Across all conditions, participants assumed the group decision reflected the attitudes of the individual members, even when the decision-making process should have weakened that inference.
Group attribution bias is closely related to several other attribution biases studied in social psychology.
| Bias | Description | Key researcher(s) |
|---|---|---|
| Fundamental attribution error | The tendency to attribute others' behavior to their personality or disposition rather than to situational factors | Lee Ross (1977) |
| Ultimate attribution error | Extending the fundamental attribution error to groups: attributing negative outgroup behavior to disposition and positive outgroup behavior to luck or special circumstances | Thomas F. Pettigrew (1979) |
| Group attribution error (Type I) | Generalizing traits of a single group member to the entire group | Hamill, Wilson, and Nisbett (1980) |
| Group attribution error (Type II) | Assuming a group decision reflects the attitudes of every member | Allison and Messick (1985) |
| In-group bias | Favoring members of one's own group over members of other groups | Henri Tajfel (1979) |
| Out-group homogeneity bias | Perceiving members of other groups as more similar to each other than members of one's own group | Various researchers |
Thomas F. Pettigrew's 1979 paper, "The Ultimate Attribution Error: Extending Allport's Cognitive Analysis of Prejudice," described a systematic pattern in how people make attributions about in-group and out-group members. This framework is directly relevant to understanding how group attribution bias operates in both human cognition and AI systems.
Pettigrew identified an asymmetric pattern. When an out-group member behaves negatively, the behavior is attributed to internal, dispositional causes (such as character flaws or inherent tendencies). When an in-group member behaves negatively, the same behavior is attributed to external, situational causes (such as bad luck or difficult circumstances). The reverse holds for positive behavior: in-group successes are attributed to internal qualities, while out-group successes are explained away.
Pettigrew outlined four ways people dismiss positive behavior by out-group members:

1. Treating the actor as an exceptional case, unrepresentative of the group
2. Attributing the behavior to luck or an unfair special advantage
3. Attributing it to unusually high motivation and effort
4. Attributing it to the demands of the particular situational context
Empirical studies have supported this framework. In one study examining Hindu and Muslim participants in India, each group attributed undesirable acts committed by members of the other group to internal (dispositional) factors, while attributing the same undesirable acts committed by their own group to external (situational) factors.
Group attribution bias can enter AI systems through multiple pathways. The bias does not require any intentional prejudice from developers; it arises from systematic patterns in the data, the design of models, and the feedback loops created by deployment.
The most common pathway is through training data that reflects historical and societal biases. If the data used to train a model contains patterns where certain outcomes are correlated with group membership (for example, race, gender, age, or socioeconomic status), the model will learn these correlations and reproduce them in its predictions. Because machine learning models optimize for patterns in the data, they treat group-level statistical associations as predictive features, effectively generalizing from observed group-level trends to individual predictions.
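As a toy illustration, the following sketch trains a scikit-learn classifier on synthetic data (all feature names and numbers are hypothetical) whose labels encode a historical hiring disparity. At identical qualification levels, the model assigns a lower score to members of the disadvantaged group, because group membership helped predict the biased historical outcome.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 10_000
group = rng.integers(0, 2, n)             # 0/1 protected-group membership
skill = rng.normal(0, 1, n)               # true qualification, independent of group
# Historical labels: equally skilled members of group 1 were hired less often.
hired = (skill - 0.8 * group + rng.normal(0, 1, n)) > 0

X = np.column_stack([skill, group])
model = LogisticRegression().fit(X, hired)

# Same qualification, different group: the predicted hiring probabilities differ.
probe = np.array([[0.0, 0.0], [0.0, 1.0]])
print(model.predict_proba(probe)[:, 1])   # group 1 receives the lower probability
```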
A landmark study by Aylin Caliskan, Joanna J. Bryson, and Arvind Narayanan, published in Science in 2017, demonstrated that word embeddings trained on standard internet text corpora contain the same implicit biases found in human psychology. Using a method called the Word Embedding Association Test (WEAT), modeled after the Implicit Association Test (IAT) used in psychology, the researchers showed that word embeddings associate European American names with pleasant terms and African American names with unpleasant terms, and associate male names with career terms and female names with family terms. These associations mirror human implicit biases measured by the IAT.
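The WEAT statistic itself is straightforward to compute. The sketch below follows the paper's definition (each target word's mean cosine similarity to two attribute sets, summed over the two target sets), but it runs on random toy vectors rather than real embeddings, so its output is meaningful only as a demonstration of the computation.

```python
import numpy as np

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def association(w, A, B):
    # s(w, A, B): mean similarity of w to attribute set A minus attribute set B
    return np.mean([cosine(w, a) for a in A]) - np.mean([cosine(w, b) for b in B])

def weat_statistic(X, Y, A, B):
    # s(X, Y, A, B): differential association of the two target sets
    return sum(association(x, A, B) for x in X) - sum(association(y, A, B) for y in Y)

rng = np.random.default_rng(0)
vec = lambda: rng.normal(size=50)
X = [vec() for _ in range(8)]   # e.g. European American names
Y = [vec() for _ in range(8)]   # e.g. African American names
A = [vec() for _ in range(8)]   # e.g. pleasant terms
B = [vec() for _ in range(8)]   # e.g. unpleasant terms
print(weat_statistic(X, Y, A, B))  # ~0 for random vectors; nonzero in real embeddings
```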
In 2016, Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, and Adam Kalai published "Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings" at NeurIPS (then called NIPS). They showed that word embeddings trained on Google News articles encode gender stereotypes: the vector analogy "man is to computer programmer as woman is to X" returned "homemaker." The embeddings systematically associated male terms with technical and leadership roles and female terms with domestic and support roles.
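The analogy probe is plain vector arithmetic: find the vocabulary word closest to b − a + c. Below is a minimal sketch on toy embeddings (a real analysis would load pretrained vectors such as the Google News word2vec embeddings used in the paper):

```python
import numpy as np

def analogy(emb, a, b, c):
    # "a is to b as c is to ?": nearest neighbor of b - a + c, excluding the inputs
    target = emb[b] - emb[a] + emb[c]
    target = target / np.linalg.norm(target)
    scores = {w: float(v @ target) for w, v in emb.items() if w not in (a, b, c)}
    return max(scores, key=scores.get)

# Toy unit vectors so the sketch runs; with the Google News embeddings,
# analogy(emb, "man", "programmer", "woman") is what returned "homemaker".
rng = np.random.default_rng(0)
emb = {}
for w in ["man", "woman", "programmer", "homemaker", "doctor", "nurse"]:
    v = rng.normal(size=50)
    emb[w] = v / np.linalg.norm(v)
print(analogy(emb, "man", "programmer", "woman"))
```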
Even when sensitive attributes like race or gender are removed from training data, models can learn to discriminate through proxy variables. A proxy variable is a feature that is correlated with a protected attribute. For example, zip code can serve as a proxy for race due to residential segregation, and name can serve as a proxy for gender or ethnicity. Models trained on such features can reproduce group-level biases without ever explicitly using the protected attribute.
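The proxy effect can be demonstrated in the same synthetic setting as the earlier sketch: the protected attribute is removed from the features, yet a correlated stand-in (here a hypothetical zip-code-derived feature) lets the model reproduce the disparity.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 10_000
group = rng.integers(0, 2, n)
zip_feature = group + rng.normal(0, 0.3, n)  # proxy: strongly correlated with group
skill = rng.normal(0, 1, n)
label = (skill - 0.8 * group + rng.normal(0, 1, n)) > 0  # historically biased labels

X = np.column_stack([skill, zip_feature])    # protected attribute is NOT a feature
model = LogisticRegression().fit(X, label)
pred = model.predict(X)

print("positive rate, group 0:", pred[group == 0].mean())
print("positive rate, group 1:", pred[group == 1].mean())  # lower, via the proxy alone
```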
Deployed AI systems can create feedback loops that amplify group attribution bias over time. If a predictive policing algorithm directs more police to neighborhoods with higher historical crime rates (which often correlate with race), more arrests occur in those neighborhoods, generating more data that reinforces the original pattern. The model's predictions become self-fulfilling, compounding the bias with each cycle.
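The dynamic can be sketched as a small simulation (all numbers hypothetical). Two neighborhoods have identical true crime rates, but one starts with more recorded arrests; because patrols are allocated from arrest counts and arrests are only recorded where patrols go, the initial disparity is perpetuated rather than corrected.

```python
import numpy as np

rng = np.random.default_rng(0)
true_rate = np.array([0.10, 0.10])   # identical underlying crime rates
arrests = np.array([200.0, 100.0])   # biased historical record: neighborhood A over-policed

for _ in range(20):
    # Allocate 100 patrols in proportion to the arrest history.
    patrols = 100 * arrests / arrests.sum()
    # New arrests can only be generated where officers are actually present.
    arrests += rng.binomial(patrols.astype(int), true_rate)

print(arrests / arrests.sum())  # remains near the biased 2:1 split, not 50/50
```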
Bias can also enter through the labels used to train supervised models. If human annotators apply labels that reflect their own biases (for example, rating resumes from certain demographic groups lower), the model will learn to replicate those biased judgments. This is sometimes called annotation bias or labeling bias.
Several high-profile cases have demonstrated how group attribution bias manifests in deployed AI systems.
In 2018, Reuters reported that Amazon had developed an AI-powered recruiting tool that systematically penalized female job applicants. The system was trained on 10 years of resumes submitted to the company. Because the technology industry has historically been male-dominated, the majority of those resumes came from men. The model learned to favor language patterns more common on male resumes (such as the verbs "executed" and "captured") and penalized resumes containing the word "women's" or the names of women's colleges. Amazon scrapped the project after discovering the bias. This case illustrates Type I group attribution error in an AI context: the model generalized from the historical predominance of male applicants to a preference for male candidates in general.
The Correctional Offender Management Profiling for Alternative Sanctions (COMPAS) algorithm, developed by Northpointe (now Equivant), is used by courts in the United States to predict the likelihood that a defendant will reoffend. In 2016, ProPublica analyzed COMPAS scores for over 10,000 criminal defendants in Broward County, Florida. The investigation found that Black defendants were almost twice as likely as white defendants to be incorrectly classified as high risk (false positives), while white defendants were more likely than Black defendants to be incorrectly classified as low risk (false negatives). After controlling for criminal history, age, and gender, Black defendants were 45% more likely to be assigned a higher risk score for any future crime and 77% more likely to receive a higher score for future violent crime. The COMPAS case highlights how group-level statistical associations in historical criminal justice data can lead an algorithm to systematically assign higher risk to members of a particular racial group.
In 2019, Ziad Obermeyer, Brian Powers, Christine Vogeli, and Sendhil Mullainathan published a study in Science examining a widely used algorithm that guided healthcare decisions for approximately 200 million patients per year in the United States. The algorithm was designed to identify patients who would benefit from additional care, but it used healthcare costs as a proxy for health needs. Because Black patients historically had less access to healthcare and therefore lower healthcare expenditures, the algorithm systematically assigned lower risk scores to Black patients even when they were equally or more sick than white patients. At a given risk score, Black patients had 26% more chronic conditions than white patients. The researchers estimated that fixing the bias would increase the proportion of Black patients flagged for extra care from 17.7% to 46.5%. This case demonstrates how proxy variables can encode group-level disparities and produce discriminatory outcomes.
In 2018, Joy Buolamwini and Timnit Gebru published the Gender Shades study, which evaluated three commercial facial recognition systems for accuracy across different demographic groups. They found that the systems performed worst on darker-skinned women, with error rates up to 34.7%, compared to a maximum error rate of just 0.8% for lighter-skinned men. The study also found that widely used facial analysis benchmarks were overwhelmingly composed of lighter-skinned subjects (79.6% for IJB-A and 86.2% for Adience). The underrepresentation of certain groups in training data led the models to generalize poorly for those groups, a direct manifestation of group attribution bias in computer vision.
| Case study | Domain | Year | Key finding | Group affected |
|---|---|---|---|---|
| Amazon recruiting tool | Hiring | 2018 | Penalized female applicants due to male-dominated training data | Women |
| COMPAS algorithm | Criminal justice | 2016 | Black defendants nearly 2x more likely to be falsely labeled high risk | Black defendants |
| Healthcare allocation algorithm | Healthcare | 2019 | Used cost as a proxy for health, underestimating Black patients' needs | Black patients |
| Gender Shades | Facial recognition | 2018 | Error rates up to 34.7% for darker-skinned women vs. 0.8% for lighter-skinned men | Women with darker skin |
Two specific manifestations of group attribution bias are particularly relevant to machine learning: in-group bias and out-group homogeneity bias.
In-group bias (also called in-group favoritism) is the tendency to favor members of one's own group. In machine learning, this can manifest when the people building the system unconsciously design it to perform better for populations similar to themselves. For example, Google's ML Crash Course on fairness notes that a machine learning practitioner may unconsciously favor job applicants who attended the same university they did. More broadly, when development teams lack diversity, the systems they build may perform better for demographic groups that are well-represented on the team and in the data they collect.
Out-group homogeneity bias is the tendency to perceive members of other groups as more similar to each other than they actually are. In ML, this manifests as models that have lower accuracy or more homogeneous predictions for underrepresented groups. When training data contains fewer examples from minority groups, the model has less information to learn fine-grained distinctions within those groups. The result is that predictions for underrepresented groups tend to be less accurate and more stereotyped, while predictions for well-represented groups are more nuanced and individualized.
This pattern was clearly visible in the Gender Shades study: facial recognition systems had fine-grained accuracy for lighter-skinned men (the most represented group in training data) but treated darker-skinned women as a more homogeneous group, resulting in much higher error rates.
Several quantitative metrics have been developed to detect and measure the effects of group attribution bias in machine learning systems.
| Metric | Definition | What it measures |
|---|---|---|
| Demographic parity | The proportion of positive predictions should be equal across all groups | Whether selection rates differ by group membership |
| Equalized odds | True positive rates and false positive rates should be equal across groups | Whether accuracy differs by group membership |
| Predictive parity | The positive predictive value should be equal across groups | Whether the meaning of a positive prediction differs by group |
| Disparate impact | The ratio of positive prediction rates between groups should exceed a threshold (commonly 80%) | Whether one group receives positive outcomes at a substantially lower rate |
| Counterfactual fairness | A prediction should be the same in a counterfactual world where the individual belonged to a different group | Whether group membership causally affects the prediction |
| Individual fairness | Similar individuals should receive similar predictions regardless of group membership | Whether the model treats comparable individuals consistently |
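As a concrete illustration, the first two metrics in the table can be computed directly from model outputs. The sketch below uses randomly generated stand-in data (`y_true`, `y_pred`, and a hypothetical binary `group` attribute):

```python
import numpy as np

def demographic_parity_gap(y_pred, group):
    # Difference in positive-prediction rates between the two groups.
    return abs(y_pred[group == 0].mean() - y_pred[group == 1].mean())

def equalized_odds_gap(y_true, y_pred, group):
    # Largest between-group difference in TPR (y=1) or FPR (y=0).
    gaps = []
    for y in (1, 0):
        rates = [y_pred[(group == g) & (y_true == y)].mean() for g in (0, 1)]
        gaps.append(abs(rates[0] - rates[1]))
    return max(gaps)

rng = np.random.default_rng(0)
group = rng.integers(0, 2, 1000)
y_true = rng.integers(0, 2, 1000)
y_pred = ((rng.random(1000) + 0.1 * group) > 0.5).astype(int)  # skewed toward group 1
print(demographic_parity_gap(y_pred, group))
print(equalized_odds_gap(y_true, y_pred, group))
```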
An important theoretical result in algorithmic fairness, proven independently by Alexandra Chouldechova (2017) and by Jon Kleinberg, Sendhil Mullainathan, and Manish Raghavan (2016), establishes that it is generally impossible to simultaneously satisfy calibration, balance for the positive class, and balance for the negative class when base rates differ across groups. In practical terms, this means that a system cannot simultaneously achieve predictive parity and equalized odds when the prevalence of the target outcome differs between groups. The COMPAS controversy illustrates this directly: ProPublica emphasized equalized odds (equal false positive and false negative rates across races), while Northpointe emphasized calibration (a given risk score meaning the same thing regardless of race). Both metrics cannot be satisfied at once when recidivism base rates differ.
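The core of the argument fits in one identity. Writing $p$ for a group's base rate, the definitions of FPR, FNR, and PPV imply (this is the relation at the center of Chouldechova's proof):

$$\mathrm{FPR} \;=\; \frac{p}{1-p}\cdot\frac{1-\mathrm{PPV}}{\mathrm{PPV}}\cdot(1-\mathrm{FNR})$$

If two groups share the same PPV (predictive parity) and the same FPR and FNR (equalized odds), the identity forces them to share the same base rate $p$; when base rates differ, at least one of the three quantities must differ as well.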
This impossibility result means that practitioners must make explicit choices about which fairness criteria to prioritize, and those choices have real consequences for different groups.
Researchers and practitioners have developed a range of techniques to detect and reduce group attribution bias in machine learning systems. These techniques are typically categorized by the stage of the ML pipeline at which they intervene.
Preprocessing methods modify the training data before it is used to train a model. Common examples include reweighing training instances so that outcomes are balanced across groups, resampling underrepresented groups, and transforming features to reduce their correlation with protected attributes.

In-processing methods modify the learning algorithm itself to incorporate fairness constraints during training, for example by adding a fairness penalty to the loss function or through adversarial debiasing, in which the model is trained so that an adversary cannot recover the protected attribute from its predictions.

Post-processing methods adjust the model's predictions after training, without modifying the model itself. A common approach is to choose group-specific decision thresholds that equalize error rates across groups, as in the sketch following the table below.
| Approach | Stage | Advantages | Limitations |
|---|---|---|---|
| Preprocessing | Before training | Model-agnostic; works with any algorithm | May oversimplify bias structures in the data |
| In-processing | During training | Can achieve better fairness-performance trade-offs | Requires access to and modification of the training algorithm |
| Post-processing | After training | Works with black-box models; no retraining needed | Cannot address bias in the model's internal representations |
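The following sketch illustrates the post-processing row of the table: given held-out scores, choose a per-group decision threshold so that both groups reach the same true positive rate, in the spirit of Hardt et al.'s post-processing method (one component of equalized odds). All data is synthetic and the threshold search is deliberately naive.

```python
import numpy as np

def tpr_at(scores, y_true, thresh):
    # True positive rate of the thresholded scores.
    return (scores >= thresh)[y_true == 1].mean()

def threshold_for_tpr(scores, y_true, target_tpr):
    # Scan thresholds from high to low; return the strictest one hitting the target.
    for t in np.sort(np.unique(scores))[::-1]:
        if tpr_at(scores, y_true, t) >= target_tpr:
            return t
    return scores.min()

rng = np.random.default_rng(0)
n = 5_000
group = rng.integers(0, 2, n)
y_true = rng.integers(0, 2, n)
# Synthetic scores: group 1's scores are systematically shifted downward.
scores = np.clip(0.5 * y_true + rng.normal(0.3 - 0.2 * group, 0.3, n), 0, 1)

target = 0.8
thresholds = {g: threshold_for_tpr(scores[group == g], y_true[group == g], target)
              for g in (0, 1)}
print(thresholds)  # group 1 is assigned a lower threshold to reach the same TPR
```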
Several open-source tools have been developed to help practitioners detect and mitigate group attribution bias.
AI Fairness 360 (AIF360): Developed by IBM Research and later donated to the Linux Foundation AI and Data, AIF360 is a comprehensive toolkit that includes over 70 fairness metrics and 12 bias mitigation algorithms spanning preprocessing, in-processing, and post-processing stages. It is available in both Python and R.
Aequitas: Developed by the Center for Data Science and Public Policy at the University of Chicago, Aequitas is an open-source bias auditing toolkit that allows users to upload data, select protected groups and fairness metrics, and generate audit reports. Its primary purpose is bias detection and reporting rather than mitigation.
Fairlearn: Developed by Microsoft, Fairlearn provides algorithms for mitigating unfairness in AI systems and metrics for assessing fairness. It integrates with scikit-learn and supports both group fairness and individual fairness assessments.
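As an example of how such a toolkit is used, here is a minimal audit sketch based on Fairlearn's documented `MetricFrame` interface (the data is randomly generated; in practice `y_pred` would come from a trained model):

```python
import numpy as np
from sklearn.metrics import accuracy_score
from fairlearn.metrics import MetricFrame, selection_rate

rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, 1000)
y_pred = rng.integers(0, 2, 1000)
sex = rng.choice(["female", "male"], 1000)

mf = MetricFrame(
    metrics={"accuracy": accuracy_score, "selection_rate": selection_rate},
    y_true=y_true, y_pred=y_pred, sensitive_features=sex,
)
print(mf.by_group)      # each metric broken down by group
print(mf.difference())  # largest between-group gap for each metric
```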
What-If Tool: Developed by Google, this tool provides a visual interface for exploring the behavior of machine learning models across different subgroups, making it easier to identify and understand group-level disparities.
Governments and international organizations have begun to address group attribution bias in AI through regulation and policy.
The European Union's AI Act, which entered into force in 2024, establishes requirements for high-risk AI systems that directly address group attribution bias. Article 10 requires providers of high-risk AI systems to examine training, validation, and testing datasets for possible biases that could lead to discrimination. The Act mandates bias monitoring, detection, and correction, and even permits the processing of special categories of personal data (such as race and ethnicity) when strictly necessary for bias detection. High-risk categories include AI used in employment, credit scoring, law enforcement, and access to public services.
The United States has taken a more sector-specific approach. The Equal Employment Opportunity Commission (EEOC) has applied existing anti-discrimination laws to AI hiring tools. The Federal Trade Commission (FTC) has issued guidance on algorithmic fairness. In 2023, New York City implemented Local Law 144, requiring employers to conduct annual bias audits of automated employment decision tools.
In 2021, UNESCO adopted a Recommendation on the Ethics of Artificial Intelligence, endorsed by 193 member states. The recommendation calls for AI systems to be designed to avoid reproducing or amplifying existing biases and discrimination, and for regular auditing of AI systems for fairness.
Group attribution bias intersects with several other areas of AI fairness research.
Bias often compounds at the intersection of multiple group identities. The Gender Shades study demonstrated this clearly: while facial recognition error rates were elevated for both women and for darker-skinned individuals, the worst performance was for darker-skinned women, a group defined by the intersection of race and gender. Research by Crenshaw (1989) on intersectionality in legal theory has influenced how AI fairness researchers think about compounding disadvantages.
Group attribution bias is closely related to sampling bias and coverage bias. When certain groups are underrepresented in training data, models have less information to make accurate predictions for those groups. This underrepresentation itself can be a form of group-level bias, as it reflects historical patterns of exclusion from data collection efforts.
Research has shown that large language models (LLMs) can reproduce and amplify group-based stereotypes from their training data. Even models that pass explicit bias tests may harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases on implicit measures. Studies have found that LLMs associate certain character traits with specific racial or gender groups (for example, associating the term "resilient" with Black women and "petite" with Asian women), reproducing group-level stereotypes in their outputs.
When AI systems with group attribution bias are deployed in decision-making contexts (such as lending, hiring, or criminal justice), their biased outputs can influence the real world in ways that generate new data confirming the original bias. A lending algorithm that denies loans to members of a certain group creates data showing that group members do not repay loans (because they were never given the opportunity), reinforcing the bias in future model updates. This creates a cycle of compounding disadvantage.
Despite progress in detecting and mitigating group attribution bias, several challenges remain.
First, the impossibility theorem means that no single fairness metric can capture all dimensions of fairness simultaneously. Choosing one metric inherently involves trade-offs with others, and these trade-offs have distributional consequences.
Second, most fairness metrics are defined for binary or categorical protected attributes and binary outcomes. Real-world identities are multidimensional and continuous, and outcomes often exist on a spectrum.
Third, bias mitigation techniques can reduce measurable bias but may not eliminate the deeper structural inequalities that produce biased data in the first place. Technical solutions alone cannot solve problems rooted in social and economic systems.
Fourth, there is an ongoing tension between group fairness (treating groups equally in aggregate) and individual fairness (treating similar individuals similarly). These two goals can conflict, and neither fully captures what most people mean by "fairness."