What Is a Stereotype?
A stereotype is a generalisation from a group average to an individual member — an inference about what a specific person is like based on a characteristic of the group they belong to. This page explains why that inference is usually statistically unjustified and legally problematic.
Two types of generalisation
Not all generalisations are equally problematic. The distinction that matters is between a statistical fact about a group and a prediction about an individual.
Statistical fact (valid)
"On average, men in England are 13.5 cm taller than women."
This is a verifiable empirical statement about group averages. It is relevant for decisions genuinely calibrated to a group average — for example, setting the default height of wash basins, mirrors, and worktops in sex-segregated facilities, or defining population health reference ranges for clinical screening.
Individual prediction (invalid)
"This person is a woman, therefore she is shorter than the average man."
This is an inference about an individual based on group membership. Because height distributions overlap substantially, it will be wrong for a meaningful proportion of individuals.
Treating an individual as if they possess a characteristic of the average member of a group to which they belong, without evidence about the individual themselves. Stereotyping conflates descriptive statistics (what the group looks like on average) with predictive facts about a particular person.
Where stereotypes come from
Stereotypes are not always invented from nothing. Many are grounded in real, measurable group differences — differences in average height, in average time spent on unpaid care, in representation across occupational grades. The problem is not that the group-level data is false. The problem is what happens when that data is applied to individuals:
The central statistical argument
For the vast majority of traits relevant to employment, service delivery, and social policy, within-group variation is substantially larger than between-group differences. This means group membership carries very little predictive power for any individual — even when a real group difference exists.
The Overlap Principle
The interactive charts below show how even large group differences produce substantially overlapping distributions. Use the sliders to explore how the degree of overlap changes as the gap between the two group averages increases or decreases.
Reading the charts: overlap and effect size explained
What "overlap" means here: The darker shaded region where the two curves meet marks the range of values shared by both groups — the zone where you cannot reliably tell from this characteristic alone which group someone belongs to. Anyone whose value falls in that zone could plausibly belong to either group. The overlap percentage shown in the statistics below each chart is the share of the combined distribution area that sits in that shared zone. An overlap of 60% means that for 60% of the joint area, group membership gives you no reliable information about the individual's value.
What "effect size (d)" means: Effect size measures how far apart the two group averages are, expressed as a multiple of the typical spread of individual scores within each group. If most people in each group vary by about 7 cm from their group's average (mean), and the two averages are 14 cm apart, the effect size is 2.0. As a rough guide: below 0.2 is negligible; 0.2 is small; 0.5 is medium; 0.8 is large. Height — at around 2.0 — is one of the largest differences ever reliably measured between social groups. For most characteristics relevant to employment and service decisions, the effect size is considerably smaller and the overlap considerably greater.
overlap
(effect size)
male average
female average
overlap
(effect size)
female average
male average
The key insight from both examples
Even for height — one of the largest and most reliable sex differences in any human trait — around 32% of the combined distribution area overlaps. For characteristics more relevant to employment decisions (commitment, performance, availability, ambition), effect sizes are typically much smaller and overlap much greater.
When you make a decision about an individual based on a group average, you are working with weak information at best. The larger the within-group variance relative to the between-group difference, the weaker that information is — and individual-level evidence about the actual person will almost always be more reliable.
From Statistics to Discrimination
When a stereotype — a group-level generalisation — is used to make a decision about an individual, it discriminates against the majority of members of that group who do not conform to the stereotype. The interactive chart below uses job performance and age as a case study.
What the research actually shows: age and job performance
Age-related stereotypes in the workplace are among the most persistent and consequential. The stereotype: older workers are less productive, less adaptable, and less committed. The evidence tells a very different story.
Ng & Feldman's (2008) meta-analysis — drawing on 380 studies covering over 400,000 workers — found that age correlates near-zero with core task performance (r = −0.01). The correlations with overall performance, organisational citizenship behaviour, and creative performance were similarly negligible. Older workers showed lower rates of counterproductive behaviour and absenteeism than younger workers.
performance (r)
behaviour (r)
productive behaviour (r)
meta-analysis
A correlation of −0.01 is, for practical purposes, zero. Individual variation in job performance within any age group dwarfs any age-related average difference.
25–34 vs 55–64
25–34 vs 55–64
per decade (SD)
25–34 average
The discrimination mechanism
When an organisation uses age as a proxy for capability or productivity, it treats every individual in the older age group as if they are the (stereotyped) average — excluding people who are above average and would have performed well. This is not a hypothetical: if 47% of workers aged 55–64 outperform the average 25–34 year old, any age-based exclusion is wrong about nearly half the people it affects.
The precision problem
The statistical argument against stereotyping is ultimately about the precision of inference. Group membership is a low-precision predictor for almost all individual outcomes. Using it as the basis for decisions — rather than individual-level information — introduces systematic error.
Better predictors almost always exist: past performance records, skills assessments, structured interviews, work samples. When individual-level evidence is available, falling back on group membership as a shortcut is not just statistically indefensible — it is the definition of prejudice.
Indirect Discrimination
A stereotype embedded in a policy, rule, or practice — rather than a one-off individual act — can give rise to a claim of indirect discrimination. There are two broad patterns: a neutral rule that has unequal effects on a protected group, and a rule that explicitly uses a protected characteristic (or a close proxy for it) as its criterion. In both cases, the organisation may be able to avoid liability if it can show the PCP is a proportionate means of achieving a legitimate aim — the justification defence, explored in the third card below.
A person (A) discriminates against another (B) if A applies to B a provision, criterion or practice (PCP) which is applied equally to persons not sharing B's protected characteristic but which puts or would put persons with that characteristic at a particular disadvantage compared with persons not sharing it — unless A can show it is a proportionate means of achieving a legitimate aim.
Neutral rules with unequal effects
The most common form of indirect discrimination involves a rule or requirement that says nothing about any protected characteristic on its face, but whose practical effect falls unequally across groups. The organisation did not set out to disadvantage anyone — but because the rule's implicit assumptions do not fit all groups equally, some groups are put at a particular disadvantage.
The distribution data from Example 2 illustrates the mechanism directly: because women on average carry more unpaid care than men, any requirement that clashes with significant caring commitments will exclude women at higher rates — not by intent, but as a consequence of the underlying difference in averages.
| Policy or rule (PCP) | Protected characteristic | Why it causes unequal effects |
|---|---|---|
| Requires full-time hours | Sex | Women carry the majority of unpaid care in aggregate (ONS data). A blanket full-time requirement excludes those who need part-time work at higher rates — even though the rule itself makes no reference to sex. |
| Requires a continuous employment history with no gaps | Sex / Pregnancy & maternity | Career gaps caused by maternity leave or childcare fall disproportionately on women. A requirement for an unbroken work history may be impossible to meet without penalising the reason for the gap. |
| Mandatory early-morning or Friday-evening shifts | Religion or belief | Shift patterns that conflict with Sabbath or other religious observance put certain groups at a particular disadvantage — even when the scheduling rule applies to all staff equally. |
These rules are facially neutral. The indirect discrimination arises from their application to a population that does not conform uniformly to the rule's implicit assumptions.
When the characteristic itself becomes the criterion
A more direct form arises when the rule explicitly uses a protected characteristic — or a characteristic so closely correlated with a protected group that it functions as a stand-in. Here the link to stereotyping from Section 3 is most visible: the policy treats every member of a group as if they share the group average, when many do not.
| Policy or rule (PCP) | Protected characteristic | Why justification is harder |
|---|---|---|
| Maximum recruitment age (e.g. no applications accepted from anyone over 50) | Age | Age is the explicit criterion. As Example 3 shows, age correlates near-zero with job performance — so using age as a cut-off excludes many people who would have performed well. The justification defence is very difficult to sustain. |
| Minimum height requirement above genuine operational need | Sex | Height is strongly sex-linked (d ≈ 2.0, Example 1). A blanket height threshold functions as a near-proxy for sex. Unless the threshold is genuinely operationally necessary, it amounts to indirect sex discrimination. |
| Physical fitness standard set above the genuine operational requirement | Sex / Age / Disability | Women on average have lower aerobic capacity than men; older workers and disabled people also perform differently on physical tests. If the standard exceeds what the role genuinely requires, the surplus requirement cannot be justified. See the justification defence below for when a fitness standard is lawfully justifiable. |
The two-part justification test
A PCP that causes group disadvantage is not automatically unlawful. The organisation can justify it by satisfying both elements:
1. Legitimate aim: The PCP pursues a real, concrete organisational need — not one that is itself discriminatory. Cost saving alone is generally insufficient as a justification. Genuine operational effectiveness, health and safety, and service quality are typically acceptable aims.
2. Proportionate means: The PCP is no more discriminatory than necessary to achieve that aim. If a less discriminatory alternative achieves the same result, the more discriminatory option cannot be proportionate. This is where the statistical argument from Sections 2 and 3 is most directly applicable: where a characteristic is a poor predictor of the operational need, an individual-level assessment will almost always be a less discriminatory alternative — and proportionality requires it.
The Justification Defence — when disparate impact is lawful
Not every policy with a disparate impact is unlawful. Where a requirement reflects a genuine operational necessity that cannot be met by less discriminatory means, the justification defence can succeed. The following examples from policing illustrate this.
The principle in both cases is the same: the requirement must be genuinely necessary for the role, set at the minimum level needed, and applied to the individual through assessment — not used as a shortcut to exclude groups whose members are, on average, less likely to meet it.
Showing particular disadvantage
Indirect discrimination requires showing that the PCP puts the protected group at a "particular disadvantage" — not just that a single individual is affected. In practice this is established by statistics, or where statistics are unavailable, by evidence that the PCP is "inherently likely" to have a disproportionate effect.
The overlap distributions in Section 2 demonstrate directly how this disadvantage arises: a rule built on a group average will catch many members of the disadvantaged group who do not conform to that average — exactly the proportion represented by the overlap region. The overlap percentage is, in effect, a measure of the rule's error rate when applied to individuals.
Positive Action vs Stereotyping
Positive action and stereotyping are sometimes confused — both involve taking protected characteristics into account. They differ fundamentally in their basis, their target, and their legality.
The three conditions for lawful positive action (s.158)
Positive action is permitted where the organisation reasonably thinks that:
(a) Disadvantage: Persons sharing the protected characteristic suffer a disadvantage connected to that characteristic; or
(b) Under-representation: Persons sharing the characteristic have needs different from those without it, or participation in an activity by persons sharing the characteristic is disproportionately low; and
(c) Proportionality: The positive action taken is a proportionate means of achieving the aim of enabling or encouraging persons sharing the characteristic to overcome or minimise the disadvantage, or to participate in the activity.
Practical illustration: recruitment
Unlawful stereotyping
"We will not shortlist candidates over 50 because older workers are less adaptable to our technology." This assumes an individual characteristic (low adaptability) from group membership (age), without evidence. It causes direct age discrimination and relies on a stereotype unsupported by the research on age and performance.
Lawful positive action
"Our workforce data shows that workers over 50 are under-represented at senior grades relative to the workforce as a whole. We will include targeted outreach in our senior recruitment campaign to encourage applications from older workers." This responds to documented evidence of under-representation, does not impose quotas, and addresses the barrier rather than assuming individual characteristics.
Key Takeaways
What practitioners need to know when evaluating whether a policy, rule, or practice relies on stereotyping — and when group-level data can legitimately inform action.
Within-group variation almost always exceeds between-group differences
For virtually every characteristic relevant to employment and service delivery, individuals within any group vary more than the groups differ from each other. Group membership is therefore a weak predictor of individual characteristics — even when a real group difference exists.
Even large effects produce substantial overlap
Height — one of the largest sex differences in any human trait — produces distributions that overlap by around 32%. For job performance, commitment, adaptability, and most other employment-relevant traits, effect sizes are far smaller and overlap is far greater.
Group-level statistics cannot justify individual treatment
Applying a group average to an individual is statistically unjustified whenever within-group variance is large. Where individual-level evidence is available, it will almost always be a more reliable basis for decisions than group membership.
Indirect discrimination does not require intent
A policy built on a stereotype can be indirectly discriminatory even if no individual act of prejudice was intended. The test is effect: does the PCP put a protected group at a particular disadvantage? Policies justified by group stereotypes often fail the proportionality test.
The justification defence requires evidence
To justify a PCP that causes group disadvantage, organisations must show the aim is legitimate and the means are proportionate. Where the stereotyped characteristic is a poor predictor of the operational need, less discriminatory alternatives will usually exist — and will be required by the proportionality test.
Positive action and stereotyping are opposites
Positive action is grounded in evidence of actual group disadvantage and addresses systemic barriers. Stereotyping infers individual characteristics from group averages. One responds to real data; the other substitutes assumed characteristics for individual assessment.
References
Ng, T. W. H., & Feldman, D. C. (2008). The relationship of age to ten dimensions of job performance. Journal of Applied Psychology, 93(2), 392–423.
Office for National Statistics (2016). Changes in the value and division of unpaid care work in the UK: 2000 to 2015. ONS.
NHS Digital (2020). Health Survey for England 2019. NHS.
Equality Act 2010, ss. 19, 158, 159. legislation.gov.uk.
Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences (2nd ed.). Lawrence Erlbaum Associates.
Distribution overlap coefficients computed by numerical integration of overlapping normal PDFs. Cohen's d computed using pooled standard deviation. Statistical illustrations use normal approximations; real-world distributions are often right-skewed.