Equal accuracy for Andrew and Abubakar—detecting and mitigating bias in name-ethnicity classification algorithms

AI and Society:1-25 (forthcoming)

Abstract

Uncovering the world’s ethnic inequalities is hampered by a lack of ethnicity-annotated datasets. Name-ethnicity classifiers (NECs) can help, as they infer people’s ethnicities from their names. However, since the latest generation of NECs relies on machine learning and artificial intelligence (AI), they may suffer from the same racist and sexist biases found in many AI systems. Therefore, this paper offers an algorithmic fairness audit of three NECs. It finds that the UK-Census-trained EthnicityEstimator displays large accuracy biases with regard to ethnicity, but relatively smaller ones across gender and age groups. In contrast, the Twitter-trained NamePrism and the Wikipedia-trained Ethnicolr are more balanced across ethnicities, but less so across gender and age groups. We relate these biases to global power structures manifested in naming conventions and in the distribution of names in the NECs’ input data. To mitigate the uncovered biases, we program a novel NEC, N2E, using fairness-aware AI techniques. We make N2E freely available at www.name-to-ethnicity.com.


Similar books and articles

Call for papers.[author unknown] - 2018 - AI and Society 33 (3):453-455.
Call for papers.[author unknown] - 2018 - AI and Society 33 (3):457-458.
A Look into Modern Working Life.Lena Skiöld - 2002 - AI and Society 16 (1-2):166-167.
The scientist of the scientist.Tomer Simon - 2024 - AI and Society 39 (2):803-804.

Analytics

Added to PP
2023-02-10


