Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition. Mehrabi, N., \textbfGowda, Thamme, Morstatter, F., Peng, N., & Galstyan, A. In Proceedings of the 31st ACM Conference on Hypertext and Social Media, of HT '20, pages 231–232, New York, NY, USA, 2020. Association for Computing Machinery.
Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition [link]Paper  doi  abstract   bibtex   
In this paper, we study the bias in named entity recognition (NER) models—specifically, the difference in the ability to recognize male and female names as PERSON entity types. We evaluate NER models on a dataset containing 139 years of U.S. census baby names and find that relatively more female names, as opposed to male names, are not recognized as PERSON entities. The result of this analysis yields a new benchmark for gender bias evaluation in named entity recognition systems. The data and code for the application of this benchmark is publicly available for researchers to use.
@inproceedings{mehrabi2020NERbias,

author = {Mehrabi, Ninareh and \textbf{Gowda, Thamme} and Morstatter, Fred and Peng, Nanyun and Galstyan, Aram},
title = {Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition},
year = {2020},
isbn = {9781450370981},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3372923.3404804},
doi = {10.1145/3372923.3404804},
abstract = {In this paper, we study the bias in named entity recognition (NER) models---specifically, the difference in the ability to recognize male and female names as PERSON entity types. We evaluate NER models on a dataset containing 139 years of U.S. census baby names and find that relatively more female names, as opposed to male names, are not recognized as PERSON entities. The result of this analysis yields a new benchmark for gender bias evaluation in named entity recognition systems. The data and code for the application of this benchmark is publicly available for researchers to use.},
booktitle = {Proceedings of the 31st ACM Conference on Hypertext and Social Media},
pages = {231–232},
numpages = {2},
keywords = {algorithmic fairness, named entity recognition, evaluation, natural language processing},
location = {Virtual Event, USA},
series = {HT '20},
}

Downloads: 0