A Domain-Adaptive Pre-Training Approach for Language Bias Detection in News. Krieger, J.-D., Spinde, T., Ruas, T., Kulshrestha, J., & Gipp, B. In Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL '22), Cologne, Germany, June 2022. Association for Computing Machinery, New York, NY, USA. 7 pages, article no. 3.
@inproceedings{krieger_domain-adaptive_2022,
	address = {New York, NY, USA},
	series = {{JCDL} '22},
	title = {A {Domain}-{Adaptive} {Pre}-{Training} {Approach} for {Language} {Bias} {Detection} in {News}},
	isbn = {978-1-4503-9345-4},
	url = {https://doi.org/10.1145/3529372.3530932},
	doi = {10.1145/3529372.3530932},
	abstract = {Media bias is a multi-faceted construct influencing individual behavior and collective decision-making. Slanted news reporting is the result of one-sided and polarized writing, which can occur in various forms. In this work, we focus on an important form of media bias, i.e., bias by word choice. Detecting biased word choices is a challenging task due to its linguistic complexity and the lack of representative gold-standard corpora. We present DA-RoBERTa, a new state-of-the-art transformer-based model adapted to the media bias domain, which identifies sentence-level bias with an F1 score of 0.814. In addition, we also train DA-BERT and DA-BART, two more transformer models adapted to the bias domain. Our proposed domain-adapted models outperform prior bias detection approaches on the same data.},
	booktitle = {Proceedings of the 22nd {ACM}/{IEEE} {Joint} {Conference} on {Digital} {Libraries}},
	publisher = {Association for Computing Machinery},
	author = {Krieger, Jan-David and Spinde, Timo and Ruas, Terry and Kulshrestha, Juhi and Gipp, Bela},
	month = jun,
	year = {2022},
	numpages = {7},
	location = {Cologne, Germany},
	articleno = {3},
	keywords = {domain adaptive, media bias, neural classification, news slant, text analysis},
}
