Inferring Atmospheric Particulate Matter Concentrations from Chinese Social Media Data. Tao, Z., Kokas, A., Zhang, R., Cohan, D. S., & Wallach, D. Plos One, 11(9):e0161389, September, 2016. WOS:000383892100005doi abstract bibtex Although studies have increasingly linked air pollution to specific health outcomes, less well understood is how public perceptions of air quality respond to changing pollutant levels. The growing availability of air pollution measurements and the proliferation of social media provide an opportunity to gauge public discussion of air quality conditions. In this paper, we consider particulate matter (PM) measurements from four Chinese megacities (Beijing, Shanghai, Guangzhou, and Chengdu) together with 112 million posts on Weibo (a popular Chinese microblogging system) from corresponding days in 2011-2013 to identify terms whose frequency was most correlated with PM levels. These correlations are used to construct an Air Discussion Index (ADI) for estimating daily PM based on the content of Weibo posts. In Beijing, the Chinese city with the most PM as measured by U.S. Embassy monitor stations, we found a strong correlation (R = 0.88) between the ADI and measured PM. In other Chinese cities with lower pollution levels, the correlation was weaker. Nonetheless, our results show that social media may be a useful proxy measurement for pollution, particularly when traditional measurement stations are unavailable, censored or misreported.
@article{tao_inferring_2016,
title = {Inferring {Atmospheric} {Particulate} {Matter} {Concentrations} from {Chinese} {Social} {Media} {Data}},
volume = {11},
issn = {1932-6203},
doi = {10.1371/journal.pone.0161389},
abstract = {Although studies have increasingly linked air pollution to specific health outcomes, less well understood is how public perceptions of air quality respond to changing pollutant levels. The growing availability of air pollution measurements and the proliferation of social media provide an opportunity to gauge public discussion of air quality conditions. In this paper, we consider particulate matter (PM) measurements from four Chinese megacities (Beijing, Shanghai, Guangzhou, and Chengdu) together with 112 million posts on Weibo (a popular Chinese microblogging system) from corresponding days in 2011-2013 to identify terms whose frequency was most correlated with PM levels. These correlations are used to construct an Air Discussion Index (ADI) for estimating daily PM based on the content of Weibo posts. In Beijing, the Chinese city with the most PM as measured by U.S. Embassy monitor stations, we found a strong correlation (R = 0.88) between the ADI and measured PM. In other Chinese cities with lower pollution levels, the correlation was weaker. Nonetheless, our results show that social media may be a useful proxy measurement for pollution, particularly when traditional measurement stations are unavailable, censored or misreported.},
language = {English},
number = {9},
journal = {Plos One},
author = {Tao, Zhu and Kokas, Aynne and Zhang, Rui and Cohan, Daniel S. and Wallach, Dan},
month = sep,
year = {2016},
note = {WOS:000383892100005},
keywords = {社交媒体},
pages = {e0161389},
}
Downloads: 0
{"_id":"fDGeppJjsWLP6HiBh","bibbaseid":"tao-kokas-zhang-cohan-wallach-inferringatmosphericparticulatematterconcentrationsfromchinesesocialmediadata-2016","author_short":["Tao, Z.","Kokas, A.","Zhang, R.","Cohan, D. S.","Wallach, D."],"bibdata":{"bibtype":"article","type":"article","title":"Inferring Atmospheric Particulate Matter Concentrations from Chinese Social Media Data","volume":"11","issn":"1932-6203","doi":"10.1371/journal.pone.0161389","abstract":"Although studies have increasingly linked air pollution to specific health outcomes, less well understood is how public perceptions of air quality respond to changing pollutant levels. The growing availability of air pollution measurements and the proliferation of social media provide an opportunity to gauge public discussion of air quality conditions. In this paper, we consider particulate matter (PM) measurements from four Chinese megacities (Beijing, Shanghai, Guangzhou, and Chengdu) together with 112 million posts on Weibo (a popular Chinese microblogging system) from corresponding days in 2011-2013 to identify terms whose frequency was most correlated with PM levels. These correlations are used to construct an Air Discussion Index (ADI) for estimating daily PM based on the content of Weibo posts. In Beijing, the Chinese city with the most PM as measured by U.S. Embassy monitor stations, we found a strong correlation (R = 0.88) between the ADI and measured PM. In other Chinese cities with lower pollution levels, the correlation was weaker. Nonetheless, our results show that social media may be a useful proxy measurement for pollution, particularly when traditional measurement stations are unavailable, censored or misreported.","language":"English","number":"9","journal":"Plos One","author":[{"propositions":[],"lastnames":["Tao"],"firstnames":["Zhu"],"suffixes":[]},{"propositions":[],"lastnames":["Kokas"],"firstnames":["Aynne"],"suffixes":[]},{"propositions":[],"lastnames":["Zhang"],"firstnames":["Rui"],"suffixes":[]},{"propositions":[],"lastnames":["Cohan"],"firstnames":["Daniel","S."],"suffixes":[]},{"propositions":[],"lastnames":["Wallach"],"firstnames":["Dan"],"suffixes":[]}],"month":"September","year":"2016","note":"WOS:000383892100005","keywords":"社交媒体","pages":"e0161389","bibtex":"@article{tao_inferring_2016,\n\ttitle = {Inferring {Atmospheric} {Particulate} {Matter} {Concentrations} from {Chinese} {Social} {Media} {Data}},\n\tvolume = {11},\n\tissn = {1932-6203},\n\tdoi = {10.1371/journal.pone.0161389},\n\tabstract = {Although studies have increasingly linked air pollution to specific health outcomes, less well understood is how public perceptions of air quality respond to changing pollutant levels. The growing availability of air pollution measurements and the proliferation of social media provide an opportunity to gauge public discussion of air quality conditions. In this paper, we consider particulate matter (PM) measurements from four Chinese megacities (Beijing, Shanghai, Guangzhou, and Chengdu) together with 112 million posts on Weibo (a popular Chinese microblogging system) from corresponding days in 2011-2013 to identify terms whose frequency was most correlated with PM levels. These correlations are used to construct an Air Discussion Index (ADI) for estimating daily PM based on the content of Weibo posts. In Beijing, the Chinese city with the most PM as measured by U.S. Embassy monitor stations, we found a strong correlation (R = 0.88) between the ADI and measured PM. In other Chinese cities with lower pollution levels, the correlation was weaker. Nonetheless, our results show that social media may be a useful proxy measurement for pollution, particularly when traditional measurement stations are unavailable, censored or misreported.},\n\tlanguage = {English},\n\tnumber = {9},\n\tjournal = {Plos One},\n\tauthor = {Tao, Zhu and Kokas, Aynne and Zhang, Rui and Cohan, Daniel S. and Wallach, Dan},\n\tmonth = sep,\n\tyear = {2016},\n\tnote = {WOS:000383892100005},\n\tkeywords = {社交媒体},\n\tpages = {e0161389},\n}\n\n","author_short":["Tao, Z.","Kokas, A.","Zhang, R.","Cohan, D. S.","Wallach, D."],"key":"tao_inferring_2016","id":"tao_inferring_2016","bibbaseid":"tao-kokas-zhang-cohan-wallach-inferringatmosphericparticulatematterconcentrationsfromchinesesocialmediadata-2016","role":"author","urls":{},"keyword":["社交媒体"],"metadata":{"authorlinks":{}},"html":""},"bibtype":"article","biburl":"https://bibbase.org/zotero/wybert","dataSources":["TJkbwzD8s2wCxBy6Y"],"keywords":["社交媒体"],"search_terms":["inferring","atmospheric","particulate","matter","concentrations","chinese","social","media","data","tao","kokas","zhang","cohan","wallach"],"title":"Inferring Atmospheric Particulate Matter Concentrations from Chinese Social Media Data","year":2016}