Trumping the Polls: Event Analysis During the 2016 Presidential Election. Shaban, T. 2017. Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2017.
Trumping the Polls: Event Analysis During the 2016 Presidential Election [link]Paper  Trumping the Polls: Event Analysis During the 2016 Presidential Election [link]Slides  abstract   bibtex   
This thesis introduces a subtask of entity linking, called character identification, that maps mentions in multiparty conversation to their referent characters. Transcripts of TV shows are collected as the sources of our corpus and automatically annotated with mentions by linguistically-motivated rules. These mentions are manually linked to their referents and disambiguate with abstract referent labels through crowdsourcing. Our annotated corpus comprises 448 scenes from 2 seasons and 46 episodes of the TV show Friends, and shows the inter-annotator agreement of kappa = 79.96. For statistical modeling, this task is reformulated as coreference resolution, and experimented with two state-of-the-art systems on our corpus. A novel mention-to-mention ranking model is proposed to provides better mention and mention-pair representations learned from feature groupings of dialogue-specific features After linking coreferent clusters to their referent entity with our proposed rule-based remap- ping algorithm, the best model gives a purity score of 57.27% on average, which is promising given the challenging nature of this task and our corpus.
@jurthesis{shaban:17b,
	abstract = {This thesis introduces a subtask of entity linking, called character identification, that maps mentions in multiparty conversation to their referent characters. Transcripts of TV shows are collected as the sources of our corpus and automatically annotated with mentions by linguistically-motivated rules. These mentions are manually linked to their referents and disambiguate with abstract referent labels through crowdsourcing. Our annotated corpus comprises 448 scenes from 2 seasons and 46 episodes of the TV show Friends, and shows the inter-annotator agreement of kappa = 79.96. For statistical modeling, this task is reformulated as coreference resolution, and experimented with two state-of-the-art systems on our corpus. A novel mention-to-mention ranking model is proposed to provides better mention and mention-pair representations learned from feature groupings of dialogue-specific features After linking coreferent clusters to their referent entity with our proposed rule-based remap- ping algorithm, the best model gives a purity score of 57.27% on average, which is promising given the challenging nature of this task and our corpus.},
	address = {Atlanta, GA},
	author = {Shaban, Tarrek},
	date-added = {2017-07-13 02:56:51 +0000},
	date-modified = {2019-05-28 14:07:34 -0400},
	keywords = {emorynlp},
	note = {Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2017.},
	school = {Emory University},
	title = {{Trumping the Polls: Event Analysis During the 2016 Presidential Election}},
	url_paper = {https://etd.library.emory.edu/concern/etds/jd472w54s},
	url_slides = {https://www.slideshare.net/jchoi7s/trumping-the-polls-event-analysis-during-the-2016-presidential-election},
	year = {2017},
	Bdsk-Url-1 = {https://etd.library.emory.edu/view/record/pid/emory:pjw0g}}

Downloads: 0