Automatic Personality Prediction with Attention-based Neural Networks

Automatic Personality Prediction with Attention-based Neural Networks. Jiang, H. 2018. Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2018.

Paper

Slides abstract bibtex

Distributional Semantic word representation allows Natural Language Processing systems to extract and model an immense amount of information about a language. This technique maps words into a high dimensional continuous space through the use of a single-layer neural network. This process has allowed for advances in many Natural Language Processing research areas and tasks. These representation models are evaluated with the use of analogy tests, questions of the form ``If a is to a' then b is to what?'' are answered by composing multiple word vectors and searching the vector space. During the neural network training process, each word is examined as a member of its context. Generally, a word's context is considered to be the elements adjacent to it within a sentence. While some work has been conducted examining the effect of expanding this definition, very little exploration has been done in this area. Further, no inquiry has been conducted as to the specific linguistic competencies of these models or whether modifying their contexts impacts the information they extract. In this paper we propose a thorough analysis of the various lexical and grammatical competencies of distributional semantic models. We aim to leverage analogy tests to evaluate the most advanced distributional model across 14 different types of linguistic relationships. With this information we will then be able to investigate as to whether modifying the training context renders any differences in quality across any of these categories. Ideally we will be able to identify approaches to training that increase precision in some specific linguistic categories, which will allow us to investigate whether these improvements can be combined by joining the information used in different training approaches to build a single, improved, model.

@jurthesis{jiang:18a,
	abstract = {Distributional Semantic word representation allows Natural Language Processing systems to extract and model an immense amount of information about a language. This technique maps words into a high dimensional continuous space through the use of a single-layer neural network. This process has allowed for advances in many Natural Language Processing research areas and tasks. These representation models are evaluated with the use of analogy tests, questions of the form ``If a is to a' then b is to what?'' are answered by composing multiple word vectors and searching the vector space.

During the neural network training process, each word is examined as a member of its context. Generally, a word's context is considered to be the elements adjacent to it within a sentence. While some work has been conducted examining the effect of expanding this definition, very little exploration has been done in this area. Further, no inquiry has been conducted as to the specific linguistic competencies of these models or whether modifying their contexts impacts the information they extract.

In this paper we propose a thorough analysis of the various lexical and grammatical competencies of distributional semantic models. We aim to leverage analogy tests to evaluate the most advanced distributional model across 14 different types of linguistic relationships. With this information we will then be able to investigate as to whether modifying the training context renders any differences in quality across any of these categories. Ideally we will be able to identify approaches to training that increase precision in some specific linguistic categories, which will allow us to investigate whether these improvements can be combined by joining the information used in different training approaches to build a single, improved, model.},
	address = {Atlanta, GA},
	author = {Jiang, Hang},
	date-added = {2018-07-10 17:06:27 +0000},
	date-modified = {2019-05-28 14:06:46 -0400},
	keywords = {emorynlp},
	note = {Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2018.},
	school = {Emory University},
	title = {{Automatic Personality Prediction with Attention-based Neural Networks}},
	url_paper = {https://etd.library.emory.edu/concern/etds/rv042t11v},
	url_slides = {https://www.slideshare.net/jchoi7s/automatic-personality-prediction-with-attentionbased-neural-networks},
	year = {2018},
	Bdsk-Url-1 = {https://etd.library.emory.edu/view/record/pid/emory:rj97r}}

Downloads: 0

{"_id":"nG2CvneZGtrTooQJk","bibbaseid":"jiang-automaticpersonalitypredictionwithattentionbasedneuralnetworks-2018","author_short":["Jiang, H."],"bibdata":{"bibtype":"jurthesis","type":"jurthesis","abstract":"Distributional Semantic word representation allows Natural Language Processing systems to extract and model an immense amount of information about a language. This technique maps words into a high dimensional continuous space through the use of a single-layer neural network. This process has allowed for advances in many Natural Language Processing research areas and tasks. These representation models are evaluated with the use of analogy tests, questions of the form ``If a is to a' then b is to what?'' are answered by composing multiple word vectors and searching the vector space. During the neural network training process, each word is examined as a member of its context. Generally, a word's context is considered to be the elements adjacent to it within a sentence. While some work has been conducted examining the effect of expanding this definition, very little exploration has been done in this area. Further, no inquiry has been conducted as to the specific linguistic competencies of these models or whether modifying their contexts impacts the information they extract. In this paper we propose a thorough analysis of the various lexical and grammatical competencies of distributional semantic models. We aim to leverage analogy tests to evaluate the most advanced distributional model across 14 different types of linguistic relationships. With this information we will then be able to investigate as to whether modifying the training context renders any differences in quality across any of these categories. Ideally we will be able to identify approaches to training that increase precision in some specific linguistic categories, which will allow us to investigate whether these improvements can be combined by joining the information used in different training approaches to build a single, improved, model.","address":"Atlanta, GA","author":[{"propositions":[],"lastnames":["Jiang"],"firstnames":["Hang"],"suffixes":[]}],"date-added":"2018-07-10 17:06:27 +0000","date-modified":"2019-05-28 14:06:46 -0400","keywords":"emorynlp","note":"Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2018.","school":"Emory University","title":"Automatic Personality Prediction with Attention-based Neural Networks","url_paper":"https://etd.library.emory.edu/concern/etds/rv042t11v","url_slides":"https://www.slideshare.net/jchoi7s/automatic-personality-prediction-with-attentionbased-neural-networks","year":"2018","bdsk-url-1":"https://etd.library.emory.edu/view/record/pid/emory:rj97r","bibtex":"@jurthesis{jiang:18a,\n\tabstract = {Distributional Semantic word representation allows Natural Language Processing systems to extract and model an immense amount of information about a language. This technique maps words into a high dimensional continuous space through the use of a single-layer neural network. This process has allowed for advances in many Natural Language Processing research areas and tasks. These representation models are evaluated with the use of analogy tests, questions of the form ``If a is to a' then b is to what?'' are answered by composing multiple word vectors and searching the vector space.\n\nDuring the neural network training process, each word is examined as a member of its context. Generally, a word's context is considered to be the elements adjacent to it within a sentence. While some work has been conducted examining the effect of expanding this definition, very little exploration has been done in this area. Further, no inquiry has been conducted as to the specific linguistic competencies of these models or whether modifying their contexts impacts the information they extract.\n\nIn this paper we propose a thorough analysis of the various lexical and grammatical competencies of distributional semantic models. We aim to leverage analogy tests to evaluate the most advanced distributional model across 14 different types of linguistic relationships. With this information we will then be able to investigate as to whether modifying the training context renders any differences in quality across any of these categories. Ideally we will be able to identify approaches to training that increase precision in some specific linguistic categories, which will allow us to investigate whether these improvements can be combined by joining the information used in different training approaches to build a single, improved, model.},\n\taddress = {Atlanta, GA},\n\tauthor = {Jiang, Hang},\n\tdate-added = {2018-07-10 17:06:27 +0000},\n\tdate-modified = {2019-05-28 14:06:46 -0400},\n\tkeywords = {emorynlp},\n\tnote = {Undergraduate Honors Thesis, Emory University, Atlanta, GA, 2018.},\n\tschool = {Emory University},\n\ttitle = {{Automatic Personality Prediction with Attention-based Neural Networks}},\n\turl_paper = {https://etd.library.emory.edu/concern/etds/rv042t11v},\n\turl_slides = {https://www.slideshare.net/jchoi7s/automatic-personality-prediction-with-attentionbased-neural-networks},\n\tyear = {2018},\n\tBdsk-Url-1 = {https://etd.library.emory.edu/view/record/pid/emory:rj97r}}\n\n","author_short":["Jiang, H."],"key":"jiang:18a","id":"jiang:18a","bibbaseid":"jiang-automaticpersonalitypredictionwithattentionbasedneuralnetworks-2018","role":"author","urls":{" paper":"https://etd.library.emory.edu/concern/etds/rv042t11v"," slides":"https://www.slideshare.net/jchoi7s/automatic-personality-prediction-with-attentionbased-neural-networks"},"keyword":["emorynlp"],"metadata":{"authorlinks":{}}},"bibtype":"jurthesis","biburl":"http://www.mathcs.emory.edu/~choi/cv/jinho_choi-20210601.bib","dataSources":["WRbWtphN7JFZaJSwS","KCe4LtCfLaE5R9apZ"],"keywords":["emorynlp"],"search_terms":["automatic","personality","prediction","attention","based","neural","networks","jiang"],"title":"Automatic Personality Prediction with Attention-based Neural Networks","year":2018}