Evaluating Unsupervised Representation Learning for Detecting Stances of Fake News. Guderlei, M. & Aßenmacher, M. In Proceedings of the 28th International Conference on Computational Linguistics, pages 6339–6349, Barcelona, Spain (Online), December, 2020. International Committee on Computational Linguistics.
Evaluating Unsupervised Representation Learning for Detecting Stances of Fake News [link]Paper  doi  abstract   bibtex   
Our goal is to evaluate the usefulness of unsupervised representation learning techniques for detecting stances of Fake News. Therefore we examine several pre-trained language models with respect to their performance on two Fake News related data sets, both consisting of instances with a headline, an associated news article and the stance of the article towards the respective headline. Specifically, the aim is to understand how much hyperparameter tuning is necessary when fine-tuning the pre-trained architectures, how well transfer learning works in this specific case of stance detection and how sensitive the models are to changes in hyperparameters like batch size, learning rate (schedule), sequence length as well as the freezing technique. The results indicate that the computationally more expensive autoregression approach of XLNet (Yanget al., 2019) is outperformed by BERT-based models, notably by RoBERTa (Liu et al., 2019).While the learning rate seems to be the most important hyperparameter, experiments with different freezing techniques indicate that all evaluated architectures had already learned powerful language representations that pose a good starting point for fine-tuning them.
@inproceedings{guderlei_evaluating_2020,
	address = {Barcelona, Spain (Online)},
	title = {Evaluating {Unsupervised} {Representation} {Learning} for {Detecting} {Stances} of {Fake} {News}},
	url = {https://www.aclweb.org/anthology/2020.coling-main.558},
	doi = {10.18653/v1/2020.coling-main.558},
	abstract = {Our goal is to evaluate the usefulness of unsupervised representation learning techniques for detecting stances of Fake News. Therefore we examine several pre-trained language models with respect to their performance on two Fake News related data sets, both consisting of instances with a headline, an associated news article and the stance of the article towards the respective headline. Specifically, the aim is to understand how much hyperparameter tuning is necessary when fine-tuning the pre-trained architectures, how well transfer learning works in this specific case of stance detection and how sensitive the models are to changes in hyperparameters like batch size, learning rate (schedule), sequence length as well as the freezing technique. The results indicate that the computationally more expensive autoregression approach of XLNet (Yanget al., 2019) is outperformed by BERT-based models, notably by RoBERTa (Liu et al., 2019).While the learning rate seems to be the most important hyperparameter, experiments with different freezing techniques indicate that all evaluated architectures had already learned powerful language representations that pose a good starting point for fine-tuning them.},
	urldate = {2021-01-29},
	booktitle = {Proceedings of the 28th {International} {Conference} on {Computational} {Linguistics}},
	publisher = {International Committee on Computational Linguistics},
	author = {Guderlei, Maike and Aßenmacher, Matthias},
	month = dec,
	year = {2020},
	pages = {6339--6349},
}

Downloads: 0