Validating Topic Modeling as a Method of Analyzing Sujet and Theme. Schröter, J. & Du, K. Journal of Computational Literary Studies, December, 2022. Number: 1 Publisher: Universitäts- und Landesbibliothek Darmstadt
Validating Topic Modeling as a Method of Analyzing Sujet and Theme [link]Paper  doi  abstract   bibtex   
In Computational Literary Studies (CLS), several procedures for thematic analysis have been adapted from NLP and Computer Science. Among these procedures, topic modeling is the most prominent and popular technique. We maintain, however, that this procedure is used only in the context of exploration up to date, but not in the context of justification. When we seek to prove assumptions concerning the correlation between genres, methods of computational text analysis have to be set up in research environments of justification, i.e. in environments of hypothesis testing. We provide a holistic model of validation and conceptual disambiguation of the notion of aboutness as sujet, fabula, and theme, and discuss essential methodological requirements for hypothesis-based analysis. As we maintain that validation has to be performed for individual tasks respectively, we shall perform empirical validation of topic modeling based on a new corpus of German novellas and comprehensive annotations and draw hypothetical generalizations on the applicability of topic modeling for analyzing aboutness in the domain of narrative fiction.
@article{schroter_validating_2022,
	title = {Validating {Topic} {Modeling} as a {Method} of {Analyzing} {Sujet} and {Theme}},
	volume = {1},
	copyright = {Copyright: © 2022 The Author(s). The text of this work is released under the Creative Commons license CC BY 4.0 International. You can find the contract text of the license at https://creativecommons.org/licenses/by/4.0/. The illustrations are excluded from this license, here the copyright lies with the respective rights holder.},
	issn = {2940-1348},
	url = {https://jcls.io/article/id/91/},
	doi = {10.48694/jcls.91},
	abstract = {In Computational Literary Studies (CLS), several procedures for thematic analysis have been adapted from NLP and Computer Science. Among these procedures, topic modeling is the most prominent and popular technique. We maintain, however, that this procedure is used only in the context of exploration up to date, but not in the context of justification. When we seek to prove assumptions concerning the correlation between genres, methods of computational text analysis have to be set up in research environments of justification, i.e. in environments of hypothesis testing. We provide a holistic model of validation and conceptual disambiguation of the notion of aboutness as sujet, fabula, and theme, and discuss essential methodological requirements for hypothesis-based analysis. As we maintain that validation has to be performed for individual tasks respectively, we shall perform empirical validation of topic modeling based on a new corpus of German novellas and comprehensive annotations and draw hypothetical generalizations on the applicability of topic modeling for analyzing aboutness in the domain of narrative fiction.},
	language = {eng},
	number = {1},
	urldate = {2025-07-07},
	journal = {Journal of Computational Literary Studies},
	author = {Schröter, Julian and Du, Keli},
	month = dec,
	year = {2022},
	note = {Number: 1
Publisher: Universitäts- und Landesbibliothek Darmstadt},
}

Downloads: 0