Interobserver agreement in behavioral research: Importance and calculation. Watkins, M. W. & Pacheco, M. Journal of Behavioral Education, 10(4):205–212, 2000.
Behavioral researchers have developed a sophisticated methodology to evaluate behavioral change which is dependent upon accurate measurement of behavior. Direct observation of behavior has traditionally been the mainstay of behavioral measurement. Consequently, researchers must attend to the psychometric properties, such as interobserver agreement, of observational measures to ensure reliable and valid measurement. Of the many indices of interobserver agreement, percentage of agreement is the most popular. Its use persists despite repeated admonitions and empirical evidence indicating that it is not the most psychometrically sound statistic to determine interobserver agreement due to its inability to take chance into account. Cohen's (1960) kappa has long been proposed as the more psychometrically sound statistic for assessing interobserver agreement. Kappa is described and computational methods are presented.
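The chance correction the abstract refers to is the standard definition of Cohen's (1960) kappa: kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement (what percent agreement reports) and p_e is the agreement expected by chance from each observer's marginal coding frequencies. A minimal Python sketch of that computation follows; the function name and the example data are illustrative, not taken from the paper:

    from collections import Counter

    def cohens_kappa(rater_a, rater_b):
        """Cohen's (1960) kappa: chance-corrected interobserver agreement."""
        assert len(rater_a) == len(rater_b)
        n = len(rater_a)
        # Observed proportion of agreement (what "percent agreement" reports).
        p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
        # Chance agreement expected from each rater's marginal label frequencies.
        freq_a = Counter(rater_a)
        freq_b = Counter(rater_b)
        p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
        return (p_o - p_e) / (1 - p_e)

    # Example: two observers coding 10 intervals as on-task (1) or off-task (0).
    a = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
    b = [1, 0, 0, 1, 0, 1, 1, 1, 1, 1]
    print(cohens_kappa(a, b))  # p_o = 0.80, p_e = 0.58, kappa ~ 0.52

The example illustrates the paper's point: the observers agree on 80% of intervals, but because both code "on-task" most of the time, chance alone predicts 58% agreement, and kappa drops to roughly 0.52.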
@article{watkins_interobserver_2000,
	title = {Interobserver agreement in behavioral research: {Importance} and calculation},
	volume = {10},
	issn = {1053-0819, 1573-3513},
	shorttitle = {Interobserver {Agreement} in {Behavioral} {Research}},
	url = {http://link.springer.com/article/10.1023/A%3A1012295615144},
	doi = {10.1023/A:1012295615144},
	abstract = {Behavioral researchers have developed a sophisticated methodology to evaluate behavioral change which is dependent upon accurate measurement of behavior. Direct observation of behavior has traditionally been the mainstay of behavioral measurement. Consequently, researchers must attend to the psychometric properties, such as interobserver agreement, of observational measures to ensure reliable and valid measurement. Of the many indices of interobserver agreement, percentage of agreement is the most popular. Its use persists despite repeated admonitions and empirical evidence indicating that it is not the most psychometrically sound statistic to determine interobserver agreement due to its inability to take chance into account. Cohen's (1960) kappa has long been proposed as the more psychometrically sound statistic for assessing interobserver agreement. Kappa is described and computational methods are presented.},
	language = {en},
	number = {4},
	urldate = {2014-10-26},
	journal = {Journal of Behavioral Education},
	author = {Watkins, Marley W. and Pacheco, Miriam},
	year = {2000},
	keywords = {Clinical Psychology, Health Psychology, Kappa, Pedagogic Psychology, Psychology of Personality, interobserver agreement, interrater reliability, observer agreement, percent agreement},
	pages = {205--212},
}
