I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers. Vashistha, R. & Farahi, A.
Abstract: As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes necessary to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes the I-trustworthy framework – a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a hypothesis-testing method. This method utilizes a kernel-based test statistic, the Kernel Local Calibration Error (KLCE), to test the local calibration of a probabilistic classifier. This study provides theoretical guarantees by offering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, the LCE of related recalibration methods is studied, and we provide evidence of the insufficiency of existing methods to achieve I-trustworthiness.
@misc{vashistha_i-trustworthy_nodate,
title = {I-trustworthy {Models}. {A} framework for trustworthiness evaluation of probabilistic classifiers},
	abstract = {As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes necessary to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes the I-trustworthy framework – a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a hypothesis-testing method. This method utilizes a kernel-based test statistic, the Kernel Local Calibration Error (KLCE), to test the local calibration of a probabilistic classifier. This study provides theoretical guarantees by offering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, the LCE of related recalibration methods is studied, and we provide evidence of the insufficiency of existing methods to achieve I-trustworthiness.},
language = {en},
author = {Vashistha, Ritwik and Farahi, Arya},
}
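
The paper's exact KLCE construction, its unbiased estimator, and the convergence bounds are not reproduced in this entry. As a rough illustration of the core idea the abstract describes, a kernel-weighted local calibration residual together with a test of the null hypothesis that the classifier is locally calibrated at a query point, the following Python sketch may help. Everything in it is an assumption made for illustration: the Gaussian kernel, the bandwidth, the parametric-bootstrap test, and all function names are hypothetical and not taken from the paper.

import numpy as np

def rbf_weights(X, x0, bandwidth=0.5):
    """Gaussian (RBF) kernel weights between each row of X and a point x0."""
    d2 = ((X - x0[None, :]) ** 2).sum(axis=1)
    return np.exp(-d2 / (2.0 * bandwidth ** 2))

def local_calibration_error(X, y, p_hat, x0, bandwidth=0.5):
    """Kernel-weighted local calibration residual at a query point x0.

    X     : (n, d) feature matrix
    y     : (n,) binary labels in {0, 1}
    p_hat : (n,) predicted probabilities P(Y = 1 | X)
    x0    : (d,) query point at which to probe calibration

    Values far from zero suggest local miscalibration around x0.
    """
    w = rbf_weights(X, x0, bandwidth)
    w = w / w.sum()
    return float(np.dot(w, y - p_hat))

def calibration_test(X, y, p_hat, x0, n_sim=1000, bandwidth=0.5, seed=0):
    """Monte Carlo p-value for the null 'locally calibrated at x0'.

    Under the null the model's probabilities are correct, so labels can be
    simulated as Bernoulli(p_hat) to build the statistic's null distribution
    (a parametric bootstrap; not the paper's testing procedure).
    """
    rng = np.random.default_rng(seed)
    observed = abs(local_calibration_error(X, y, p_hat, x0, bandwidth))
    null_stats = np.empty(n_sim)
    for b in range(n_sim):
        y_null = rng.binomial(1, p_hat)  # labels consistent with the model
        null_stats[b] = abs(local_calibration_error(X, y_null, p_hat, x0, bandwidth))
    return float((null_stats >= observed).mean())

# Toy usage: a classifier whose probabilities are shifted upward by 0.15.
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 2))
p_true = 1.0 / (1.0 + np.exp(-X[:, 0]))
y = rng.binomial(1, p_true)
p_hat = np.clip(p_true + 0.15, 0.0, 1.0)   # deliberately miscalibrated
x0 = np.zeros(2)
print(local_calibration_error(X, y, p_hat, x0))  # roughly -0.15
print(calibration_test(X, y, p_hat, x0))         # small p-value expected

Simulating null labels as Bernoulli(p_hat) is the natural null model here: if the predicted probabilities are correct near x0, observed and simulated residuals share a distribution, so an unusually large observed residual is evidence of local miscalibration.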
{"_id":"SSr9QKnrcfCcEE64h","bibbaseid":"vashistha-farahi-itrustworthymodelsaframeworkfortrustworthinessevaluationofprobabilisticclassiers","author_short":["Vashistha, R.","Farahi, A."],"bibdata":{"bibtype":"misc","type":"misc","title":"I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers","abstract":"As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes a necessity to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes I-trustworthy framework – a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a method of hypothesis-testing. This method utilizes a kernel-based test statistic, Kernel Local Calibration Error (KLCE), to test local calibration of a probabilistic classifier. This study provides theoretical guarantees by o!ering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The e!ectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, LCE of related recalibration methods is studied, and we provide evidence of insu\"ciency of existing methods to achieve I-trustworthiness.","language":"en","author":[{"propositions":[],"lastnames":["Vashistha"],"firstnames":["Ritwik"],"suffixes":[]},{"propositions":[],"lastnames":["Farahi"],"firstnames":["Arya"],"suffixes":[]}],"bibtex":"@misc{vashistha_i-trustworthy_nodate,\n\ttitle = {I-trustworthy {Models}. {A} framework for trustworthiness evaluation of probabilistic classifiers},\n\tabstract = {As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes a necessity to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes I-trustworthy framework – a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a method of hypothesis-testing. This method utilizes a kernel-based test statistic, Kernel Local Calibration Error (KLCE), to test local calibration of a probabilistic classifier. This study provides theoretical guarantees by o!ering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The e!ectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. 
Finally, LCE of related recalibration methods is studied, and we provide evidence of insu\"ciency of existing methods to achieve I-trustworthiness.},\n\tlanguage = {en},\n\tauthor = {Vashistha, Ritwik and Farahi, Arya},\n}\n\n\n\n\n\n\n\n","author_short":["Vashistha, R.","Farahi, A."],"key":"vashistha_i-trustworthy_nodate","id":"vashistha_i-trustworthy_nodate","bibbaseid":"vashistha-farahi-itrustworthymodelsaframeworkfortrustworthinessevaluationofprobabilisticclassiers","role":"author","urls":{},"metadata":{"authorlinks":{}}},"bibtype":"misc","biburl":"https://bibbase.org/zotero-group/pratikmhatre/5933976","dataSources":["yJr5AAtJ5Sz3Q4WT4"],"keywords":[],"search_terms":["trustworthy","models","framework","trustworthiness","evaluation","probabilistic","classi","ers","vashistha","farahi"],"title":"I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers","year":null}