MURA Dataset: Towards Radiologist-Level Abnormality Detection in Musculoskeletal Radiographs. Rajpurkar, P., Irvin, J., Bagul, A., Ding, D., Duan, T., Mehta, H., Yang, B., Zhu, K., Laird, D., Ball, R. L., Langlotz, C., Shpanskaya, K., Lungren, M. P., & Ng, A. 2017. cite arxiv:1712.06957
MURA Dataset: Towards Radiologist-Level Abnormality Detection in Musculoskeletal Radiographs [link]Paper  abstract   bibtex   
We introduce MURA, a large dataset of musculoskeletal radiographs containing 40,895 images from 14,982 studies, where each study is manually labeled by radiologists as either normal or abnormal. On this dataset, we train a 169-layer densely connected convolutional network to detect and localize abnormalities. To evaluate our model robustly and to get an estimate of radiologist performance, we collect additional labels from board-certified Stanford radiologists on the test set, consisting of 209 musculoskeletal studies. We compared our model and radiologists on the Cohen's kappa statistic, which expresses the agreement of our model and of each radiologist with the gold standard, defined as the majority vote of a disjoint group of radiologists. We find that our model achieves performance comparable to that of radiologists. Model performance is higher than the best radiologist performance in detecting abnormalities on finger studies and equivalent on wrist studies. However, model performance is lower than best radiologist performance in detecting abnormalities on elbow, forearm, hand, humerus, and shoulder studies, indicating that the task is a good challenge for future research. To encourage advances, we have made our dataset freely available at https://stanfordmlgroup.github.io/projects/mura
@misc{rajpurkar2017dataset,
  abstract = {We introduce MURA, a large dataset of musculoskeletal radiographs containing
40,895 images from 14,982 studies, where each study is manually labeled by
radiologists as either normal or abnormal. On this dataset, we train a
169-layer densely connected convolutional network to detect and localize
abnormalities. To evaluate our model robustly and to get an estimate of
radiologist performance, we collect additional labels from board-certified
Stanford radiologists on the test set, consisting of 209 musculoskeletal
studies. We compared our model and radiologists on the Cohen's kappa statistic,
which expresses the agreement of our model and of each radiologist with the
gold standard, defined as the majority vote of a disjoint group of
radiologists. We find that our model achieves performance comparable to that of
radiologists. Model performance is higher than the best radiologist performance
in detecting abnormalities on finger studies and equivalent on wrist studies.
However, model performance is lower than best radiologist performance in
detecting abnormalities on elbow, forearm, hand, humerus, and shoulder studies,
indicating that the task is a good challenge for future research. To encourage
advances, we have made our dataset freely available at
https://stanfordmlgroup.github.io/projects/mura},
  added-at = {2018-01-19T10:15:17.000+0100},
  author = {Rajpurkar, Pranav and Irvin, Jeremy and Bagul, Aarti and Ding, Daisy and Duan, Tony and Mehta, Hershel and Yang, Brandon and Zhu, Kaylie and Laird, Dillon and Ball, Robyn L. and Langlotz, Curtis and Shpanskaya, Katie and Lungren, Matthew P. and Ng, Andrew},
  biburl = {https://www.bibsonomy.org/bibtex/2c8ff2f68a0e57716153eefdda755bfd8/alexattia},
  description = {MURA Dataset: Towards Radiologist-Level Abnormality Detection in
  Musculoskeletal Radiographs},
  interhash = {0fbf86d39e14c19e59ea2d81d3feda51},
  intrahash = {c8ff2f68a0e57716153eefdda755bfd8},
  keywords = {azmed deep-learning recvis},
  note = {cite arxiv:1712.06957},
  timestamp = {2018-01-19T10:15:17.000+0100},
  title = {MURA Dataset: Towards Radiologist-Level Abnormality Detection in
  Musculoskeletal Radiographs},
  url = {http://arxiv.org/abs/1712.06957},
  year = 2017
}

Downloads: 0