Database Evolution, by Scientists, for Scientists: A Case Study. Schuler, R., Singla, J., Vallat, B., White, K. L., Berman, H. M., & Kesselman, C. In 2023 IEEE 19th International Conference on E-Science (e-Science), pages 1–10, Limassol, Cyprus, October, 2023. IEEE.
doi  abstract   bibtex   
Database management systems have been used to great advantage for industry usage scenarios. As science becomes increasingly dependent on carefully organized and curated data to inform and drive new discoveries, the need for database management systems has grown significantly. The long standing ``20 questions'' method was advocated in early studies of applying relational databases for science in order to elicit requirements for designing and developing information systems for scientific data. It has been observed, however, that database designs become outdated within months of usage leading to degradation in the quality of the database schema. In addition, there is limited evidence that scientists themselves have the tools and processes necessary to develop and maintain scientific databases without reliance on database administrators. Beyond learning to query databases, scientists need tools to create and evolve databases and guidance on how to apply those tools to develop information systems. In this paper, we present a simplified methodology for database evolution for scientists and a case study of database evolution by a scientist in the context of a research database for cell modeling. We include a detailed analysis of the activities and processes employed by the scientist during the schema evolution. Our results show that a scientist can successfully evolve a complex information system driven by new research requirements.
@inproceedings{Schuler2023,
	abstract = {Database management systems have been used to great advantage for industry usage scenarios. As science becomes increasingly dependent on carefully organized and curated data to inform and drive new discoveries, the need for database management systems has grown significantly. The long standing ``20 questions'' method was advocated in early studies of applying relational databases for science in order to elicit requirements for designing and developing information systems for scientific data. It has been observed, however, that database designs become outdated within months of usage leading to degradation in the quality of the database schema. In addition, there is limited evidence that scientists themselves have the tools and processes necessary to develop and maintain scientific databases without reliance on database administrators. Beyond learning to query databases, scientists need tools to create and evolve databases and guidance on how to apply those tools to develop information systems. In this paper, we present a simplified methodology for database evolution for scientists and a case study of database evolution by a scientist in the context of a research database for cell modeling. We include a detailed analysis of the activities and processes employed by the scientist during the schema evolution. Our results show that a scientist can successfully evolve a complex information system driven by new research requirements.},
	address = {{Limassol, Cyprus}},
	author = {Schuler, Robert and Singla, Jitin and Vallat, Brinda and White, Kate L. and Berman, Helen M. and Kesselman, Carl},
	booktitle = {2023 {{IEEE}} 19th {{International Conference}} on E-{{Science}} (e-{{Science}})},
	date-added = {2024-01-22 12:05:54 -0800},
	date-modified = {2024-01-22 12:05:54 -0800},
	doi = {10.1109/e-Science58273.2023.10254872},
	file = {/Users/schuler/Zotero/storage/L493H8AJ/Schuler et al. - 2022 - Relational Database Schema Evolution, by Scientist.pdf;/Users/schuler/Zotero/storage/PD8URSG9/CHiSEL_CaseStudy_2022.pdf},
	keywords = {in review},
	month = oct,
	pages = {1--10},
	publisher = {{IEEE}},
	title = {Database {{Evolution}}, by {{Scientists}}, for {{Scientists}}: {{A Case Study}}},
	year = {2023},
	bdsk-url-1 = {https://doi.org/10.1109/e-Science58273.2023.10254872}}

Downloads: 0