Data Management for Health Data Reuse: Proposal of a Standard Workflow and a R Tutorial with Jupyter Notebook. Lamer, A., Al Massati, S., Saint-Dizier, C., Fares, E., Chazard, E., & Fruchart, M. Studies in Health Technology and Informatics, 298:82–86, August, 2022.
doi  abstract   bibtex   
The data collected in the clinical registries or by data reuse require some modifications in order to suit the research needs. Several common operations are frequently applied to select relevant patients across the cohort, combine data from multiple sources, add new variables if needed and create unique tables depending on the research purpose. We carried out a qualitative survey by conducting semi-structured interviews with 7 experts in data reuse and proposed a standard workflow for health data management. We implemented a R tutorial based on a synthetic data set using Jupyter Notebook for a better understanding of the data management workflow.
@article{lamer_data_2022,
	title = {Data {Management} for {Health} {Data} {Reuse}: {Proposal} of a {Standard} {Workflow} and a {R} {Tutorial} with {Jupyter} {Notebook}},
	volume = {298},
	issn = {1879-8365},
	shorttitle = {Data {Management} for {Health} {Data} {Reuse}},
	doi = {10.3233/SHTI220912},
	abstract = {The data collected in the clinical registries or by data reuse require some modifications in order to suit the research needs. Several common operations are frequently applied to select relevant patients across the cohort, combine data from multiple sources, add new variables if needed and create unique tables depending on the research purpose. We carried out a qualitative survey by conducting semi-structured interviews with 7 experts in data reuse and proposed a standard workflow for health data management. We implemented a R tutorial based on a synthetic data set using Jupyter Notebook for a better understanding of the data management workflow.},
	language = {eng},
	journal = {Studies in Health Technology and Informatics},
	author = {Lamer, Antoine and Al Massati, Sanae and Saint-Dizier, Chloé and Fares, Emile and Chazard, Emmanuel and Fruchart, Mathilde},
	month = aug,
	year = {2022},
	pmid = {36073461},
	keywords = {Data Management, Data Science, Data management, Data reuse, Education, Humans, Programming, Workflow},
	pages = {82--86},
}

Downloads: 0