Towards Long-term and Archivable Reproducibility. Akhlaghi, M., Infante-Sainz, R., Roukema, B. F., Valls-Gabaud, D., & Baena-Gallé, R. June, 2020.
Paper abstract bibtex 5 downloads Reproducible workflow solutions commonly use high-level technologies that were popular when they were created, providing an immediate solution which is unlikely to be sustainable in the long term. We therefore introduce a set of criteria to address this problem and demonstrate their practicality and implementation. The criteria have been tested in several research publications and can be summarized as: completeness (no dependency beyond a POSIX-compatible operating system, no administrator privileges, no network connection and storage primarily in plain text); modular design; minimal complexity; scalability; verifiable inputs and outputs; temporal provenance; linking analysis with narrative; and free-and-open-source software. As a proof of concept, we have implemented "Maneage", a solution which stores the project in machine-actionable and human-readable plain-text, enables version-control, cheap archiving, automatic parsing to extract data provenance, and peer-reviewable verification. We show that requiring longevity of a reproducible workflow solution is realistic, without sacrificing immediate or short-term reproducibility and discuss the benefits of the criteria for scientific progress. This paper has itself been written in Maneage, with snapshot 1637cce.
@article{akhlaghi_towards_2020,
title = {Towards {Long}-term and {Archivable} {Reproducibility}},
url = {http://arxiv.org/abs/2006.03018v1},
abstract = {Reproducible workflow solutions commonly use high-level technologies that
were popular when they were created, providing an immediate solution which is
unlikely to be sustainable in the long term. We therefore introduce a set of
criteria to address this problem and demonstrate their practicality and
implementation. The criteria have been tested in several research publications
and can be summarized as: completeness (no dependency beyond a POSIX-compatible
operating system, no administrator privileges, no network connection and
storage primarily in plain text); modular design; minimal complexity;
scalability; verifiable inputs and outputs; temporal provenance; linking
analysis with narrative; and free-and-open-source software. As a proof of
concept, we have implemented "Maneage", a solution which stores the project in
machine-actionable and human-readable plain-text, enables version-control,
cheap archiving, automatic parsing to extract data provenance, and
peer-reviewable verification. We show that requiring longevity of a
reproducible workflow solution is realistic, without sacrificing immediate or
short-term reproducibility and discuss the benefits of the criteria for
scientific progress. This paper has itself been written in Maneage, with
snapshot 1637cce.},
language = {en},
urldate = {2020-06-09},
author = {Akhlaghi, Mohammad and Infante-Sainz, Raúl and Roukema, Boudewijn F. and Valls-Gabaud, David and Baena-Gallé, Roberto},
month = jun,
year = {2020},
}
Downloads: 5
{"_id":"mhZ2mvzBiJc4Z2wsi","bibbaseid":"akhlaghi-infantesainz-roukema-vallsgabaud-baenagall-towardslongtermandarchivablereproducibility-2020","authorIDs":[],"author_short":["Akhlaghi, M.","Infante-Sainz, R.","Roukema, B. F.","Valls-Gabaud, D.","Baena-Gallé, R."],"bibdata":{"bibtype":"article","type":"article","title":"Towards Long-term and Archivable Reproducibility","url":"http://arxiv.org/abs/2006.03018v1","abstract":"Reproducible workflow solutions commonly use high-level technologies that were popular when they were created, providing an immediate solution which is unlikely to be sustainable in the long term. We therefore introduce a set of criteria to address this problem and demonstrate their practicality and implementation. The criteria have been tested in several research publications and can be summarized as: completeness (no dependency beyond a POSIX-compatible operating system, no administrator privileges, no network connection and storage primarily in plain text); modular design; minimal complexity; scalability; verifiable inputs and outputs; temporal provenance; linking analysis with narrative; and free-and-open-source software. As a proof of concept, we have implemented \"Maneage\", a solution which stores the project in machine-actionable and human-readable plain-text, enables version-control, cheap archiving, automatic parsing to extract data provenance, and peer-reviewable verification. We show that requiring longevity of a reproducible workflow solution is realistic, without sacrificing immediate or short-term reproducibility and discuss the benefits of the criteria for scientific progress. This paper has itself been written in Maneage, with snapshot 1637cce.","language":"en","urldate":"2020-06-09","author":[{"propositions":[],"lastnames":["Akhlaghi"],"firstnames":["Mohammad"],"suffixes":[]},{"propositions":[],"lastnames":["Infante-Sainz"],"firstnames":["Raúl"],"suffixes":[]},{"propositions":[],"lastnames":["Roukema"],"firstnames":["Boudewijn","F."],"suffixes":[]},{"propositions":[],"lastnames":["Valls-Gabaud"],"firstnames":["David"],"suffixes":[]},{"propositions":[],"lastnames":["Baena-Gallé"],"firstnames":["Roberto"],"suffixes":[]}],"month":"June","year":"2020","bibtex":"@article{akhlaghi_towards_2020,\n\ttitle = {Towards {Long}-term and {Archivable} {Reproducibility}},\n\turl = {http://arxiv.org/abs/2006.03018v1},\n\tabstract = {Reproducible workflow solutions commonly use high-level technologies that\nwere popular when they were created, providing an immediate solution which is\nunlikely to be sustainable in the long term. We therefore introduce a set of\ncriteria to address this problem and demonstrate their practicality and\nimplementation. The criteria have been tested in several research publications\nand can be summarized as: completeness (no dependency beyond a POSIX-compatible\noperating system, no administrator privileges, no network connection and\nstorage primarily in plain text); modular design; minimal complexity;\nscalability; verifiable inputs and outputs; temporal provenance; linking\nanalysis with narrative; and free-and-open-source software. As a proof of\nconcept, we have implemented \"Maneage\", a solution which stores the project in\nmachine-actionable and human-readable plain-text, enables version-control,\ncheap archiving, automatic parsing to extract data provenance, and\npeer-reviewable verification. We show that requiring longevity of a\nreproducible workflow solution is realistic, without sacrificing immediate or\nshort-term reproducibility and discuss the benefits of the criteria for\nscientific progress. This paper has itself been written in Maneage, with\nsnapshot 1637cce.},\n\tlanguage = {en},\n\turldate = {2020-06-09},\n\tauthor = {Akhlaghi, Mohammad and Infante-Sainz, Raúl and Roukema, Boudewijn F. and Valls-Gabaud, David and Baena-Gallé, Roberto},\n\tmonth = jun,\n\tyear = {2020},\n}\n\n","author_short":["Akhlaghi, M.","Infante-Sainz, R.","Roukema, B. F.","Valls-Gabaud, D.","Baena-Gallé, R."],"key":"akhlaghi_towards_2020","id":"akhlaghi_towards_2020","bibbaseid":"akhlaghi-infantesainz-roukema-vallsgabaud-baenagall-towardslongtermandarchivablereproducibility-2020","role":"author","urls":{"Paper":"http://arxiv.org/abs/2006.03018v1"},"metadata":{"authorlinks":{}},"downloads":5},"bibtype":"article","biburl":"https://api.zotero.org/groups/2350194/items?key=d4zEd62chFWTgW3vq5H89sn4&format=bibtex&limit=100","creationDate":"2020-06-10T22:11:05.576Z","downloads":5,"keywords":[],"search_terms":["towards","long","term","archivable","reproducibility","akhlaghi","infante-sainz","roukema","valls-gabaud","baena-gallé"],"title":"Towards Long-term and Archivable Reproducibility","year":2020,"dataSources":["7aS5xoegGd2eqrz9s","xcS49WDoKSRX7f7Mg","TfnyHmY3zxAr7QA6G"]}