Big Data Bags: A Scalable Packaging Format for Science. D'Arcy, M., Chard, K., Foster, I., Kesselman, C., Madduri, R., Saint, N., & Wagner, R. July, 2019. Publisher: Zenodo
Paper doi abstract bibtex The need to describe and exchange large and complex data underlies the vast majority of science conducted today. Such needs arise when downloading data from a repository, moving data between remote locations, exchanging data between collaborators, and even publishing data as part of the publication process. While such examples are common, it is surprisingly difficult to describe and exchange data, and it is even more difficult when datasets are large and span multiple storage locations. To address some of these challenges we proposed the Big Data Bag (BDBag) as a data packaging format for representing and describing complex, distributed, and large datasets. In this presentation, we outline the BDBag model and describe three scenarios in which it is currently being used
@article{darcy_big_2019,
title = {Big {Data} {Bags}: {A} {Scalable} {Packaging} {Format} for {Science}},
copyright = {Creative Commons Attribution 4.0 International, Open Access},
shorttitle = {Big {Data} {Bags}},
url = {https://zenodo.org/record/3338725},
doi = {10.5281/ZENODO.3338725},
abstract = {The need to describe and exchange large and complex data underlies the vast majority of science conducted today. Such needs arise when downloading data from a repository, moving data between remote locations, exchanging data between collaborators, and even publishing data as part of the publication process. While such examples are common, it is surprisingly difficult to describe and exchange data, and it is even more difficult when datasets are large and span multiple storage locations. To address some of these challenges we proposed the Big Data Bag (BDBag) as a data packaging format for representing and describing complex, distributed, and large datasets. In this presentation, we outline the BDBag model and describe three scenarios in which it is currently being used},
urldate = {2022-01-14},
author = {D'Arcy, Mike and Chard, Kyle and Foster, Ian and Kesselman, Carl and Madduri, Ravi and Saint, Nickolaus and Wagner, Rick},
month = jul,
year = {2019},
note = {Publisher: Zenodo},
}
Downloads: 0
{"_id":"rzyzkh6rtbsbmWtLt","bibbaseid":"darcy-chard-foster-kesselman-madduri-saint-wagner-bigdatabagsascalablepackagingformatforscience-2019","author_short":["D'Arcy, M.","Chard, K.","Foster, I.","Kesselman, C.","Madduri, R.","Saint, N.","Wagner, R."],"bibdata":{"bibtype":"article","type":"article","title":"Big Data Bags: A Scalable Packaging Format for Science","copyright":"Creative Commons Attribution 4.0 International, Open Access","shorttitle":"Big Data Bags","url":"https://zenodo.org/record/3338725","doi":"10.5281/ZENODO.3338725","abstract":"The need to describe and exchange large and complex data underlies the vast majority of science conducted today. Such needs arise when downloading data from a repository, moving data between remote locations, exchanging data between collaborators, and even publishing data as part of the publication process. While such examples are common, it is surprisingly difficult to describe and exchange data, and it is even more difficult when datasets are large and span multiple storage locations. To address some of these challenges we proposed the Big Data Bag (BDBag) as a data packaging format for representing and describing complex, distributed, and large datasets. In this presentation, we outline the BDBag model and describe three scenarios in which it is currently being used","urldate":"2022-01-14","author":[{"propositions":[],"lastnames":["D'Arcy"],"firstnames":["Mike"],"suffixes":[]},{"propositions":[],"lastnames":["Chard"],"firstnames":["Kyle"],"suffixes":[]},{"propositions":[],"lastnames":["Foster"],"firstnames":["Ian"],"suffixes":[]},{"propositions":[],"lastnames":["Kesselman"],"firstnames":["Carl"],"suffixes":[]},{"propositions":[],"lastnames":["Madduri"],"firstnames":["Ravi"],"suffixes":[]},{"propositions":[],"lastnames":["Saint"],"firstnames":["Nickolaus"],"suffixes":[]},{"propositions":[],"lastnames":["Wagner"],"firstnames":["Rick"],"suffixes":[]}],"month":"July","year":"2019","note":"Publisher: Zenodo","bibtex":"@article{darcy_big_2019,\n\ttitle = {Big {Data} {Bags}: {A} {Scalable} {Packaging} {Format} for {Science}},\n\tcopyright = {Creative Commons Attribution 4.0 International, Open Access},\n\tshorttitle = {Big {Data} {Bags}},\n\turl = {https://zenodo.org/record/3338725},\n\tdoi = {10.5281/ZENODO.3338725},\n\tabstract = {The need to describe and exchange large and complex data underlies the vast majority of science conducted today. Such needs arise when downloading data from a repository, moving data between remote locations, exchanging data between collaborators, and even publishing data as part of the publication process. While such examples are common, it is surprisingly difficult to describe and exchange data, and it is even more difficult when datasets are large and span multiple storage locations. To address some of these challenges we proposed the Big Data Bag (BDBag) as a data packaging format for representing and describing complex, distributed, and large datasets. In this presentation, we outline the BDBag model and describe three scenarios in which it is currently being used},\n\turldate = {2022-01-14},\n\tauthor = {D'Arcy, Mike and Chard, Kyle and Foster, Ian and Kesselman, Carl and Madduri, Ravi and Saint, Nickolaus and Wagner, Rick},\n\tmonth = jul,\n\tyear = {2019},\n\tnote = {Publisher: Zenodo},\n}\n\n","author_short":["D'Arcy, M.","Chard, K.","Foster, I.","Kesselman, C.","Madduri, R.","Saint, N.","Wagner, R."],"key":"darcy_big_2019","id":"darcy_big_2019","bibbaseid":"darcy-chard-foster-kesselman-madduri-saint-wagner-bigdatabagsascalablepackagingformatforscience-2019","role":"author","urls":{"Paper":"https://zenodo.org/record/3338725"},"metadata":{"authorlinks":{}}},"bibtype":"article","biburl":"https://api.zotero.org/users/3649949/collections/SP6RMP59/items?key=kvw05jEWpV9zO4gNkD1KQFRV&format=bibtex&limit=100","dataSources":["h2YCcsAFQ8zE8bupW"],"keywords":[],"search_terms":["big","data","bags","scalable","packaging","format","science","d'arcy","chard","foster","kesselman","madduri","saint","wagner"],"title":"Big Data Bags: A Scalable Packaging Format for Science","year":2019}