Archival Description and Linked Data: A Preliminary Study of Opportunities and Implementation Challenges. Gracy, K. F. Archival Science, 15(3):239–294, 2015. This paper presents the results of a study to investigate how archives can connect their collections to related data sources through the use of Semantic Web technologies, specifically Linked Data. Questions explored included (a) What types of data currently available in archival surrogates such as Encoded Archival Description (EAD) finding aids and Machine-Readable Cataloging (MARC) records may be useful if converted to Linked Data? (b) For those potentially useful data points identified in archival surrogates, how might one align data structures found in those surrogates to the data structures of other relevant internal or external information sources? (c) What features of current standards and data structures present impediments or challenges that must be overcome in order to achieve interoperability among disparate data sources? To answer these questions, the researcher identified metadata elements of potential use as Linked Data in archival surrogates, as well as metadata element sets and vocabularies of data sets that could serve as pathways to relevant external data sources. Data sets chosen for the study included DBpedia and schema.org; metadata element sets examined included Friend of a Friend (FOAF), GeoNames, and Linking Open Description of Events (LODE). The researcher then aligned tags found in the EAD encoding standard to related classes and properties found in these Linked Data sources and metadata element sets.
To investigate the third question about impediments to incorporating Linked Data in archival descriptions, the researcher analyzed the locations and frequencies at which controlled and uncontrolled access points (personal and family name, corporate name, geographic name, and genre/form entities) appeared in a sample of MARC and EAD archival descriptive records by using a combination of hand counts and the natural language processing (NLP) tool, OpenCalais. The results of the location and frequency analysis, combined with the results of the alignment process, helped the researcher identify several critical challenges currently impeding interoperability among archival information systems and relevant Linked Data sources, including differences in granularity between archival and other data source vocabularies, and inadequacies of current encoding standards to support semantic tagging of potential access points embedded in free text areas of archival surrogates.
@article{gracy_archival_2015,
title = {Archival {Description} and {Linked} {Data}: {A} {Preliminary} {Study} of {Opportunities} and {Implementation} {Challenges}},
volume = {15},
issn = {1389-0166, 1573-7519},
shorttitle = {Archival description and linked data},
url = {https://doi.org/10.1007/s10502-014-9216-2},
doi = {10.1007/s10502-014-9216-2},
abstract = {This paper presents the results of a study to investigate how archives can connect their collections to related data sources through the use of Semantic Web technologies, specifically Linked Data. Questions explored included (a) What types of data currently available in archival surrogates such as Encoded Archival Description (EAD) finding aids and Machine-Readable Cataloging (MARC) records may be useful if converted to Linked Data? (b) For those potentially useful data points identified in archival surrogates, how might one align data structures found in those surrogates to the data structures of other relevant internal or external information sources? (c) What features of current standards and data structures present impediments or challenges that must be overcome in order to achieve interoperability among disparate data sources? To answer these questions, the researcher identified metadata elements of potential use as Linked Data in archival surrogates, as well as metadata element sets and vocabularies of data sets that could serve as pathways to relevant external data sources. Data sets chosen for the study included DBpedia and schema.org; metadata element sets examined included Friend of a Friend (FOAF), GeoNames, and Linking Open Description of Events (LODE). The researcher then aligned tags found in the EAD encoding standard to related classes and properties found in these Linked Data sources and metadata element sets. To investigate the third question about impediments to incorporating Linked Data in archival descriptions, the researcher analyzed the locations and frequencies at which controlled and uncontrolled access points (personal and family name, corporate name, geographic name, and genre/form entities) appeared in a sample of MARC and EAD archival descriptive records by using a combination of hand counts and the natural language processing (NLP) tool, OpenCalais. 
The results of the location and frequency analysis, combined with the results of the alignment process, helped the researcher identify several critical challenges currently impeding interoperability among archival information systems and relevant Linked Data sources, including differences in granularity between archival and other data source vocabularies, and inadequacies of current encoding standards to support semantic tagging of potential access points embedded in free text areas of archival surrogates.},
language = {en},
number = {3},
urldate = {2020-12-16},
journal = {Archival Science},
author = {Gracy, Karen F.},
year = {2015},
pages = {239--294},
}