Workflows to Open Provenance Graphs, round-trip. Missier, P. & Goble, C. Future Generation Computer Systems (FGCS), 27(6):812–819, April, 2011.
doi  abstract   bibtex   
The Open Provenance Model is designed to capture relationships amongst data values, and amongst processors that produce or consume those values. While OPM graphs are able to describe aspects of a workflow execution, capturing the structure of the workflows themselves is understandably beyond the scope of the OPM specification, since the graphs may be generated by a broad variety of processes, which may not be formal workflows at all. % In particular, OPM does not address two questions: firstly, whether for any OPM graph there exists a ${\}textbackslashbackslash$textit\plausible\ workflow, in some model, which could have generated the graph. And secondly, which information should be captured as part of an OPM graph that is derived from the execution of some known type of workflow, so that the workflow structure and the execution trace can both be inferred back from the graph. % Motivated by the need to address the ${\}textbackslashbackslash$textit\Third Provenance Challenge\ using Taverna workflows and provenance, in this paper we explore such notion of ${\}textbackslashbackslash$textit\lossless-ness\ of OPM graphs relative to Taverna workflows. % For the first question, we show that Taverna is a suitable model for representing plausible OPM-generating processes. For the second question, we show how augmenting OPM with two types of annotations makes it lossless with respect to Taverna. We support this claim by presenting a two-way mapping between OPM graphs and Taverna workflows.
@article{missier_workflows_2011-1,
	title = {Workflows to {Open} {Provenance} {Graphs}, round-trip},
	volume = {27},
	doi = {http://dx.doi.org/10.1016/j.future.2010.10.012},
	abstract = {The Open Provenance Model is designed to capture relationships amongst data values, and amongst processors that produce or consume those values. While OPM graphs are able to describe aspects of a workflow execution, capturing the structure of the workflows themselves is understandably beyond the scope of the OPM specification, since the graphs may be generated by a broad variety of processes, which may not be formal workflows at all. \% In particular, OPM does not address two questions: firstly, whether for any OPM graph there exists a \${\textbackslash}textbackslashbackslash\$textit\{plausible\} workflow, in some model, which could have generated the graph. And secondly, which information should be captured as part of an OPM graph that is derived from the execution of some known type of workflow, so that the workflow structure and the execution trace can both be inferred back from the graph. \% Motivated by the need to address the \${\textbackslash}textbackslashbackslash\$textit\{Third Provenance Challenge\} using Taverna workflows and provenance, in this paper we explore such notion of \${\textbackslash}textbackslashbackslash\$textit\{lossless-ness\} of OPM graphs relative to Taverna workflows. \% For the first question, we show that Taverna is a suitable model for representing plausible OPM-generating processes. For the second question, we show how augmenting OPM with two types of annotations makes it lossless with respect to Taverna. We support this claim by presenting a two-way mapping between OPM graphs and Taverna workflows.},
	number = {6},
	journal = {Future Generation Computer Systems (FGCS)},
	author = {Missier, Paolo and Goble, Carole},
	month = apr,
	year = {2011},
	keywords = {\#provenance, \#OPM, OPM},
	pages = {812--819},
}

Downloads: 0