The small from the Big: discovering models and mechanisms with machine learning. February 2017.
abstract   bibtex   
This paper proposes a new discussion on Big Data” as the primary source of model-building in science, when the computational architecture used is machine learning or evolutionary algorithms (and possibly a combination of them). Is Big Data mining a proper method of discovering and building models? The focus is on discovery and building (rather than justification or confirmation) of existing theories or models; and on relevant epistemic and pragmatic aspects of these computational architectures (rather than on the quantitative features of Big Data). Two competing (or, more mildly put, complementing) types of models used in biology and cognitive science are scrutinized: mechanism modeling and computational modeling (mostly network modeling and dynamical modeling). This paper aims to show that far from becoming irrelevant in the Big Data era, network models and mechanisms can be discovered and built from Big Data. The argument is based on the concept of patterns in data, discussed in the context of the relation between data and models (Bogen, Woodward, McAllister). It relates it then to the concept of “small patterns” in Big Data (Floridi) as hidden aspects of mechanistic models, hard to fathom by scientists. Machine learning, as a tool to categorize and characterize real patterns is assessed in the context of mechanistic and network models. Evolutionary computation is assessed as a method to optimize the search for mechanisms and complex networks. To compare and contrast the mechanistic account and its alternatives, this argument builds on two concepts central to all approaches: modularity (as related to decomposability), and organization (Bechtel, Darden, Craver), which both come in degrees and can be discovered through machine learning or evolutionary computation in Big Data (cf. E. Ratti and W. Pietsch). The paper concludes with the claim that Big Data, when it qualifies as scientific evidence, most likely has and will have a fundamental impact on the way we discover and build computational models in science.
@unpublished{SmallBigDiscovering2017a,
	title = {The small from the {Big}: discovering models and mechanisms with machine learning},
	copyright = {All rights reserved},
	abstract = {This paper proposes a new discussion on  Big Data” as the primary source of model-building in science, when the computational architecture used is machine learning or evolutionary algorithms (and possibly a combination of them). Is Big Data mining a proper method of discovering and building models? The focus is on discovery and building (rather than justification or confirmation) of existing theories or models; and on relevant epistemic and pragmatic aspects of these  computational architectures (rather than on the quantitative features of Big Data). Two competing (or, more mildly put, complementing) types of models used in biology and cognitive science are scrutinized: mechanism modeling and
computational modeling (mostly network modeling and dynamical modeling).
This paper aims to show that far from becoming irrelevant in the Big Data era, network models and mechanisms can be discovered and built from Big Data. The argument is based on the concept of patterns in data, discussed in the context of the relation between data and models (Bogen, Woodward, McAllister). It relates it then to the concept of “small patterns” in Big Data (Floridi) as hidden aspects of mechanistic models, hard to fathom by scientists.
Machine learning, as a tool to categorize and characterize real patterns is assessed in the context of mechanistic and network models. Evolutionary computation is assessed as a method to optimize the search for mechanisms and complex networks.
To compare and contrast the mechanistic account and its alternatives, this argument builds on two
concepts central to all approaches: modularity (as related to decomposability), and organization
(Bechtel, Darden, Craver), which both come in degrees and can be discovered through machine learning or evolutionary computation in Big Data (cf. E. Ratti and W. Pietsch). The paper concludes with the
claim that Big Data, when it qualifies as scientific evidence, most likely has and will have a fundamental impact on the way we discover and build computational models in science.},
	month = feb,
	year = {2017},
}

Downloads: 0