Automated Data Collection with R: A Practical Guide to Web Scraping and Text Mining. Munzert, S.; Rubba, C.; Meißner, P.; and Nyhuis, D. John Wiley & Sons, January, 2015.
A hands-on guide to web scraping and text mining for both beginners and experienced users of R. Introduces fundamental concepts of the main architecture of the web and databases and covers HTTP, HTML, XML, JSON, and SQL. Provides basic techniques to query web documents and data sets (XPath and regular expressions). An extensive set of exercises is presented to guide the reader through each technique. Explores both supervised and unsupervised techniques as well as advanced techniques such as data scraping and text management. Case studies are featured throughout along with examples for each technique presented. R code and solutions to exercises featured in the book are provided on a supporting website.
@book{munzert_automated_2015,
	title = {Automated {Data} {Collection} with {R}: {A} {Practical} {Guide} to {Web} {Scraping} and {Text} {Mining}},
	isbn = {978-1-118-83481-7},
	shorttitle = {Automated {Data} {Collection} with {R}},
	abstract = {A hands-on guide to web scraping and text mining for both beginners and experienced users of R. Introduces fundamental concepts of the main architecture of the web and databases and covers HTTP, HTML, XML, JSON, and SQL. Provides basic techniques to query web documents and data sets (XPath and regular expressions). An extensive set of exercises is presented to guide the reader through each technique. Explores both supervised and unsupervised techniques as well as advanced techniques such as data scraping and text management. Case studies are featured throughout along with examples for each technique presented. R code and solutions to exercises featured in the book are provided on a supporting website.},
	language = {en},
	publisher = {John Wiley \& Sons},
	author = {Munzert, Simon and Rubba, Christian and Meißner, Peter and Nyhuis, Dominic},
	month = jan,
	year = {2015},
	keywords = {Computers / Databases / Data Mining, Mathematics / Probability \& Statistics / Stochastic Processes}
}