Big Data Benchmarks of High-Performance Storage Systems on Commercial Bare Metal Clouds. Lee, H. & Fox, G. In IEEE International Conference on Cloud Computing (IEEE CLOUD 2019), 5, 2019.
Bare metal servers are widely available on public clouds and provide direct access to hardware; system configurations with high-performance storage and network devices are well suited to big data applications. Highly optimized servers with higher CPU core counts and dense storage may improve performance for certain workloads and help ensure the responsiveness of deployed services. Recent work on Hadoop ecosystems has addressed the performance improvement of scale-up machines configured with SSD storage and increased network bandwidth. This paper evaluates big data processing on dedicated clusters and provides a performance analysis of the NVMe devices and SSD block storage options available on Amazon, Google, Microsoft, and Oracle clouds. We present benchmark results alongside system performance tests to demonstrate the compute resource requirements of large-scale applications. The system capacity and limits of the underlying servers are described, along with a cost analysis of scaling workloads on these platforms.
