High-Performance Big Data Management Across Cloud Data Centers

Radu Tudoran 1, 2
1 KerData - Scalable Storage for Clouds and Beyond
Inria Rennes – Bretagne Atlantique, IRISA-D1 - SYSTÈMES LARGE ÉCHELLE
Abstract : The easily accessible computation power offered by cloud infrastructures, coupled with the Big Data revolution, is expanding the scale and speed at which data analysis is performed. Cloud resources for computation and storage are spread among globally distributed data centers. Enabling fast data transfers in such scenarios becomes particularly important for scientific applications for which moving the processing close to the data is expensive or not feasible (e.g., genome mapping, high-energy physics simulations, large sensor networks). The key goals of this thesis are to analyze how clouds can become “Big Data-friendly” and to identify the best options for providing data-oriented cloud services that address application needs. In this talk, we present our contributions to high-performance data management for applications running across multiple cloud data centers. We start by focusing on the scalability aspects of single-site processing and show how the MapReduce model can be extended across multiple sites. Next, we present a transfer service architecture that enables configurable cost-performance optimizations for inter-site transfers. This transfer scheme is then leveraged in the context of real-time streaming across cloud data centers. Finally, we investigate the viability of offering this data movement solution as a cloud-provided service, following a Transfer-as-a-Service paradigm based on a flexible pricing scheme.
Keywords : Cloud
Document type :
Theses

Cited literature [139 references]

https://tel.archives-ouvertes.fr/tel-01093767
Contributor : Radu Tudoran
Submitted on : Wednesday, January 7, 2015 - 10:45:08 AM
Last modification on : Friday, November 16, 2018 - 1:40:48 AM
Long-term archiving on : Saturday, April 15, 2017 - 7:06:32 AM

Identifiers

  • HAL Id : tel-01093767, version 1

Citation

Radu Tudoran. High-Performance Big Data Management Across Cloud Data Centers. Computer science. ENS Rennes, 2014. English. ⟨tel-01093767⟩
