D. J. Abadi, D. Carney, U. Çetintemel, M. Cherniack, C. Convey et al., Aurora: a new model and architecture for data stream management, The VLDB Journal The International Journal on Very Large Data Bases, vol.12, issue.2, pp.120-139, 2003.
DOI : 10.1007/s00778-003-0095-z

R. Agarwal, G. Juve, and E. Deelman, Peer-to-Peer Data Sharing for Scientific Workflows on Amazon EC2, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.82-89, 2012.
DOI : 10.1109/SC.Companion.2012.23

W. Allcock, GridFTP: Protocol Extensions to FTP for the Grid, In: Global Grid ForumGFD-RP, vol.20, 2003.

W. Allcock, J. Bresnahan, R. Kettimuthu, M. Link, C. Dumitrescu et al., The Globus Striped GridFTP Framework and Server, ACM/IEEE SC 2005 Conference (SC'05), 2005.
DOI : 10.1109/SC.2005.72

B. Kurian and A. , Grid Eigen Trust a Framework for Computing Reputation in Grids, 2003.

G. Ananthanarayanan and R. H. Katz, Greening the Switch, Proceedings of the 2008 Conference on Power Aware Computing and Systems. HotPower'08

P. David and . Anderson, BOINC: A System for Public-Resource Computing and Storage, Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing, pp.4-10, 2004.

. Azure and . Centers, building-you r-dream-devops-dashboard-with-the-new-azure-preview-portal, 2014.

. Azure-failure and . Incident, http://azure.microsoft.com/blog/2012/03/09/summary-of-windo ws-azure-service-disruption-on, 2012.

J. Balewski, J. Lauret, D. Olson, I. Sakrejda, D. Arkhipkin et al., Offloading peak processing to virtual farm by STAR experiment at RHIC, Journal of Physics: Conference Series, 2012.
DOI : 10.1088/1742-6596/368/1/012011

C. Balkesen, N. Dindar, M. Wetter, and N. Tatbul, RIP, Proceedings of the 7th ACM international conference on Distributed event-based systems, DEBS '13, pp.3-14, 2013.
DOI : 10.1145/2488222.2488257

A. Baptista, B. Howe, J. Freire, D. Maier, and C. T. Silva, Scientific Exploration in the Era of Ocean Observatories, Computing in Science & Engineering, vol.10, issue.3, pp.53-58, 2008.
DOI : 10.1109/MCSE.2008.83

G. Bell, T. Hey, and A. Szalay, Beyond the Data Deluge, pp.1297-1298, 2009.

B. Biller and B. L. Nelson, Modeling and generating multivariate time-series input processes using a vector autoregressive technique, ACM Transactions on Modeling and Computer Simulation, vol.13, issue.3, pp.211-237, 2003.
DOI : 10.1145/937332.937333

T. Bishop, Data Center 2.0 A Roadmap for Data Center Transformation, 2013.

B. Bond, Best Practices for Developing on Window Azure

I. Botan, G. Alonso, P. M. Fischer, D. Kossmann, and N. Tatbul, Flexible and scalable storage management for data-intensive stream processing, Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology, EDBT '09, pp.934-945, 2009.
DOI : 10.1145/1516360.1516467

Y. Bu, B. Howe, M. Balazinska, and M. D. Ernst, HaLoop, Proc. VLDB Endow, pp.285-296, 2010.
DOI : 10.14778/1920841.1920881

K. Budati, J. Sonnek, A. Chandra, and J. Weissman, Ridge, Proceedings of the 16th international symposium on High performance distributed computing , HPDC '07, pp.55-64, 2007.
DOI : 10.1145/1272366.1272374

R. Buyya, Market-Oriented Cloud Computing: Vision, Hype, and Reality of Delivering Computing as the 5th Utility, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.1-978, 2009.
DOI : 10.1109/CCGRID.2009.97

B. Calder, J. Wang, A. Ogus, N. Nilakantan, A. Skjolsvold et al., Mian Fahim ul Haq, Muhammad Ikram ul Haq Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency, Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles. SOSP '11, pp.143-157, 2011.

V. Ümit, K. Çatalyürek, B. Kaya, and . Uçar, Integrated data placement and task assignment for scientific workflows in clouds, Proceedings of the fourth international workshop on Data-intensive distributed computing. DIDC '11, pp.45-54, 2011.

B. Chandramouli, J. Goldstein, R. Barga, M. Riedewald, and I. Santos, Accurate latency estimation in a distributed event processing system, 2011 IEEE 27th International Conference on Data Engineering, pp.255-266, 2011.
DOI : 10.1109/ICDE.2011.5767926

F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach et al., Bigtable, ACM Transactions on Computer Systems, vol.26, issue.2, pp.1-4, 2008.
DOI : 10.1145/1365815.1365816

H. Chen, F. Tang, P. Tino, and X. Yao, Model-based kernel for efficient time series analysis, Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '13, pp.392-400, 2013.
DOI : 10.1145/2487575.2487700

X. Cheng, S. Su, Z. Zhang, H. Wang, F. Yang et al., Virtual network embedding through topology-aware node ranking, ACM SIGCOMM Computer Communication Review, vol.41, issue.2, 2011.
DOI : 10.1145/1971162.1971168

B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon et al., PNUTS, Proc. VLDB Endow. 1
DOI : 10.14778/1454159.1454167

V. Benoit-da-mota, E. Frouin, S. Duchesnay, G. Laguitton, J. Varoquaux et al., A fast computational framework for genome-wide association studies with neuroimaging data, 20th International Conference on Computational Statistics, 2012.

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

G. Decandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman et al., Dynamo, ACM SIGOPS Operating Systems Review, vol.41, issue.6, pp.205-220, 2007.
DOI : 10.1145/1323293.1294281

E. Deelman, G. Singh, M. Su, J. Blythe, Y. Gil et al., Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems, Scientific Programming, vol.13, issue.3, pp.219-237, 2005.
DOI : 10.1155/2005/128026

G. Demartini, B. Trushkowsky, T. Kraska, and M. J. Franklin, CrowdQ: Crowdsourced Query Understanding, 2003.

M. Dorier, G. Antoniu, F. Cappello, M. Snir, and L. Orf, Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O, 2012 IEEE International Conference on Cluster Computing, pp.155-163, 2012.
DOI : 10.1109/CLUSTER.2012.26

URL : https://hal.archives-ouvertes.fr/hal-00715252

E. Dumbill, What is Big Data? Tech. rep. Oreilly Radar, 2012.

N. Edwards, M. Watkins, M. Gates, A. Coles, E. Deliot et al., High-speed Storage Nodes for the Cloud, 2011 Fourth IEEE International Conference on Utility and Cloud Computing, pp.25-32, 2011.
DOI : 10.1109/UCC.2011.14

J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S. Bae et al., Twister, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.810-818, 2010.
DOI : 10.1145/1851476.1851593

P. Fan, Z. Chen, J. Wang1, Z. Zheng, and M. R. Lyu, A topology-aware method for scientific application deployment on cloud, Concurrency and Computattion: Practice and Experience, 2012.
DOI : 10.1504/IJWGS.2014.064937

P. Fan, Z. Chen, J. Wang, Z. Zheng, and M. R. Lyu, Topology-Aware Deployment of Scientific Applications in Cloud Computing, 2012 IEEE Fifth International Conference on Cloud Computing, 2012.
DOI : 10.1109/CLOUD.2012.70

E. Feller, L. Ramakrishnan, and C. Morin, On the Performance and Energy Efficiency of Hadoop Deployment Models Grid'5000 Grid'5000, The IEEE International Conference on Big Data 2013, 2013.

I. Foster, R. Kettimuthu, S. Martin, S. Tuecke, D. Milroy et al., Campus bridging made easy via Globus services, Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment on Bridging from the eXtreme to the campus and beyond, XSEDE '12, pp.1-50, 2012.
DOI : 10.1145/2335755.2335847

I. Foster, A. Chervenak, D. Gunter, K. Keahey, R. Madduri et al., Enabling PETASCALE Data Movement and Analysis, In: Scidac Review, 2009.

M. J. Franklin, B. Trushkowsky, P. Sarkar, and T. Kraska, Crowdsourced Enumeration Queries, Proceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013). ICDE '13, pp.673-684, 2013.

S. Gamage, R. Rao-kompella, D. Xu, and A. Kangarlou, Protocol Responsibility Offloading to Improve TCP Throughput in Virtualized Environments, ACM Transactions on Computer Systems, vol.31, issue.3, pp.1-7, 2013.
DOI : 10.1145/2518037.2491463

J. Gantz, D. Reinsel, . The, . Digital, and . In, Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East, Tech. rep. Internet Data Center(IDC ), 2012.

B. Gedik, H. Andrade, K. Wu, P. S. Yu, and M. Doo, SPADE, Proceedings of the 2008 ACM SIGMOD international conference on Management of data , SIGMOD '08, pp.1123-1134, 2008.
DOI : 10.1145/1376616.1376729

D. Ghoshal, R. Shane-canon, and L. Ramakrishnan, I/O performance of virtualized cloud environments, Proceedings of the second international workshop on Data intensive computing in the clouds, DataCloud-SC '11
DOI : 10.1145/2087522.2087535

W. Seattle, ISBN: 978-1-4503-1144-1, pp.71-80, 2011.

L. Golab, M. T. Özsu, and S. Rec, Issues in data stream management, ACM SIGMOD Record, vol.32, issue.2, pp.5-14, 2003.
DOI : 10.1145/776985.776986

L. A. Bautista-gomez and F. Cappello, Improving Floating Point Compression through Binary Masks, IEEE BigData, 2013.

A. Gomez-iglesias, A. Ernst, and G. Singh, Scalable Multi Swarm-Based Algorithms with Lagrangian Relaxation for Constrained Problems, 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, pp.1073-1080
DOI : 10.1109/TrustCom.2013.241

J. Gray, A Transformed Scientific Method. http://research.microsoft.com/en-us/um/pe ople/gray/talks/NRC-CSTB_eScience.ppt, 2007.

J. Gray and A. Szalay, Science In An Exponential World, Nature, vol.44023, 2006.

A. Greenberg, J. Hamilton, D. A. Maltz, and P. Patel, The cost of a cloud, ACM SIGCOMM Computer Communication Review, vol.39, issue.1, pp.68-73, 2008.
DOI : 10.1145/1496091.1496103

Y. Gu and R. L. Grossman, Sector and Sphere: the design and implementation of a high-performance data cloud, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.17, issue.1897, pp.1897-2429, 2009.
DOI : 10.1098/rsta.2009.0053

T. Gunarathne, T. Wu, J. Y. Choi, S. Bae, and J. Qiu, Cloud Computing Paradigms for Pleasingly Parallel Biomedical Applications, Concurr. Comput. : Pract. Exper, vol.2317, pp.2338-2354, 2011.

T. Gunarathne, T. Wu, J. Qiu, and G. Fox, MapReduce in the Clouds for Science, 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp.565-572, 2010.
DOI : 10.1109/CloudCom.2010.107

T. Gunarathne, B. Zhang, T. Wu, and J. Qiu, Scalable parallel computing on clouds using Twister4Azure iterative MapReduce, Future Generation Computer Systems, vol.29, issue.4, pp.167-739, 2012.
DOI : 10.1016/j.future.2012.05.027

A. Gupta, O. D. Sahin, D. Agrawal, and . Abbadi, Meghdoot: Content-Based Publish/Subscribe over P2P Networks, Proceedings of the 5th ACM International Conference on Middleware. Middleware '04, pp.254-273, 2004.
DOI : 10.1007/3-540-45518-3_18

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.3996

T. J. Hacker, B. D. Noble, and B. D. Athey, Adaptive data block scheduling for parallel TCP streams, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005., pp.265-275, 2005.
DOI : 10.1109/HPDC.2005.1520970

I. F. Haddad, PVFS: A Parallel Virtual File System for Linux Clusters, In: Linux J, 2000.

P. Hande, M. Chiang, R. Calderbank, and S. Rangan, Network Pricing and Rate Allocation with Content Provider Participation, IEEE INFOCOM 2009, The 28th Conference on Computer Communications, pp.990-998, 2009.
DOI : 10.1109/INFCOM.2009.5062010

M. Hayes and S. Shah, Hourglass: A library for incremental processing on Hadoop, 2013 IEEE International Conference on Big Data, pp.742-752, 2013.
DOI : 10.1109/BigData.2013.6691647

. Hdinsight, Hadoop on Azure). https://www.hadooponazure.com

M. Bingsheng-he, Z. Yang, R. Guo, B. Chen, W. Su et al., Comet: Batched Stream Processing for Data Intensive Distributed Computing, Proceedings of the 1st ACM Symposium on Cloud Computing. SoCC '10, pp.63-74, 2010.

. Heavyload, http://www.jam-software.com/heavyload

T. Hey, S. Tansley, and K. M. Tolle, The Fourth Paradigm ??? Data-Intensive Scientific Discovery, pp.978-0982544204, 2009.
DOI : 10.1007/978-3-642-33299-9_1

T. Hey and A. E. Trefethen, Cyberinfrastructure for e-Science, In: Science, vol.3085723, pp.817-821, 2005.

H. Hiden, S. Woodman, P. Watson, and J. Ca?a, Developing cloud applications using the e-Science Central platform, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, vol.19, issue.1983, 1983.
DOI : 10.1098/rsta.2012.0085

H. Hiden, S. Woodman, P. Watson, M. Catt, M. Trenell et al., Improving the scalability of movement monitoring workflows: An architecture for the integration of the Hadoop File System into e-Science Central, Proceedings of 1st Conference on Digital Research, 2012.

Z. Hill, J. Li, M. Mao, A. Ruiz-alvarez, and M. Humphrey, Early observations on the performance of Windows Azure, In: Sci. Program, vol.19, pp.2-3, 2011.

R. Immich, E. Cerqueira, and M. Curado, Adaptive video-aware FEC-based mechanism with unequal error protection scheme, Proceedings of the 28th Annual ACM Symposium on Applied Computing, SAC '13, pp.981-988
DOI : 10.1145/2480362.2480550

. Resilin, Elastic MapReduce over Multiple Clouds, IEEE Computer Society, 2013.

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, Dryad: distributed data-parallel programs from sequential building blocks, Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007. EuroSys '07, pp.59-72, 2007.

A. Ishii and T. Suzumura, Elastic Stream Computing with Clouds, 2011 IEEE 4th International Conference on Cloud Computing, pp.195-202
DOI : 10.1109/CLOUD.2011.11

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.453.423

K. R. Jackson, L. Ramakrishnan, K. J. Runge, and R. C. Thomas, Seeking supernovae in the clouds, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.421-429, 2010.
DOI : 10.1145/1851476.1851538

G. Juve, E. Deelman, G. Bruce-berriman, B. P. Berman, and P. Maechling, An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2, Journal of Grid Computing, vol.3, issue.3???4, pp.5-21, 2012.
DOI : 10.1007/s10723-012-9207-6

W. Kabsch, A solution for the best rotation to relate two sets of vectors, Acta Crystallographica Section A, vol.32, issue.5, pp.922-923, 1976.
DOI : 10.1107/S0567739476001873

V. Kalavri and V. Vlassov, MapReduce: Limitations, Optimizations and Open Issues, 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, pp.1031-1038
DOI : 10.1109/TrustCom.2013.126

G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan et al., A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP

G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan et al., Using overlays for efficient data transfer over shared wide-area networks, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-47, 2008.
DOI : 10.1109/SC.2008.5213292

R. Kienzler, R. Bruggmann, A. Ranganathan, and N. Tatbul, Stream as You Go: The Case for Incremental Data Access and Processing in the Cloud, 2012 IEEE 28th International Conference on Data Engineering Workshops, pp.159-166, 2012.
DOI : 10.1109/ICDEW.2012.69

T. Kosar and M. Livny, A framework for reliable and efficient data placement in distributed computing systems, Journal of Parallel and Distributed Computing, vol.65, issue.10, pp.1146-1157, 2005.
DOI : 10.1016/j.jpdc.2005.04.019

T. Kosar, E. Arslan, B. Ross, and B. Zhang, StorkCloud, Proceedings of the 4th ACM workshop on Scientific cloud computing, Science Cloud '13, pp.29-36, 2013.
DOI : 10.1145/2465848.2465855

A. Lakshman and P. Malik, Cassandra, ACM SIGOPS Operating Systems Review, vol.44, issue.2, pp.35-40, 2010.
DOI : 10.1145/1773912.1773922

C. Lal, V. Laxmi, and M. Singh-gaur, A rate adaptive and multipath routing protocol to support video streaming in MANETs, Proceedings of the International Conference on Advances in Computing, Communications and Informatics, ICACCI '12, pp.262-268, 2012.
DOI : 10.1145/2345396.2345440

D. Laney, 3D Data Management: Controlling Data Volume, Veracity, and Variety. Tech. rep. Meta Group, p.94, 2011.

N. Laoutaris, M. Sirivianos, X. Yang, and P. Rodriguez, Interdatacenter Bulk Transfers with Netstitcher, Proceedings of the ACM SIGCOMM 2011 Conference. SIGCOMM '11, pp.74-85, 2011.

I. Legrand, H. Newman, R. Voicu, C. Cirstoiu, C. Grigoras et al., MonALISA: An agent based, dynamic service system to monitor, control and optimize distributed systems 40 {YEARS} {OF} CPC: A celebratory issue focused on quality software for high performance, grid and novel computing architectures, In: Computer Physics Communications, vol.18012, pp.2472-2498, 2009.

W. Liu, B. Tieman, R. Kettimuthu, and I. Foster, A data transfer framework for large-scale science experiments, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.717-724, 2010.
DOI : 10.1145/1851476.1851582

W. Liu, B. Tieman, R. Kettimuthu, and I. Foster, A data transfer framework for large-scale science experiments, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.717-724, 2010.
DOI : 10.1145/1851476.1851582

S. Loesing, M. Hentschel, T. Kraska, and D. Kossmann, Stormy, Proceedings of the 2012 Joint EDBT/ICDT Workshops on, EDBT-ICDT '12, pp.55-60, 2012.
DOI : 10.1145/2320765.2320789

Y. Luo and B. Plale, Hierarchical MapReduce Programming Model and Scheduling Algorithms CCGRID '12, Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp.769-774, 2012.

S. Kasper-grud, L. Madsen, Y. Su, and . Zhou, Grand Challenge: MapReduce-style Processing of Fast Sensor Data, Proceedings of the 7th ACM International Conference on Distributed Event-based Systems. DEBS '13, pp.313-318, 2013.

S. Mao, X. Cheng, Y. T. Hou, . Hanifd, and . Sherali, Multiple Description Video Multicast in Wireless Ad Hoc Networks, Mobile Networks and Applications, vol.11, issue.1, pp.63-73, 2006.
DOI : 10.1007/s11036-005-4461-5

D. Moise, G. Antoniu, and L. Bougé, On-the-fly Task Execution for Speeding Up Pipelined Mapreduce Euro-Par'12, Proceedings of the 18th International Conference on Parallel Processing, pp.526-537, 2012.

M. Henry, A. R. Monti, S. S. Butt, and . Vazhkudai, CATCH: A Cloud- Based Adaptive Data Transfer Service for HPC, Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium. IPDPS '11, pp.1242-1253, 2011.

B. Nicolae, P. Riteau, and K. Keahey, Bursting the Cloud Data Bubble: Towards Transparent Storage Elasticity in IaaS Clouds, 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014.
DOI : 10.1109/IPDPS.2014.25

URL : https://hal.archives-ouvertes.fr/hal-00947599

B. Nicolae, G. Antoniu, L. Bougé, D. Moise, and R. Carpen-amarie, BlobSeer: Next-generation data management for large scale infrastructures, Journal of Parallel and Distributed Computing, vol.71, issue.2, pp.169-184, 2011.
DOI : 10.1016/j.jpdc.2010.08.004

URL : https://hal.archives-ouvertes.fr/inria-00511414

B. Nicolae, G. Antoniu, L. Bougé, D. Moise, and R. Carpen-amarie, BlobSeer: Next-generation data management for large scale infrastructures, Journal of Parallel and Distributed Computing, vol.71, issue.2, pp.169-184, 2011.
DOI : 10.1016/j.jpdc.2010.08.004

URL : https://hal.archives-ouvertes.fr/inria-00511414

B. Nicolae, J. Bresnahan, K. Keahey, and G. Antoniu, Going back and forth, Proceedings of the 20th international symposium on High performance distributed computing, HPDC '11, pp.147-158, 2011.
DOI : 10.1145/1996130.1996152

URL : https://hal.archives-ouvertes.fr/inria-00570682

K. Hiraga, O. Tatebe, and N. Soda, Gfarm grid file system, New Generation Computing, pp.257-275, 2010.

O. Grizzly, http://openstack.org/software/grizzly

A. Padmanabhan, S. Wang, G. Cao, M. Hwang, Y. Zhao et al., FluMapper, Proceedings of the Conference on Extreme Science and Engineering Discovery Environment Gateway to Discovery, XSEDE '13, pp.1-33, 2013.
DOI : 10.1145/2484762.2484821

. Pan-starrs, Panoramic Survey Telescope and Rapid Response System

A. M. Parker, Understanding the Universe. Tech. rep. Towards 2020 Science, Microsoft Corporation, 2006.

A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. Dewitt et al., A Comparison of Approaches to Largescale Data Analysis, Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data. SIGMOD '09. Providence, pp.165-178, 2009.

J. Poline, C. Lalanne, A. Tenenhaus, E. Duchesnay, B. Thirion et al., Imaging genetics: bio-informatics and biostatistics challenges, 19th International Conference on Computational Statistics, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00523236

P. N. Krishna, T. Puttaswamy, M. Nandagopal, and . Kodialam, Frugal Storage for Cloud File Systems, Proceedings of the 7th ACM European Conference on Computer Systems. EuroSys '12, pp.71-84, 2012.

C. Raiciu, C. Pluntke, S. Barre, A. Greenhalgh, D. Wischik et al., Data center networking with multipath TCP, Proceedings of the Ninth ACM SIGCOMM Workshop on Hot Topics in Networks, Hotnets '10, pp.1-10, 2010.
DOI : 10.1145/1868447.1868457

S. Sakr, A. Liu, D. M. Batista, and M. Alomari, A Survey of Large Scale Data Management Approaches in Cloud Environments, IEEE Communications Surveys & Tutorials, vol.13, issue.3, pp.311-336, 2011.
DOI : 10.1109/SURV.2011.032211.00087

URL : https://hal.archives-ouvertes.fr/inria-00623093

S. Sakr, A. Liu, D. M. Batista, and M. Alomari, A Survey of Large Scale Data Management Approaches in Cloud Environments, IEEE Communications Surveys & Tutorials, vol.13, issue.3, pp.311-336, 2011.
DOI : 10.1109/SURV.2011.032211.00087

URL : https://hal.archives-ouvertes.fr/inria-00623093

S. Shakkottai and R. Srikant, Economics of Network Pricing With Multiple ISPs, IEEE/ACM Transactions on Networking, vol.14, issue.6, pp.1233-1245, 2006.
DOI : 10.1109/TNET.2006.886393

M. A. Sharaf, P. K. Chrysanthis, and A. Labrinidis, Tuning QoD in Stream Processing Engines In: Proceedings of the Twenty-First Australasian Conference on Database Technologies - ADC '10, pp.103-112, 2010.

Y. Simmhan, C. Van-ingen, G. Subramanian, and J. Li, Bridging the Gap between Desktop and the Cloud for eScience Applications, 2010 IEEE 3rd International Conference on Cloud Computing, pp.474-481, 2010.
DOI : 10.1109/CLOUD.2010.72

A. Singh, M. Srivatsa, and L. Liu, Search-as-a-service, ACM Transactions on the Web, vol.3, issue.4, pp.1-13, 2009.
DOI : 10.1145/1594173.1594175

I. Stoica, R. Morris, D. Liben-nowell, D. R. Karger, M. F. Kaashoek et al., Chord: a scalable peer-to-peer lookup protocol for internet applications, IEEE/ACM Transactions on Networking, vol.11, issue.1, pp.17-32, 2003.
DOI : 10.1109/TNET.2002.808407

B. Muhammad-adnan-tariq, K. Koldehofe, and . Rothermel, Efficient Contentbased Routing with Network Topology Inference, Proceedings of the 7th ACM International Conference on Distributed Event-based Systems. DEBS '13, pp.51-62, 2013.

B. Muhammad-adnan-tariq, G. G. Koldehofe, K. Koch, and . Rothermel, Distributed Spectral Cluster Management: A Method for Building Dynamic Publish/Subscribe Systems, Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems. DEBS '12, pp.213-224, 2012.

B. Muhammad-adnan-tariq, G. G. Koldehofe, I. Koch, K. Khan, and . Rothermel, Meeting Subscriber-defined QoS Constraints in Publish/Subscribe Systems, Concurr. Comput. : Pract. Exper, vol.2317, pp.2140-2153, 2011.

A. R. Thakar, A. S. Szalay, P. Z. Kunszt, and J. Gray, Migrating a multiterabyte archive from object to relational databases, Computing in Science & Engineering, vol.5, issue.5, pp.16-29, 2003.
DOI : 10.1109/MCISE.2003.1225857

M. Mao, Location, Location, Location!: Modeling Data Proximity in the Cloud, Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks. Hotnets- IX, pp.1-15, 2010.

M. Mao, Location, location, location!: modeling data proximity in the cloud, Proceedings of the 9th ACM SIGCOMM Workshop on Hot Topics in Networks. Hotnets-IX

D. Toshniwal and R. C. Joshi, Finding Similarity in Time Series Data by Method of Time Weighted Moments ADC '05, Proceedings of the 16th Australasian Database Conference, pp.155-164, 2005.

E. Vairavanathan, S. Al-kiswany, L. Beltrão-costa, Z. Zhang, D. S. Katz et al., A Workflow-Aware Storage System: An Opportunity Study CCGRID '12, Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp.326-334, 2012.

V. Valancius, C. Lumezanu, N. Feamster, R. Johari, and V. V. Vazirani, How many tiers?, ACM SIGCOMM Computer Communication Review, vol.41, issue.4, pp.194-205, 2011.
DOI : 10.1145/2043164.2018459

M. Luis, L. Vaquero, J. Rodero-merino, M. Caceres, and . Lindner, A Break in the Clouds: Towards a Cloud Definition, In: SIGCOMM Comput. Commun. Rev, vol.39, issue.1, pp.50-55, 2008.

V. Kumar-vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar et al., Apache Hadoop YARN, Proceedings of the 4th annual Symposium on Cloud Computing, SOCC '13, pp.1-5, 2013.
DOI : 10.1145/2523616.2523633

U. Verner, A. Schuster, M. Silberstein, and A. Mendelson, Scheduling processing of real-time data streams on heterogeneous multi-GPU systems, Proceedings of the 5th Annual International Systems and Storage Conference on, SYSTOR '12, pp.1-8, 2012.
DOI : 10.1145/2367589.2367596

K. Venkatesh, V. , and N. Nagappan, Characterizing Cloud Computing Hardware Reliability, Proceedings of the 1st ACM Symposium on Cloud Computing. SoCC '10, pp.193-204, 2010.

J. Vöckler, G. Juve, E. Deelman, M. Rynge, and B. Berriman, Experiences using cloud computing for a scientific workflow application, Proceedings of the 2nd international workshop on Scientific cloud computing, ScienceCloud '11, pp.15-24, 2011.
DOI : 10.1145/1996109.1996114

J. Wang, D. Crawl, and I. Altintas, Kepler + Hadoop, Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, WORKS '09, pp.1-12, 2009.
DOI : 10.1145/1645164.1645176

Z. Wang, X. Liu, Y. Liu, J. Liang, and V. Vinciotti, An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series, IEEE/ACM Trans. Comput. Biol. Bioinformatics, vol.63, pp.410-419, 2009.

T. White, Scalable Multi Swarm-Based Algorithms with Lagrangian Relaxation for Constrained Problems In: Hadoop: The definitive guide. O'Reilly Media, p.9780596521981, 2009.

G. Wills, V. Chang, and R. Walters, Business Integration As a Service, In: Int. J. Cloud Appl. Comput, vol.2, issue.1, pp.16-40, 2012.

E. Yildirim and T. Kosar, Network-aware end-to-end data throughput optimization, Proceedings of the first international workshop on Network-aware data management, NDM '11, pp.21-30, 2011.
DOI : 10.1145/2110217.2110221

M. Zaharia, T. Das, H. Li, T. Hunter, S. Shenker et al., Discretized streams, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.423-438
DOI : 10.1145/2517349.2522737

S. Zawoad, A. Kumar-dutta, and R. Hasan, SecLaaS, Proceedings of the 8th ACM SIGSAC symposium on Information, computer and communications security, ASIA CCS '13, pp.219-230, 2013.
DOI : 10.1145/2484313.2484342

X. Zhang and F. Xu, Survey of Research on Big Data Storage, 2013 12th International Symposium on Distributed Computing and Applications to Business, Engineering & Science, pp.76-80, 2013.
DOI : 10.1109/DCABES.2013.21

Y. Zhang, Q. Gao, L. Gao, and C. Wang, iMapReduce: A Distributed Computing Framework for Iterative Computation, Journal of Grid Computing, vol.10, issue.4, pp.47-68, 2012.
DOI : 10.1007/s10723-012-9204-9