C. ?-boyan-kolev, P. Bondiombouy, R. Valduriez, R. Jiménez-peris, J. Pau et al., Demonstration of the CloudMdsQL Multistore System. BDA'2016: Gestion de données-principes, technologies et applications, p.14, 2016.

B. Carlyna-bondiombouy, P. Kolev, O. Valduriez, and . Levchenko, Multistore Big Data Integration with CloudMdsQL. BDA'2016: Gestion de données-principes, technologies et applications, p.5, 2016.

P. ?-boyan-kolev, C. Valduriez, R. Bondiombouy, R. Jiménez-peris, J. Pau et al., CloudMdsQL: querying heterogeneous cloud data stores with a common language. Distributed and Parallel Databases, vol.34, pp.463-503, 2016.

B. Carlyna-bondiombouy, O. Kolev, P. Levchenko, and . Valduriez, Multistore Big Data Integration with CloudMdsQL, Transactions Large-Scale Dataand Knowledge-Centered Systems, vol.28, pp.48-74, 2016.

C. ?-boyan-kolev, O. Bondiombouy, P. Levchenko, R. Valduriez, R. Jiménez-peris et al., Design and Implementation of the CloudMdsQL Multistore System, International Conference on Cloud Computing and Services Science (CLOSER), 2016.

C. ?-boyan-kolev, P. Bondiombouy, R. Valduriez, R. Jiménez-peris, J. Pau et al., The CloudMdsQL Multistore System, ACM SIGMOD International Conference on Data Management, 2113 ? ?2116, 2016.

B. Carlyna-bondiombouy, O. Kolev, P. Levchenko, and . Valduriez, Integrating Big Data and Relational Data with a Functional SQL-like Query Language, Database and Expert Systems Applications (DEXA), 2015.

?. Bondiombouy, Query Processing in Cloud Multistore Systems. BDA'2015: Gestion de données-principes, technologies et applications, 2015.
URL : https://hal.archives-ouvertes.fr/lirmm-01181253

, FLAT_MAP

, REDUCE( SUM ), MAP( TUPLE, 1 )

, FLAT_MAP( lambda data: product(data, CREATE NAMED EXPRESSION experts_alt(kw string, expert string)@hdfs = { * SCAN(TEXT, 'posts.txt

, The comparisons reveal several trends: the ability to integrate relational data (stored in RDBMS) with other kinds of data stores; the growing importance of accessing HDFS within Hadoop and the fact that most systems provide a relational/ SQL-like abstraction, ner (CloudMdsQL), schema management, and query processing

. Graphbase,

. Grid5000,

. Infinitegraph,

J. ,

. Sparksee,

. Titan,

A. , A. Bajda-pawlikowski, K. Abadi, D. Rasin, A. And et al., Hadoopdb: An architectural hybrid of mapreduce and DBMS technologies for analytical workloads, Proceedings of the VLDB Endowment (PVLDB), vol.2, pp.922-933, 2009.

A. , M. Xin, R. Lian, C. Huai, Y. Liu et al., Spark SQL: relational data processing in spark, ACM SIGMOD Int. Conf. on Management of Data, pp.1383-1394, 2015.

A. , R. And, H. , and J. , Eddies: Continuously adaptive query processing, ACM SIGMOD Int. Conf. on Management of Data, pp.261-272, 2000.

B. , C. Rehrmann, R. Faerber, F. And, R. et al., Funsql: it is time to make SQL functional, Int. Conf. on Extending Database Technology (EDBT, pp.41-46, 2012.

B. , C. Kolev, B. Levchenko, O. And, V. et al., Integrating big data and relational data with a functional sql-like query language, Int. Conf. on Database and Expert Systems Applications (DEXA) (2015), pp.170-185
URL : https://hal.archives-ouvertes.fr/lirmm-01181242

B. , C. Kolev, B. Levchenko, O. And, V. et al., Multistore big data integration with cloudmdsql, Trans. on Large-Scale Data-and Knowledge-Centered Systems (TLDKS), vol.28, pp.48-74, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01345712

B. , C. And, V. , and P. , Query processing in multistore systems: an overview, Int. Journal of Cloud Computing (IJCC), vol.5, pp.309-346, 2016.

B. and R. V. Learning-neo4j, , 2014.

B. , F. Bursztyn, D. , A. Ileana, I. And et al., Invisible glue: Scalable self-tuning multi-stores, Int. Conf. on Innovative Data Systems Research (CIDR), p.7, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01087624

Ç. Et, A. L. , and U. , S-store: A streaming newsql system for big velocity applications, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1633-1636, 2014.

C. , F. Dean, J. Ghemawat, S. Hsieh, W. Wallach et al., Bigtable: A distributed storage system for structured data, ACM Trans. on Computer Systems, vol.26, pp.1-4, 2008.

D. , S. And, V. , and P. , A FAD for data intensive applications, IEEE Trans. Knowl. Data Eng, vol.4, pp.34-51, 1992.

D. , J. And, G. , and S. Mapreduce, Simplified data processing on large clusters, Symposium on Operating Systems Design and Implementation (OSDI), pp.137-150, 2004.

D. , G. Hastorun, D. Jampani, M. Kakulapati, G. Lakshman et al., Dynamo: amazon's highly available key-value store, ACM Symposium on Operating Systems Principles (SOSP, pp.205-220, 2007.

D. , D. Halverson, A. Nehme, R. Shankar, S. Aguilar-saborit et al., Split query processing in polybase, ACM SIGMOD Int. Conf. on Management of Data, pp.1255-1266, 2013.

D. , A. Halevy, A. Y. And-ives, and Z. G. , Principles of Data Integration, 2012.

D. , J. Elmore, A. Stonebraker, M. Balazinska, M. Howe et al., The bigdawg polystore system, SIGMOD Record, vol.44, pp.11-16, 2015.

E. , S. Schelter, S. Tzoumas, K. Warneke, D. And et al., Iterative parallel data processing with stratosphere: an inside look, ACM SIGMOD Int. Conf. on Management of Data, pp.1053-1056, 2013.

F. U. , Y. Ong, K. W. Papakonstantinou, Y. And, Z. et al., FORWARD: data-centric uis using declarative templates that efficiently wrap third-party javascript components, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1649-1652, 2014.

G. , V. Teletia, N. Patel, J. Halverson, A. And et al., Indexing HDFS data in PDW: splitting the data from the index, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1520-1528, 2014.

G. , S. Gobioff, H. And, L. , and S. , The google file system, ACM Symposium on Operating Systems Principles (SOSP, pp.29-43, 2003.

G. , P. Gryz, J. Hoppe, A. Ma, W. And et al., Query rewrites with views for XML in DB2, Int. Conf. on Data Engineering, pp.1339-1350, 2009.

G. , A. Mumick, I. S. And, S. , and V. S. , Maintaining views incrementally, ACM SIGMOD Int. Conf. on Management of Data, pp.157-166, 1993.

H. , L. Kossmann, D. Wimmers, E. And, Y. et al., Optimizing queries across diverse data sources, Int. Conf. on Very Large Databases (VLDB, pp.276-285, 1997.

H. , P. Mathäss, T. And, Z. , and M. , An evaluation of approaches to federated query processing over linked data, Int. Conf. on Semantic Systems, 2010.

H. , H. Sankaranarayanan, J. Tatemura, J. Lefevre, J. And et al., A multi-store system for evolutionary analytics. Proceedings of the VLDB Endowment (PVLDB), vol.6, pp.1180-1181, 2013.

H. , B. E. Valduriez, P. And, D. , and S. , Parallelizing FAD using compile-time analysis techniques, IEEE Data Engineering Bulletin, vol.12, pp.9-15, 1989.

H. , F. Cortes, T. Kolbeck, B. Stender, J. Focht et al., The xtreemfs architecture-a case for 7. Bibliography object-based file systems in grids, Concurrency and Computation: Practice and Experience, vol.20, pp.2049-2060, 2008.

K. , B. Bondiombouy, C. Levchenko, O. Valduriez, P. Jiménezperis et al., Design and implementation of the cloudmdsql multistore system, Int. Conf. on Cloud Computing and Services Science (CLOSER), pp.352-359, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01341172

K. , B. Bondiombouy, C. Valduriez, P. Jiménez-peris, R. Pau et al., The cloudmdsql multistore system, ACM SIGMOD Int. Conf. on Management of Data, pp.2113-2116, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01288025

K. , B. Pau, R. Levchenko, O. Valduriez, P. Jiménez-peris et al., Benchmarking polystores: The cloudmdsql experience, IEEE Int. Conf. on Big Data, pp.2574-2579, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01415582

K. , B. Valduriez, P. Bondiombouy, C. Jiménez-peris, R. Pau et al., Cloudmdsql: querying heterogeneous cloud data stores with a common language, Distributed and Parallel Databases, vol.34, pp.463-503, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01184016

L. , J. Sankaranarayanan, J. Hacigümüs, H. Tatemura, J. Polyzotis et al., MISO: souping up big data query processing with a multistore system, ACM SIGMOD Int. Conf. on Management of Data, pp.1591-1602, 2014.

L. and M. , Data integration: A theoretical perspective, ACM SIGMOD/PODS (Principles of Database Systems) Conf, pp.233-246, 2002.

L. , Z. H. , C. , H. J. And, S. et al., Efficient support of xquery update facility in XML enabled RDBMS, Int. Conf. on Data Engineering (ICDE, pp.1394-1404, 2012.

M. , E. Beckman, B. And, and G. M. Bierman, LINQ: reconciling object, relations and XML in the .net framework, ACM SIGMOD Int. Conf. on Management of Data, p.706, 2006.

K. W. Ong, Y. Papakonstantinou, V. And, and R. , The SQL++ semi-structured data model and query language: A capabilities survey of sql-onhadoop, nosql and newsql databases, ACM Computing Research Repository, 2014.

Ö. , M. T. And, V. , and P. , Principles of Distributed Database Systems, 2011.

P. , E. Akbarinia, R. And, D. , and M. E. , P2P Techniques for Decentralized Applications. Synthesis Lectures on Data Management, 2012.
URL : https://hal.archives-ouvertes.fr/lirmm-00748635

P. , E. Hawkins, T. And, M. , and P. , The Definitive Guide to MongoDB:The NoSQL Database for Cloud and Desktop Computing, 2010.

R. and R. , Data management in the cloud, Int. Conf. on Data Engineering (ICDE, p.5, 2009.

B. Shao, H. Wang, L. I. And, and Y. , Trinity: a distributed graph engine on a memory cloud, ACM SIGMOD Int. Conf. on Management of Data, pp.505-516, 2013.

S. , A. Wilkinson, K. Castellanos, M. And, D. et al., Qox-driven ETL design: reducing the cost of ETL consulting engagements, ACM SIGMOD Int. Conf. on Management of Data, pp.953-960, 2009.

S. , A. Wilkinson, K. Castellanos, M. And, D. et al., Optimizing analytic data flows for multiple execution engines, ACM SIGMOD Int. Conf. on Management of Data, pp.829-840, 2012.

S. and M. , Operating system support for database management, Communications of the ACM, vol.24, pp.412-418, 1981.

S. , M. Abadi, D. Dewitt, D. Madden, S. Paulson et al., Mapreduce and parallel dbmss: friends or foes? Communications of the ACM, vol.53, pp.64-71, 2010.

S. , M. Wong, E. Kreps, P. And-held, and G. , The design and implementation of ingres, ACM Trans. on Database Systems, vol.1, pp.198-222, 1976.

T. , A. Raschid, L. And, V. , and P. , Scaling access to heterogeneous data sources with DISCO, IEEE Trans. Knowl. Data Eng, vol.10, pp.808-823, 1998.

V. , P. And, D. , and S. , Functional SOL (fsol), an SQL upwardcompatible database programming language, Information Sciences, vol.62, issue.3, pp.183-203, 1992.

W. , S. Brandt, S. Miller, E. Long, D. And et al., Ceph: A scalable, high-performance distributed file system, Symposium on Operating Systems Design and Implementation (OSDI, pp.307-320, 2006.

W. and T. , Hadoop-The Definitive Guide: Storage and Analysis at Internet Scale, 2012.

W. and G. , Mediators in the architecture of future information systems, IEEE Computer, vol.25, pp.38-49, 1992.

W. , C. M. And-robertson, and E. L. , Relational languages for metadata integration, ACM Trans. on Database Systems, vol.30, pp.624-660, 2005.

Y. , T. Zou, T. Ozcan, F. Gonscalves, R. And et al., Joins for hybrid warehouses: Exploiting massive parallelism and enterprise data warehouses, Int. Conf. on Extending Database Technology (EDBT) (2015), pp.373-384

Z. , M. Chowdhury, M. Franklin, M. Shenker, S. And et al., Spark: Cluster computing with working sets, USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) (2010), pp.10-10

Z. , M. And, R. , and T. , Querying combined cloud-based and relational databases, Int. Conf. on Cloud and Service Computing (CSC), pp.330-335, 2011.

Z. , Q. And, L. , and P. , A query sampling method of estimating local cost parameters in a multidatabase system, Int. Conf. on Data Engineering (ICDE), pp.144-153, 1994.

Z. , Q. And, L. , and P. , Global query processing and optimization in the cords multidatabase system, Int. Conf. on Parallel and Distributed Computing Systems, pp.640-647, 1996.

Z. , Q. Sun, Y. And, M. , and S. , Developing cost models with qualitative variables for dynamic multidatabase environments, Int. Conf. on Data Engineering (ICDE, pp.413-424, 2000.