Demonstration of the CloudMdsQL Multistore System. BDA'2016: Gestion de données-principes, technologies et applications, p.14, 2016. ,
Multistore Big Data Integration with CloudMdsQL. BDA'2016: Gestion de données-principes, technologies et applications, p.5, 2016. ,
CloudMdsQL: querying heterogeneous cloud data stores with a common language. Distributed and Parallel Databases, vol.34, pp.463-503, 2016. ,
Multistore Big Data Integration with CloudMdsQL, Transactions Large-Scale Dataand Knowledge-Centered Systems, vol.28, pp.48-74, 2016. ,
Design and Implementation of the CloudMdsQL Multistore System, International Conference on Cloud Computing and Services Science (CLOSER), 2016. ,
The CloudMdsQL Multistore System, ACM SIGMOD International Conference on Data Management, 2113 ? ?2116, 2016. ,
Integrating Big Data and Relational Data with a Functional SQL-like Query Language, Database and Expert Systems Applications (DEXA), 2015. ,
Query Processing in Cloud Multistore Systems. BDA'2015: Gestion de données-principes, technologies et applications, 2015. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01181253
, FLAT_MAP
, REDUCE( SUM ), MAP( TUPLE, 1 )
, FLAT_MAP( lambda data: product(data, CREATE NAMED EXPRESSION experts_alt(kw string, expert string)@hdfs = { * SCAN(TEXT, 'posts.txt
, The comparisons reveal several trends: the ability to integrate relational data (stored in RDBMS) with other kinds of data stores; the growing importance of accessing HDFS within Hadoop and the fact that most systems provide a relational/ SQL-like abstraction, ner (CloudMdsQL), schema management, and query processing
,
,
,
,
,
,
Hadoopdb: An architectural hybrid of mapreduce and DBMS technologies for analytical workloads, Proceedings of the VLDB Endowment (PVLDB), vol.2, pp.922-933, 2009. ,
Spark SQL: relational data processing in spark, ACM SIGMOD Int. Conf. on Management of Data, pp.1383-1394, 2015. ,
Eddies: Continuously adaptive query processing, ACM SIGMOD Int. Conf. on Management of Data, pp.261-272, 2000. ,
Funsql: it is time to make SQL functional, Int. Conf. on Extending Database Technology (EDBT, pp.41-46, 2012. ,
Integrating big data and relational data with a functional sql-like query language, Int. Conf. on Database and Expert Systems Applications (DEXA) (2015), pp.170-185 ,
URL : https://hal.archives-ouvertes.fr/lirmm-01181242
Multistore big data integration with cloudmdsql, Trans. on Large-Scale Data-and Knowledge-Centered Systems (TLDKS), vol.28, pp.48-74, 2016. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01345712
Query processing in multistore systems: an overview, Int. Journal of Cloud Computing (IJCC), vol.5, pp.309-346, 2016. ,
, , 2014.
Invisible glue: Scalable self-tuning multi-stores, Int. Conf. on Innovative Data Systems Research (CIDR), p.7, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01087624
S-store: A streaming newsql system for big velocity applications, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1633-1636, 2014. ,
Bigtable: A distributed storage system for structured data, ACM Trans. on Computer Systems, vol.26, pp.1-4, 2008. ,
A FAD for data intensive applications, IEEE Trans. Knowl. Data Eng, vol.4, pp.34-51, 1992. ,
Simplified data processing on large clusters, Symposium on Operating Systems Design and Implementation (OSDI), pp.137-150, 2004. ,
Dynamo: amazon's highly available key-value store, ACM Symposium on Operating Systems Principles (SOSP, pp.205-220, 2007. ,
Split query processing in polybase, ACM SIGMOD Int. Conf. on Management of Data, pp.1255-1266, 2013. ,
Principles of Data Integration, 2012. ,
The bigdawg polystore system, SIGMOD Record, vol.44, pp.11-16, 2015. ,
Iterative parallel data processing with stratosphere: an inside look, ACM SIGMOD Int. Conf. on Management of Data, pp.1053-1056, 2013. ,
FORWARD: data-centric uis using declarative templates that efficiently wrap third-party javascript components, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1649-1652, 2014. ,
Indexing HDFS data in PDW: splitting the data from the index, Proceedings of the VLDB Endowment (PVLDB), vol.7, pp.1520-1528, 2014. ,
The google file system, ACM Symposium on Operating Systems Principles (SOSP, pp.29-43, 2003. ,
Query rewrites with views for XML in DB2, Int. Conf. on Data Engineering, pp.1339-1350, 2009. ,
Maintaining views incrementally, ACM SIGMOD Int. Conf. on Management of Data, pp.157-166, 1993. ,
Optimizing queries across diverse data sources, Int. Conf. on Very Large Databases (VLDB, pp.276-285, 1997. ,
An evaluation of approaches to federated query processing over linked data, Int. Conf. on Semantic Systems, 2010. ,
, A multi-store system for evolutionary analytics. Proceedings of the VLDB Endowment (PVLDB), vol.6, pp.1180-1181, 2013.
Parallelizing FAD using compile-time analysis techniques, IEEE Data Engineering Bulletin, vol.12, pp.9-15, 1989. ,
The xtreemfs architecture-a case for 7. Bibliography object-based file systems in grids, Concurrency and Computation: Practice and Experience, vol.20, pp.2049-2060, 2008. ,
Design and implementation of the cloudmdsql multistore system, Int. Conf. on Cloud Computing and Services Science (CLOSER), pp.352-359, 2016. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01341172
The cloudmdsql multistore system, ACM SIGMOD Int. Conf. on Management of Data, pp.2113-2116, 2016. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01288025
Benchmarking polystores: The cloudmdsql experience, IEEE Int. Conf. on Big Data, pp.2574-2579, 2016. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01415582
Cloudmdsql: querying heterogeneous cloud data stores with a common language, Distributed and Parallel Databases, vol.34, pp.463-503, 2016. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01184016
MISO: souping up big data query processing with a multistore system, ACM SIGMOD Int. Conf. on Management of Data, pp.1591-1602, 2014. ,
Data integration: A theoretical perspective, ACM SIGMOD/PODS (Principles of Database Systems) Conf, pp.233-246, 2002. ,
Efficient support of xquery update facility in XML enabled RDBMS, Int. Conf. on Data Engineering (ICDE, pp.1394-1404, 2012. ,
LINQ: reconciling object, relations and XML in the .net framework, ACM SIGMOD Int. Conf. on Management of Data, p.706, 2006. ,
The SQL++ semi-structured data model and query language: A capabilities survey of sql-onhadoop, nosql and newsql databases, ACM Computing Research Repository, 2014. ,
Principles of Distributed Database Systems, 2011. ,
P2P Techniques for Decentralized Applications. Synthesis Lectures on Data Management, 2012. ,
URL : https://hal.archives-ouvertes.fr/lirmm-00748635
The Definitive Guide to MongoDB:The NoSQL Database for Cloud and Desktop Computing, 2010. ,
Data management in the cloud, Int. Conf. on Data Engineering (ICDE, p.5, 2009. ,
Trinity: a distributed graph engine on a memory cloud, ACM SIGMOD Int. Conf. on Management of Data, pp.505-516, 2013. ,
Qox-driven ETL design: reducing the cost of ETL consulting engagements, ACM SIGMOD Int. Conf. on Management of Data, pp.953-960, 2009. ,
Optimizing analytic data flows for multiple execution engines, ACM SIGMOD Int. Conf. on Management of Data, pp.829-840, 2012. ,
Operating system support for database management, Communications of the ACM, vol.24, pp.412-418, 1981. ,
Mapreduce and parallel dbmss: friends or foes? Communications of the ACM, vol.53, pp.64-71, 2010. ,
The design and implementation of ingres, ACM Trans. on Database Systems, vol.1, pp.198-222, 1976. ,
Scaling access to heterogeneous data sources with DISCO, IEEE Trans. Knowl. Data Eng, vol.10, pp.808-823, 1998. ,
Functional SOL (fsol), an SQL upwardcompatible database programming language, Information Sciences, vol.62, issue.3, pp.183-203, 1992. ,
Ceph: A scalable, high-performance distributed file system, Symposium on Operating Systems Design and Implementation (OSDI, pp.307-320, 2006. ,
Hadoop-The Definitive Guide: Storage and Analysis at Internet Scale, 2012. ,
Mediators in the architecture of future information systems, IEEE Computer, vol.25, pp.38-49, 1992. ,
Relational languages for metadata integration, ACM Trans. on Database Systems, vol.30, pp.624-660, 2005. ,
Joins for hybrid warehouses: Exploiting massive parallelism and enterprise data warehouses, Int. Conf. on Extending Database Technology (EDBT) (2015), pp.373-384 ,
Spark: Cluster computing with working sets, USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) (2010), pp.10-10 ,
Querying combined cloud-based and relational databases, Int. Conf. on Cloud and Service Computing (CSC), pp.330-335, 2011. ,
A query sampling method of estimating local cost parameters in a multidatabase system, Int. Conf. on Data Engineering (ICDE), pp.144-153, 1994. ,
Global query processing and optimization in the cords multidatabase system, Int. Conf. on Parallel and Distributed Computing Systems, pp.640-647, 1996. ,
Developing cost models with qualitative variables for dynamic multidatabase environments, Int. Conf. on Data Engineering (ICDE, pp.413-424, 2000. ,