,
,
,
, , vol.145
,
156 9.4.1 Jeux de données, requêtes et configuration, vol.156 ,
,
, Ces données stockées demeurent indispensables dans plusieurs domaines d'application tels que l'e-santé, la biologie, l'étude de la biodiversité, etc., où le croisement de données statiques et dynamique est requis. Pour la gestion combinée de données RDF statiques et dynamiques, plusieurs systèmes RSP tels que C-SPARQL, CQELS et CQELS Cloud prennent en charge le croisement (jointure) entre données RDF statiques et dynamiques. Ces systèmes réalisent la jointure 9.4 Évaluations Cette section évalue le système proposé à partir de la section 9, Les données RDF stockées sont très importantes dans le contexte des flux de données RDF. Ces données concernent les ontologies de domaine associées à un domaine d'application des flux de données, des données RDF résumées et/ou historiées des données RDF à fréquence de rafraîchissement très lent (hebdomadaire, mensuel, semestriel, trimestriel
, Nous utilisons trois ensembles de données du monde réel utilisés dans les benchmarks SRBench, vol.4
, Ces jeux de données contiennent (i ) des données RDF en continu collectées depuis les stations météorologiques des US (LinkedSensorData 5 ), (i i ) des données RDF stockées décrivant l'emplacement des stations (GeoNames 6 ) et (i i i ) les localisations de GeoNames décrites dans un autre jeu de données RDF stocké
, La Figure 9.7 présente un aperçu des flux de données utilisés et des jeux de données stockées. En tant que flux de fichiers RDF, nous utilisons un ensemble de données Linked-5
165 10.1.2 2 ème contribution : résumé orienté graphe de flux de données RDF 166 10.1.3 3 ème contribution : interrogation de flux de données RDF compressées au format RDSZ ,
168 10.2.1 Exploration et élagage de graphes RDF ,
Enabling ontology-based access to streaming data sources, The Semantic Web-ISWC 2010, pp.96-111, 2010. ,
Mining frequent patterns in data streams at multiple time granularities, vol.212, pp.191-212, 2003. ,
Introducing rdf graph summary with application to assisted sparql formulation, 23rd International Workshop on Database and Expert Sytems Applications, 2012. ,
Srbench : a streaming rdf/sparql benchmark, International Semantic Web Conference, pp.641-657, 2012. ,
, Chiffres internet -2017
Models and issues in data stream systems, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp.1-16, 2002. ,
Issues in data stream management, ACM Sigmod Record, vol.32, issue.2, pp.5-14, 2003. ,
Condensative stream query language for data streams, Proceedings of the eighteenth conference on Australasian database, vol.63, pp.113-122, 2007. ,
Window query processing for joining data streams with relations, Proceedings of the 2007 conference of the center for advanced studies on Collaborative research. IBM Corp, pp.188-202, 2007. ,
Stream : the stanford stream data manager (demonstration description), Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp.665-665, 2003. ,
Telegraphcq : An architectural status report, IEEE Data Eng. Bull, vol.26, issue.1, pp.11-18, 2003. ,
Aurora : a new model and architecture for data stream management, The VLDB Journal-The International Journal on Very Large Data Bases, vol.12, issue.2, pp.120-139, 2003. ,
Monitoring streams : a new class of data management applications, Proceedings of the 28th international conference on Very Large Data Bases. VLDB Endowment, pp.215-226, 2002. ,
Cql : A language for continuous queries over streams and relations, Database Programming Languages, pp.1-19, 2004. ,
Streaming sparql-extending sparql to process data streams, The Semantic Web : Research and Applications, pp.448-462, 2008. ,
C-sparql : Sparql for continuous querying, 2009. ,
Ep-sparql : a unified language for event processing and stream reasoning, Proceedings of the 20th international conference on World wide web, pp.635-644, 2011. ,
A native and adaptive approach for unified processing of linked streams and linked data, The Semantic Web-ISWC, pp.370-388, 2011. ,
Sparkwave : continuous schema-enhanced pattern matching over rdf data streams, Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems, pp.58-68, 2012. ,
Elastic and scalable processing of linked stream data in the cloud, International Semantic Web Conference, pp.280-297, 2013. ,
High-performance distributed stream reasoning using s4, Ordring Workshop at ISWC, 2011. ,
Strider : A hybrid adaptive distributed rdf stream processing engine, International Semantic Web Conference, pp.559-576, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01740499
Dionysus : Towards query-aware distributed processing of rdf graph streams, EDBT/ICDT Workshops. Citeseer, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01288723
Scalable semantic web data management using vertical partitioning, Proceedings of the 33rd international conference on Very large data bases. VLDB Endowment, pp.411-422, 2007. ,
Yars2 : A federated repository for querying graph structured data from the web, The Semantic Web, pp.211-224, 2007. ,
Virtuoso : Rdf support in a native rdbms, Semantic Web Information Management, pp.501-519, 2010. ,
High-performance, massively scalable distributed systems using the mapreduce software framework : the shard triple-store," in Programming support innovations for emerging distributed applications, p.4, 2010. ,
Rdfpeers : a scalable distributed rdf repository based on a structured peer-to-peer network, Proceedings of the 13th international conference on World Wide Web, pp.650-657, 2004. ,
Atlas : Storing, updating and querying rdf (s) data on top of dhts, Web Semantics : Science, Services and Agents on the World Wide Web, vol.8, issue.4, pp.271-277, 2010. ,
Storage and retrieval of large rdf graph using hadoop and mapreduce, CloudCom, vol.9, pp.680-686, 2009. ,
H 2 rdf+ : High-performance distributed joins over large-scale rdf graphs, Big Data, 2013 IEEE International Conference on, pp.255-263, 2013. ,
Adaptive partitioning for very large rdf data, 2015. ,
Triad : a distributed sharednothing rdf engine based on asynchronous message passing, Proceedings of the 2014 ACM SIGMOD international conference on Management of data, pp.289-300, 2014. ,
Scalable sparql querying of large rdf graphs, Proceedings of the VLDB Endowment, vol.4, pp.1123-1134, 2011. ,
Scaling queries over big rdf graphs with semantic hash partitioning, Proceedings of the VLDB Endowment, vol.6, pp.1894-1905, 2013. ,
Scalable sparql querying using path partitioning, IEEE 31st International Conference on, pp.795-806, 2015. ,
Towards effective partition management for large graphs, Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp.517-528, 2012. ,
A distributed graph engine for web scale rdf data, Proceedings of the VLDB Endowment, vol.6, pp.265-276, 2013. ,
A graph partitioning approach to distributed rdf stores, Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on, pp.411-418, 2012. ,
Rdsz : an approach for lossless rdf stream compression, European Semantic Web Conference, pp.52-67, 2014. ,
Efficient rdf interchange (eri) format for rdf data streams, International Semantic Web Conference, pp.244-259, 2014. ,
On the dependancies of queries execution time and memory consumption in c-sparql, Proceedings of the IADIS International Conference Applied Computing. AC 2015 Proceedings, 2015. ,
C-sparql extension for sampling rdf graphs streams, Advances in Knowledge Discovery and Management, pp.23-40, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01663811
Drss : Distributed rdf sparql streaming, International Conference on Software Engineering Research, Management and Applications, pp.125-145, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01558287
An efficient approach for real-time processing of rdsz-based compressed rdf streams, International Conference on Software Engineering Research, Management and Applications, pp.147-166, 2017. ,
Graph-oriented summary for optimized resource description framework graphs streams processing, vol.12, p.2703, 2018. ,
Fast sparql join processing between distributed streams and stored rdf graphs using bloom filters, 2018 12th International Conference on Research Challenges in Information Science (RCIS), pp.1-12, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01913389
Extension de c-sparql pour l'échantillonnage de flux de graphes rdf, Revue des Nouvelles Technologies de l'Information, 2015. ,
Weaving the Web, Ed. HarperCollins Publishers, 1999. ,
The semantic web, Scientific American, vol.284, issue.5, pp.28-37, 2001. ,
, , 2009.
Groupe de travail Réseau, Request for Comments : 3987, Catégorie : Standards Track, 2005. ,
An evaluation of triplestore technologies for large data stores ,
SPARQL/Update : A language for updating RDF graphs, HP Laboratories Bristol, 2007. ,
SPARQL Query Language for RDF ,
, W3C® (MIT, ERCIM, Keio, Beihang), 2013.
Semantics and complexity of sparql, ACM Transactions on Database Systems (TODS), vol.34, issue.3, p.16, 2009. ,
Data stream query processing, In ICDE, vol.5, p.1145, 2005. ,
Gigascope : a stream database for network applications, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp.647-651, 2003. ,
Niagaracq : A scalable continuous query system for internet databases, ACM SIGMOD Record, vol.29, pp.379-390, 2000. ,
Towards sensor database systems, International Conference on mobile Data management, pp.3-14, 2001. ,
An integration framework for sensor networks and data stream management systems, Proceedings of the Thirtieth international conference on Very large data bases, vol.30, pp.1361-1364, 2004. ,
Querying rdf streams with c-sparql, ACM SIGMOD Record, vol.39, issue.1, pp.20-26, 2010. ,
An execution environment for c-sparql queries, Proceedings of the 13th International Conference on Extending Database Technology, pp.441-452, 2010. ,
C-sparql : a continuous query language for rdf data streams, International Journal of Semantic Computing, vol.4, issue.01, pp.3-25, 2010. ,
Continuous queries and real-time analysis of social semantic data with c-sparql, Proceedings of Social Data on the Web Workshop at the 8th International Semantic Web Conference, 2010. ,
A semantics for a query language over sensors, streams and relations, Sharing Data, Information and Knowledge, pp.87-99, 2008. ,
Stream reasoning and complex event processing in etalis, 2012. ,
, Real-time complex event recognition and reasoning-a logic programming approach, Applied Artificial Intelligence, vol.26, issue.1-2, pp.6-57, 2012.
Linked stream data processing engines : Facts and figures, The Semantic Web-ISWC 2012, pp.300-312, 2012. ,
A fast algorithm for the many pattern/many object pattern matching problem, Artificial Intelligence, vol.19, pp.17-37, 1982. ,
Sparql-st : Extending sparql to support spatiotemporal queries," in Geospatial semantics and the semantic web, pp.61-86, 2011. ,
Modeling and querying metadata in the semantic sensor web : The model strdf and the query language stsparql, Extended Semantic Web Conference, pp.425-439, 2010. ,
Semantic management of streaming data, Proceedings of the 2nd International Conference on Semantic Sensor Networks-Volume, vol.522, pp.80-95, 2009. ,
Applied temporal rdf : Efficient temporal querying of rdf data with sparql, pp.308-322, 2009. ,
T-sparql : A tsql2-like temporal query language for rdf, ADBIS (Local Proceedings). Citeseer, pp.21-30, 2010. ,
Representing and querying validity time in rdf and owl : A logic-based approach, Web Semantics : Science, Services and Agents on the World Wide Web, vol.12, pp.3-21, 2012. ,
On correctness in rdf stream processor benchmarking, The Semantic Web-ISWC 2013, pp.326-342, 2013. ,
Résumé généraliste de flux de données, 2007. ,
Load shedding for aggregation queries over data streams, Proceedings. 20th International Conference on, pp.350-361, 2004. ,
Load shedding in a data stream manager, Proceedings of the 29th international conference on Very large data bases, vol.29, pp.309-320, 2003. ,
Revue des Nouvelles Technologies de l'Information, Extraction et Gestion des Connaissances, pp. RNTI-E, vol.19, pp.247-254, 2010. ,
Random sampling with a reservoir, ACM Transactions on Mathematical Software (TOMS), issue.1, pp.37-57, 1985. ,
, Faster methods for random sampling, Communications of the ACM, issue.7, pp.703-718, 1984.
On biased reservoir sampling in the presence of stream evolution, Proceedings of the 32nd international conference on Very large data bases, pp.607-618, 2006. ,
Sampling from a moving window over streaming data, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, pp.633-634, 2002. ,
Sampling time-based sliding windows in bounded space, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp.379-392, 2008. ,
A dip in the reservoir : Maintaining sample synopses of evolving datasets, Proceedings of the 32nd international conference on Very large data bases, pp.595-606, 2006. ,
On random sampling over joins, ACM SIGMOD Record, vol.28, pp.263-274, 1999. ,
Approximate join processing over data streams, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp.40-51, 2003. ,
Classification as a Tool for Research, pp.307-314, 2010. ,
Ripple joins for online aggregation, ACM SIGMOD Record, vol.28, pp.287-298, 1999. ,
How to summarize the universe : Dynamic maintenance of quantiles, Proceedings of the 28th international conference on Very Large Data Bases, pp.454-465, 2002. ,
Space-efficient online computation of quantile summaries, ACM SIGMOD Record, vol.30, pp.58-66, 2001. ,
Birch : an efficient data clustering method for very large databases, ACM SIGMOD Record, vol.25, pp.103-114, 1996. ,
A framework for clustering evolving data streams, Proceedings of the 29th international conference on Very large data bases, vol.29, pp.81-92, 2003. ,
Improved histograms for selectivity estimation of range predicates, ACM Sigmod Record, vol.25, pp.294-305, 1996. ,
Approximate query answering using histograms, IEEE Data Eng. Bull, vol.22, issue.4, pp.5-14, 1999. ,
Data-streams and histograms, Proceedings of the thirty-third annual ACM symposium on Theory of computing, pp.471-475, 2001. ,
Computing iceberg queries efficiently, Internaational Conference on Very Large Databases (VLDB'98), 1999. ,
Approximate frequency counts over data streams, Proceedings of the 28th international conference on Very Large Data Bases, pp.346-357, 2002. ,
Probabilistic counting algorithms for data base applications, Journal of computer and system sciences, issue.2, pp.182-209, 1985. ,
URL : https://hal.archives-ouvertes.fr/inria-00076244
Space/time trade-offs in hash coding with allowable errors, Communications of the ACM, pp.422-426, 1970. ,
Scalable bloom filters, Information Processing Letters, issue.6, pp.255-261, 2007. ,
Summary cache : a scalable widearea web cache sharing protocol, IEEE/ACM Transactions on Networking (TON), issue.3, pp.281-293, 2000. ,
Multi-dimensional analysis of data streams using stream cubes, Data Streams, pp.103-125, 2007. ,
Multidimensional regression analysis of time-series data streams, Proceedings of the 28th international conference on Very Large Data Bases, pp.323-334, 2002. ,
Datastream clustering over tilted windows through sampling, p.127, 2006. ,
Le fia : un nouvel automate permettant l'extraction efficace d'itemsets fréquents dans les flots de données, pp.157-168, 2008. ,
Spams, une nouvelle approche incrémentale pour l'extraction de motifs séquentiels fréquents dans les data streams, pp.205-216, 2009. ,
Summarizing linked data rdf graphs using approximate graph pattern mining, Proc. 19th International Conference on Extending Database Technology (EDBT), 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01416814
Using graph summarization for join-ahead pruning in a distributed rdf engine, 2014. ,
Automatic weight generation and class predicate stability in rdf summary graphs ,
Query-oriented summarization of rdf graphs, INRIA Saclay, 2017. ,
Dynamic resource provisioning for data streaming applications in a cloud environment, Cloud Computing Technology and Science (CloudCom), pp.441-448, 2010. ,
Dynamic control of data streaming and processing in a virtualized environment, IEEE Transactions on Automation Science and Engineering, vol.9, issue.2, pp.365-376, 2012. ,
Échantillonnage de flux de données sémantiques : Une approche orientée graphe, EGC, pp.485-486, 2015. ,
Sampling techniques, 2007. ,
Random sampling with a reservoir, ACM Transactions on Mathematical Software (TOMS), vol.11, issue.1, pp.37-57, 1985. ,
Sampling from a moving window over streaming data, Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, pp.633-634, 2002. ,
Comprendre les connexions sociales dans les communautés : comment utiliser l'analyse des réseaux sociaux ? guide pratique (traduit par e. lucia). alliance de recherche universités-communautés-défis des communautés côtières, 2014. ,
Social network analysis of study environment, JIOS, vol.34, 2010. ,
Contributions à l'étude des réseaux sociaux : propagation, fouille, collecte de données, 2012. ,
Les mesures locales d'un réseau, 2010. ,
, Les mesures globales d'un réseau, 2010.
Rdf compression : basic approaches, Proceedings of the 19th international conference on World wide web, pp.1091-1092, 2010. ,
Logical linked data compression, Extended Semantic Web Conference, pp.170-184, 2013. ,
Scalable rdf data compression with mapreduce, Concurrency and Computation : Practice and Experience, vol.25, issue.1, pp.24-39, 2013. ,
Compressed k2-triples for full-in-memory rdf engines, 2011. ,
Binary rdf representation for publication and exchange (hdt), Web Semantics : Science, Services and Agents on the World Wide Web, vol.19, pp.22-41, 2013. ,
Rdsz : an approach for lossless rdf stream compression, European Semantic Web Conference, pp.52-67, 2014. ,
Efficient rdf interchange (eri) format for rdf data streams, International Semantic Web Conference, pp.244-259, 2014. ,
, , 2013.
What is zookeeper, 2014. ,
, Apache storm
S4 : Distributed stream computing platform, 2010 IEEE International Conference on Data Mining Workshops, pp.170-177, 2010. ,
Redis in action, 2013. ,
Lubm : A benchmark for owl knowledge base systems, Web Semantics : Science, Services and Agents on the World Wide Web, vol.3, issue.2, pp.158-182, 2005. ,
Space/time trade-offs in hash coding with allowable errors, Communications of the ACM, vol.13, issue.7, pp.422-426, 1970. ,
A finegrained evaluation of sparql endpoint federation systems, Semantic Web, vol.7, issue.5, pp.493-518, 2016. ,
Approximate continuous query answering over streams and dynamic linked data sets, International Conference on Web Engineering, pp.307-325, 2015. ,
Proactive replication of dynamic linked data for scalable rdf stream processing ,
Citybench : A configurable benchmark to evaluate rsp engines using smart city datasets, International Semantic Web Conference, pp.374-389, 2015. ,
Network applications of bloom filters : A survey, Internet mathematics, vol.1, issue.4, pp.485-509, 2004. ,
Scalable bloom filters, Information Processing Letters, vol.101, issue.6, pp.255-261, 2007. ,
, The gbif integrated publishing toolkit : facilitating the efficient publishing of biodiversity data on the internet, vol.9, p.102623, 2014.
Sharing and accessing biodiversity data globally through gbif, ESRI User Conf. Citeseer, 2005. ,
Patorc : Pattern oriented compression for semantic data streams, OTM Confederated International Conferences" On the Move to Meaningful Internet Systems, pp.193-209, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01433855
, Requête Q4 Query q4 AS SELECT ?sensor ?, value FROM STREAM <ht t p : //ex.sh >
, WHERE { ?eventID ssn :hasValue observation ; ssn :isProducedBy ?sensor
, SensorOutput. ?observation qudt :numValue ?value ; qudt :unit
, A.5 Requête Q5 QUERY q5 AS SELECT ?aSub ?anObj (count( distinct ?subject) as ?count)
,
, WHERE { ?subject weather :type ?object . ?aSub sens-obs :aProp ?anObj
, sens-obs :anotherProp ?anotherObj
, { select ?name (count( distinct ?object) as ?count2)
,
, /r eposi t or y.or g /bi bl i o.r d f > WHERE { ?object weather :name ?name
, aProp ?aName .} GROUP BY ?name }} GROUP by ?aSub ?anObj }
,
,
, FROM REPOSITORY <LSM> WHERE { ?sens om-owl :processLocation ?sensLocation ; om-owl :generatedObs ?obs . ?sensLocation wgs84_pos :alt "%Altitude%"??xsd :float ; wgs84_pos :lat "%Latitude%"??xsd :float ; wgs84_pos :long "%Longitude%"??xsd :float. ?obs om-owl :observedPrty weather :_AirTemp ; om-owl :result [om-owl :floatValue ?temp].} GROUP BY ?sens B.2 Requête Q10 SELECT DISTINCT ?lat ?, SELECT (MIN( ?temperature) AS ?minTemperature) (MAX( ?temperature) AS ?maxTemperature) FROM STREAM <LObD>
, ?sensor om-owl :processLocation ?sensorLocation . ?sensorLocation wgs84_pos :alt ?alt
, Requête Q11 SELECT DISTINCT ?sensor FROM STREAM <LObD> [RANGE 60s TUMBLING] FROM REPOSITORY <LSM> WHERE { ?sensor om-owl :generatedObservation ?observation
, om-owl :hasLocatedNearRel [om-owl :hasLocation ?nearbyLocation
, ?observation a ?observationType
, om-owl :observedProperty ?observationProperty
, om-owl :result
, SELECT AVG( ?value2) AS ?avgValue WHERE { ?sensor2 om-owl :generatedObservation ?observation2
, FILTER ( sameTerm( ?nearbyLocation, ?nearbyLocation2)) ?observation2 a ?observationType
, om-owl :observedProperty ?observationProperty
, FILTER ( ABS( ?value -?avgValue) / ?avgValue> "0.10"??xsd :float)
, FROM REPOSITORY <LSM> FROM REPOSITORY <GeoNames> WHERE { ?sensor om-owl :generatedObservation ?temperatureObservation ; om-owl :generatedObservation ?humidityObservation, Requête Q12 SELECT ?name (AVG( ?temperature) AS ?avgTemperature) (AVG( ?humidity) AS ?avgHumidity) FROM STREAM <LObD>
, om-owl :hasLocatedNearRel [om-owl :hasLocation ?nearbyLocation
, ?temperatureObservation om-owl :observedProperty weather :_AirTemperature ; om-owl :result
, ?humidityObservation om-owl :observedProperty weather :_RelativeHumidity ; om-owl :result
, i"))} } UNION { SELECT ?name WHERE ?nearbyLocation gn :parentFeature+ ?parentFeature. ?parentFeature gn :featureClass ?parentClass ; gn :name | gn :officialName ?name, { SELECT ?name WHERE { ?nearbyLocation gn :featureClass ?featureClass ; gn :name | gn :officialName ?name ; gn :population ?population. FILTER ( ?population > 15000 && REGEX( ?featureClass, "P
, FROM REPOSITORY <LSM> FROM REPOSITORY <GeoNames> WHERE { ?airport gn :featureClass ?airportClass ; wgs84_pos :lat ?lat ; wgs84_pos :long ?long ; gn :name|gn :officialName ?airportName ; gn :parentFeature+ ?city, city gn :featureClass ?cityClass. rdf :type/rdfs :subClassOf* yago :Hurricane111467018 ; dbpprop :damages ?damage. ?nearbyLocation gn :parentFeature* ?area. ?area gn :name|gn :officialName ?areaName.}