, Ère du Big Data et le besoin d'un traitement adapté, p.174

, Problèmes d'apprentissage artificiel non surpervisé, p.175

. , 177 Des matrices aléatoires structurées, une approche pour de l'apprentissage à grande échelle, p.177

, Apprentissage de codes compacts binaires de flux de données massives via le hashing hypercubique pour la recherche des plus proches voisins, p.178

. , Clustering de données massives à partir d'un arbre couvrant minimum

G. , Des matrices aléatoires structurées, une approche pour de l'apprentissage à grande échelle

. , Apprentissage de codes compacts binaires de flux de données massives via le hashing hypercubique pour la recherche des plus proches voisins

. , Clustering de données massives à partir d'un arbre couvrant minimum

, Sécurité de l'usage du sketching de graphe, p.186

. .. Passage-À-l'échelle-de-l'algorithme, , p.187

. , 5.1 Rappel des contributions et inscription dans le sujet, p.190

G. , Apprentissage de codes compacts binaires de flux de données massives via le hashing hypercubique pour la recherche des plus proches voisins 183

, Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS'17), vol.54, pp.20-22

, ? La contribution de la Section G.3 sur l'apprentissage de codes binaires compacts pour des données en grande dimension a été publiée à la Conférence Internationale de l'Acoutisque

A. Morvan, A. Souloumiac, C. Gouy-pailler, and J. Atif,

, Streaming Binary Sketching based on Subspace Tracking and Diagonal Uniformization Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.15-20

, Une extension de ce travail présentant des garanties théoriques supplémentaires avec une application au clustering préservant la vie privée différentielle a été publiée à la Conférence, ? L'approche sketching de graphe pour du clustering de flux de données massives a été publiée à la Conférence SIAM Data Mining (SDM) 2018, 2018.

A. Morvan, K. Choromanski, C. Gouy-pailler, and J. Atif, Graph sketching-based Space-efficient Data Clustering, Proceedings of the SIAM International Conference on DATA MINING (SDM'18), pp.3-4
DOI : 10.1137/1.9781611975321.2

URL : https://hal.archives-ouvertes.fr/cea-01838501

R. Pinot, A. Morvan, F. Yger, C. Gouy-pailler, and J. Atif, Graph-based Clustering under Differential Privacy Proceedings of the International Conference on Uncertainty in Artificial Intelligence (UAI 2018), pp.6-10

K. Bibliography, A. Abed-meraim, Y. Chkeif, and . Hua, Fast Orthonormal PAST Algorithm, IEEE Signal Processing Letters, vol.32, issue.3, p.182, 2000.

D. Achlioptas, Database-friendly Random Projections: Johnson-Lindenstrauss with Binary Coins, J. Comput. Syst. Sci, vol.66, issue.4, pp.671-687, 2003.

C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu, A Framework for Clustering Evolving Data Streams, Proceedings of the 29th International Conference on Very Large Data Bases, vol.29, p.38, 2003.

R. Agrawal and A. Swami, A One-Pass Space-Efficient Algorithm for Finding Quantiles, PROC. 7TH INTL. CONF. MANAGEMENT OF DATA (COMAD-95), 1995.

K. J. Ahn, S. Guha, and A. Mcgregor, Analyzing Graph Structure via Linear Measurements, Proceedings of the Twenty-third Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '12, vol.41, p.186, 2012.

K. J. Ahn, S. Guha, and A. Mcgregor, Graph Sketches: Sparsification, Spanners, and Subgraphs, Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS '12, p.45, 2012.

K. J. Ahn, S. Guha, and A. Mcgregor, Spectral Sparsification in Dynamic Graph Streams, p.45, 2013.

N. Ailon and B. Chazelle, Approximate nearest neighbors and the fast JohnsonLindenstrauss transform, Proceedings of the 38th STOC, vol.54, p.58, 2006.

N. Ailon and E. Liberty, An almost optimal unrestricted fast Johnson-Lindenstrauss transform, SODA, p.54, 2011.

N. Ailon, R. Jaiswal, and C. Monteleoni, Streaming k-means approximation, Advances in Neural Information Processing Systems, vol.22, p.37, 2009.

N. Ailon, Y. Chen, and H. Xu, Breaking the Small Cluster Barrier of Graph Clustering, p.41, 2013.

N. Alon, Y. Matias, and M. Szegedy, The Space Complexity of Approximating the Frequency Moments, Proceedings of the Twenty-eighth Annual ACM Symposium 194 BIBLIOGRAPHY on Theory of Computing, STOC '96, pp.20-29, 1996.

A. Andoni and P. Indyk, Near-optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions, Commun. ACM, vol.22, issue.1, p.74, 2008.

A. Andoni, P. Indyk, T. Laarhoven, I. Razenshteyn, and L. Schmidt, Practical and Optimal LSH for Angular Distance, Proceedings of the 28th International Conference on Neural Information Processing Systems, NIPS'15, vol.177, p.179, 2015.

A. Andoni, P. Indyk, T. Laarhoven, I. Razenshteyn, and L. Schmidt, Practical and Optimal LSH for Angular Distance, NIPS, vol.54, p.74, 2015.

D. Arthur and S. Vassilvitskii, The Advantages of Careful Seeding, Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '07, p.36, 2007.

T. Asano, B. Bhattacharya, M. Keil, and F. Yao, Clustering Algorithms Based on Minimum and Maximum Spanning Trees, Proceedings of the Fourth Annual Symposium on Computational Geometry, SCG '88, vol.43, p.187, 1988.

F. Aurenhammer, Voronoi Diagrams&Mdash;a Survey of a Fundamental Geometric Data Structure, ACM Comput. Surv, vol.23, issue.3, pp.345-405, 1991.

K. Azuma, Weighted sums of certain dependent random variables, Tohoku Math. J, vol.19, issue.2, p.179, 1967.

B. Bandyopadhyay, D. Fuhry, A. Chakrabarti, and S. Parthasarathy, Topological Graph Sketching for Incremental and Scalable Analytics, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM '16, p.45, 2016.

A. Bellet, A. Habrard, and M. Sebban, A survey on metric learning for feature vectors and structured data, vol.33, p.35, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01666935

V. Bentkus, On the dependence of the Berry-Esseen bound on dimension, Journal of Statistical Planning and Inference, vol.113, issue.2, p.179, 2003.

J. L. Bentley, Multidimensional Binary Search Trees Used for Associative Searching, Commun. ACM, vol.18, issue.9, p.37, 1975.

S. Bhattacharya, M. Henzinger, D. Nanongkai, and C. Tsourakakis, Space-and Time-Efficient Algorithm for Maintaining Dense Subgraphs on One-Pass Dynamic Streams, Proceedings of the Forty-seventh Annual ACM Symposium on Theory of Computing, STOC '15, p.45, 2015.

C. M. Bishop, Pattern Recognition and Machine Learning, vol.3, p.195, 2006.

A. Blum, K. Ligett, and A. Roth, A Learning Theory Approach to Non-interactive Database Privacy, Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing, STOC '08, vol.154, p.157, 2008.

C. Boutsidis, D. Garber, Z. S. Karnin, and E. Liberty, Online Principal Components Analysis, Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, p.32, 2015.

S. Boyd and L. Vandenberghe, Convex Optimization, 2004.

R. S. Boyer and J. S. Moore, MJRTY-A Fast Majority Vote Algorithm, pp.105-117, 1991.

R. P. Brent, J. H. Osborn, and W. D. Smith, Bounds on determinants of perturbed diagonal matrices, p.141, 2014.

F. Cakir and S. Sclaroff, Adaptive Hashing for Fast Similarity Search, ICCV, vol.26, p.33, 2015.

F. Cakir, K. He, S. Bargal, and S. Sclaroff, MIHash: Online Hashing With Mutual Information, ICCV, vol.26, p.33, 2017.

F. Cao, M. Ester, W. Qian, and A. Zhou, Density-based clustering over an evolving data stream with noise, 2006 SIAM Conference on Data Mining, vol.37, p.39, 2006.

M. Charikar, Similarity estimation techniques from rounding algorithms, STOC, vol.177, p.179, 2002.

L. Chen, T. Yu, and R. Chirkova, WaveCluster with Differential Privacy, Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM '15, vol.154, p.157, 2015.

X. Chen, I. King, and M. R. Lyu, FROSH: FasteR Online Sketching Hashing, UAI, vol.28, p.74, 2017.

Y. Chen, S. Sanghavi, and H. Xu, Clustering Sparse Graphs, Advances in Neural Information Processing Systems, vol.25, p.41, 2012.

Y. Chen, S. Sanghavi, and H. Xu, Clustering Sparse Graphs, Advances in Neural Information Processing Systems, vol.25, p.41, 2012.

Y. Chen, A. Jalali, S. Sanghavi, and H. Xu, Clustering Partially Observed Graphs via Convex Optimization, J. Mach. Learn. Res, vol.15, issue.1, p.41, 2014.

Y. Chen, S. H. Lim, and H. Xu, Weighted Graph Clustering with Non-uniform Uncertainties, Proceedings of the 31st International Conference on International Conference on Machine Learning, vol.32, p.41, 2014.

A. Choromanska, K. Choromanski, M. Bojarski, T. Jebara, S. Kumar et al., Binary embeddings with structured hashed projections, ICML, vol.135, p.136, 2016.

K. Choromanski and V. Sindhwani, Recycling Randomness with Structure for Sublinear time Kernel Expansions, ICML, vol.56, p.142, 2016.

A. Condon and R. M. Karp, Algorithms for Graph Partitioning on the Planted Partition Model. Random Struct, Algorithms, vol.18, issue.2, p.41, 2001.

J. W. Cooley and J. W. Tukey, An Algorithm for the Machine Calculation of, Complex Fourier Series. Mathematics of Computation, vol.19, issue.90, p.131, 1965.

G. Cormode and D. Firmani, A unifying framework for l 0-sampling algorithms. Distributed and Parallel Databases, Special issue on Data Summarization on Big Data, vol.32, issue.3, p.116, 2014.

G. Cormode and S. Muthukrishnan, An Improved Data Stream Summary: The Count-min Sketch and Its Applications, J. Algorithms, vol.55, issue.1, pp.58-75, 2005.

G. Cormode and S. Muthukrishnan, Approximating Data with the Count-Min Sketch, IEEE Software, vol.29, issue.1, pp.64-69, 2012.

G. Cormode, M. Garofalakis, P. J. Haas, and C. Jermaine, Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches. Found. Trends databases, vol.4, pp.1-294, 2009.

K. Crammer, O. Dekel, J. Keshet, S. Shalev-shwartz, and Y. Singer, Online PassiveAggressive Algorithms, J. Mach. Learn. Res, vol.7, pp.551-585, 1926.

B. Dai, B. Xie, N. He, Y. Liang, A. Raj et al., Scalable Kernel Methods via Doubly Stochastic Gradients, NIPS, p.137, 2014.

A. Dasgupta, R. Kumar, T. Sarlos, and . Sparse-johnson, , p.54, 2010.

S. Dasgupta and Y. Freund, Random projection trees and low dimensional manifolds, STOC, p.135, 2008.

S. Dasgupta and A. Gupta, An elementary proof of the Johnson-Lindenstrauss Lemma, p.25, 1999.

P. J. Davis, Circulant Matrices, p.131, 1970.

W. H. Day and H. Edelsbrunner, Efficient algorithms for agglomerative hierarchical clustering methods, Journal of Classification, vol.1, issue.1, pp.7-24, 1984.

D. Defays, An efficient algorithm for a complete link method, The Computer Journal, vol.20, issue.4, pp.364-366, 1977.

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum Likelihood from Incomplete Data via the EM Algorithm, Journal of the Royal Statistical Society. Series B (Methodological), vol.39, issue.1, pp.1-38, 1977.

M. Denil, B. Shakibi, L. Dinh, M. Ranzato, and N. D. Freitas, Predicting Parameters in Deep Learning, NIPS, p.137, 2013.

T. Do, A. Doan, and N. Cheung, Learning to Hash with Binary Deep Neural Network, ECCV, p.26, 2016.

M. Durand and P. Flajolet, Loglog Counting of Large Cardinalities, ESA, pp.605-617, 2003.

C. Dwork, A Firm Foundation for Private Data Analysis, Commun. ACM, vol.54, issue.1, p.157, 2011.

C. Dwork and A. Roth, The Algorithmic Foundations of Differential Privacy, Foundations and Trends R in Theoretical Computer Science, vol.9, issue.3-4, p.164, 2013.

C. Dwork, K. Kenthapadi, F. Mcsherry, I. Mironov, and M. Naor, Our Data, Ourselves: Privacy Via Distributed Noise Generation, Eurocrypt, vol.4004, p.156, 2006.

C. Dwork, F. Mcsherry, K. Nissim, and A. Smith, Calibrating Noise to Sensitivity in Private Data Analysis, Theory of Cryptography, vol.154, p.156, 2006.

M. Ester, H. Peter-kriegel, J. Sander, and X. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, vol.117, p.187, 1996.

T. Falkowski, A. Barth, and M. Spiliopoulou, DENGRAPH: A Density-based Community Detection Algorithm, Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI '07, p.41, 2007.

C. Feng, Q. Hu, and S. Liao, Random Feature Mapping with Signed Circulant Matrix Projection, IJCAI, p.136, 2015.

B. J. Fino and V. R. Algazi, Unified Matrix Treatment of the Fast Walsh-Hadamard Transform, IEEE Trans. Comput, vol.25, issue.11, p.129, 1976.

P. Flajolet and G. N. Martin, Probabilistic counting algorithms for data base applications, Journal of Computer and System Sciences, vol.31, issue.2, pp.182-209, 1985.
URL : https://hal.archives-ouvertes.fr/inria-00076244

P. Flajolet, É. Fusy, O. Gandouet, and F. Meunier, HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm, Conference on Analysis of Algorithms (AofA 07) of DMTCS Proceedings, pp.137-156, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00406166

P. Frankl and H. Maehara, The Johnson-Lindenstrauss Lemma and the Sphericity of Some Graphs, J. Comb. Theory Ser. A, vol.44, issue.3, pp.355-362, 1924.

M. L. Fredman and D. E. Willard, Trans-dichotomous Algorithms for Minimum Spanning Trees and Shortest Paths, J. Comput. Syst. Sci, vol.48, issue.3, pp.533-551, 1994.

H. N. Gabow, Z. Galil, T. Spencer, and R. E. Tarjan, Efficient Algorithms for Finding Minimum Spanning Trees in Undirected and Directed Graphs, Combinatorica, vol.6, issue.2, pp.109-122, 1986.

A. Gionis, P. Indyk, and R. Motwani, Similarity Search in High Dimensions via Hashing, Proceedings of the 25th International Conference on Very Large Data Bases, VLDB '99, pp.518-529, 1999.

G. H. Golub, H. A. Van-der, and . Vorst, Eigenvalue computation in the 20th century, Journal of Computational and Applied Mathematics, vol.32, issue.1-2, p.182, 2000.

Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin, Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.178, issue.12, p.182, 2013.

I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Book in preparation for, p.136, 2016.

K. Grauman and B. Kulis, Kernelized Locality-Sensitive Hashing, IEEE Transactions on Pattern Analysis and Machine Intelligence, p.22, 2011.

O. Grygorash, Y. Zhou, and Z. Jorgensen, Minimum Spanning Tree Based Clustering Algorithms, 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06), vol.43, p.154, 2006.

S. Guha, R. Rastogi, and K. Shim, Cure: An Efficient Clustering Algorithm for Large Databases, Inf. Syst, vol.26, issue.1, p.37, 2001.

S. Guha, A. Meyerson, N. Mishra, R. Motwani, and L. O'callaghan, Clustering Data Streams: Theory and Practice, IEEE Trans. on Knowl. and Data Eng, vol.15, issue.3, p.37, 2003.

I. Guyon and A. Elisseeff, An Introduction to Variable and Feature Selection, J. Mach. Learn. Res, vol.3, pp.1157-1182, 2003.

S. Har-peled, P. Indyk, and R. Motwani, Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality, Theory of Computing, vol.8, issue.14, p.54, 2012.

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning. Springer Series in Statistics, 2001.

M. Hay, C. Li, G. Miklau, and D. Jensen, Accurate Estimation of the Degree Distribution of Private Networks, Ninth IEEE International Conference on Data Mining, p.155, 2009.

M. R. Henzinger, P. Raghavan, and S. Rajagopalan, External Memory Algorithms. chapter Computing on Data Streams, American Mathematical Society, pp.107-118, 1999.

J. Herault and B. Ans, Neuronal network with modifiable synapses: decoding of composite sensory messages under unsupervised and permanent learning, vol.299, pp.525-533

A. Hinrichs and J. Vybíral, Johnson-Lindenstrauss lemma for circulant matrices, Random Struct. Algorithms, vol.39, issue.3, p.129, 2011.

S. Ho and S. Ruan, Preserving Privacy for Interesting Location Pattern Mining from Trajectory Data, Trans. Data Privacy, vol.6, issue.1, p.157, 2013.

P. W. Holland, K. B. Laskey, and S. Leinhardt, Stochastic blockmodels: First steps, Social Networks, vol.5, issue.2, p.41, 1983.

H. Hotelling, Analysis of a complex of statistical variables into principal components, J. Educ. Psych, vol.24, p.27, 1933.

L. Huang, Q. Yang, and W. Zheng, Online Hashing, IJCAI, vol.26, p.33, 2013.

P. Huang, H. Avron, T. Sainath, V. Sindhwani, and B. Ramabhadran, Kernel Methods match Deep Neural Networks on TIMIT, ICASSP, vol.135, p.137, 2014.

Z. Huang and P. Peng, Dynamic Graph Stream Algorithms in o(n) Space. CoRR, p.45, 2016.

M. Huhtanen and A. Perämäki, Factoring Matrices into the Product of Circulant and Diagonal Matrices, Journal of Fourier Analysis and Applications, vol.21, issue.5, p.137, 2015.

P. Indyk and R. Motwani, Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality, Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, STOC '98, vol.177, p.179, 1998.

A. K. Jain and R. C. Dubes, Algorithms for Clustering Data, vol.7, p.96, 1988.

W. Johnson and J. Lindenstrauss, Extensions of Lipschitz mappings into a Hilbert space, Conference in modern analysis and probability, vol.26, p.54, 1982.

I. Jolliffe, Principal Component Analysis, p.30, 1986.

H. Jégou, M. Douze, C. Schmid, and P. Pérez, Aggregating local descriptors into a compact image representation, CVPR, vol.28, p.74, 2010.

T. Kailath and A. Sayed, Fast Reliable Algorithms for Matrices with Structure, Society for Industrial and Applied Mathematics, vol.131, p.132, 1999.

D. M. Kane and J. Nelson, A Sparser Johnson-Lindenstrauss Transform, p.25, 2010.

D. R. Karger, P. N. Klein, and R. E. Tarjan, A Randomized Linear-time Algorithm to Find Minimum Spanning Trees, J. ACM, vol.42, issue.2, pp.321-328, 1995.

Z. Karnin and E. Liberty, Online PCA with Spectral Bounds, COLT, p.32, 2015.

S. P. Kasiviswanathan, K. Nissim, S. Raskhodnikova, and A. Smith, Analyzing Graphs with Node Differential Privacy, Proceedings of the 10th Theory of Cryptography Conference on Theory of Cryptography, TCC'13, vol.155, p.200, 2013.

L. Kaufman and P. J. Rousseeuw, Clustering by means of medoids. Statistical Data Analysis Based on the L 1-Norm and Related Methods, vol.7, p.96, 1987.

W. Kong and W. Li, Isotropic Hashing, NIPS, vol.86, p.182, 2012.

H. Lai, Y. Pan, Y. Liu, and S. Yan, Simultaneous feature learning and hash coding with deep neural networks, CVPR, vol.26, p.34, 2015.

Q. Le, T. Sarlós, and A. Smola, Fastfood-computing hilbert space expansions in loglinear time, ICML, vol.135, p.137, 2013.

Y. Lecun, Y. Bengio, and G. Hinton, Deep learning, Nature, vol.521, issue.7553, p.136, 2015.

Y. Lee, Spherical Hashing. In CVPR, p.26, 2012.

C. Leng, J. Wu, J. Cheng, X. Bai, and H. Lu, Online sketching hashing, CVPR, vol.178, p.182, 2015.

C. Leng, J. Wu, J. Cheng, X. Zhang, and H. Lu, Hashing for Distributed Data, ICML, p.26, 2015.

J. Li, Restructuring of Deep Neural Network Acoustic Models with Singular Value Decomposition, Interspeech, p.137, 2013.

E. Liberty, Simple and Deterministic Matrix Sketching, Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '13, vol.31, p.32, 2013.

E. Liberty, N. Ailon, and A. Singer, Dense fast random projections and lean Walsh transforms, RANDOM, p.54, 2008.

V. E. Liong, J. Lu, G. Wang, P. Moulin, and J. Zhou, Deep hashing for compact binary codes learning, CVPR, pp.2475-2483, 1926.

W. Liu, J. Wang, and S. Fu-chang, Hashing with graphs, ICML, p.26, 2011.

W. Liu, J. Wang, R. Ji, Y. Jiang, and S. Chang, Supervised hashing with kernels, CVPR, p.26, 2012.

W. Liu, C. Mu, S. Kumar, and S. Chang, Discrete Graph Hashing, NIPS, p.26, 2014.

S. Lloyd, Least Squares Quantization in PCM, IEEE Trans. Inf. Theor, vol.28, issue.2, p.96, 1982.

U. Luxburg, A Tutorial on Spectral Clustering. Statistics and Computing, vol.17, issue.4, p.41, 2007.

U. V. Luxburg and O. Bousquet, Distance-Based Classification with Lipschitz Functions, J. Mach. Learn. Res, vol.5, pp.669-695, 2004.

Q. Lv, W. Josephson, Z. Wang, M. Charikar, and K. Li, Multi-probe LSH: Efficient Indexing for High-dimensional Similarity Search, Proceedings of the 33rd International Conference on Very Large Data Bases, VLDB '07, p.23, 2007.

, BIBLIOGRAPHY, vol.201

Q. Ma, S. Muthukrishnan, and M. Sandler, Frugal Streaming for Estimating Quantiles, pp.77-96, 2013.

J. Mairal, P. Koniusz, Z. Harchaoui, and C. Schmid, Convolutional Kernel Networks, p.137, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01005489

J. Matou?ek, On Variants of the Johnson-Lindenstrauss Lemma. Random Struct, Algorithms, vol.33, issue.2, pp.142-156, 1925.

A. Mcgregor, Graph Stream Algorithms: A Survey, SIGMOD Rec, vol.43, issue.1, p.45, 2014.

F. Mcsherry, Privacy Integrated Queries, booktitle = Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD). Association for Computing Machinery, vol.154, p.157, 2009.

F. Mcsherry and K. Talwar, Mechanism Design via Differential Privacy, Annual IEEE Symposium on Foundations of Computer Science (FOCS), p.156, 2007.

M. Moczulski, M. Denil, J. Appleyard, and N. De-freitas, ACDC: A Structured Efficient Linear Layer, ICLR, p.137, 2016.

D. Moulavi, P. A. Jaskowiak, R. J. Campello, A. Zimek, and J. Sander, Density-Based Clustering Validation, Proceedings of the 2014 SIAM International Conference on Data Mining, p.101, 2014.

Y. Mülle, C. Clifton, and K. Böhm, Privacy-Integrated Graph Clustering Through Differential Privacy, EDBT/ICDT Workshops, p.157, 2015.

J. Munro and M. Paterson, Selection and sorting with limited storage, Theoretical Computer Science, vol.12, issue.3, pp.315-323, 1980.

S. Muthukrishnan, Data Streams: Algorithms and Applications. Found, Trends Theor. Comput. Sci, vol.1, issue.2, p.115, 2005.

M. C. Nascimento and A. C. De-carvalho, Spectral methods for graph clustering-A survey, European Journal of Operational Research, vol.211, issue.2, p.96, 2011.

, FIPS 180-1. Secure Hash Standard, 1995.

A. Y. Ng, M. I. Jordan, and Y. Weiss, On Spectral Clustering: Analysis and an algorithm, Advances in Neural Information Processing Systems, vol.14, p.43, 2002.

H. H. Nguyen, A. Imine, and M. Rusinowitch, Detecting Communities Under Differential Privacy, Proceedings of the 2016 ACM on Workshop on Privacy in the Electronic Society, WPES '16, p.157, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01393266

K. Nissim, S. Raskhodnikova, and A. Smith, Smooth sensitivity and sampling in private data analysis, Proceedings of the thirty-ninth annual ACM symposium on Theory of computing-STOC, vol.154, p.157, 2007.

M. Norouzi, D. J. Fleet, and . Cartesian-k-means, CVPR, pp.3017-3024, 2013.

S. Oymak and B. Hassibi, Finding Dense Clusters via "Low Rank + Sparse" Decomposition. CoRR, abs/1104, vol.5186, p.41, 2011.

V. Y. Pan, Structured Matrices and Polynomials: Unified Superfast Algorithms, p.131, 2001.

K. Pearson, On Lines and Planes of Closest Fit to Systems of Points in Space, Philosophical Magazine, vol.2, p.27, 1901.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, vol.12, p.117, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

M. Pilanci and M. J. Wainwright, Randomized sketches of convex programs with sharp guarantees, ISIT, vol.135, p.136, 2014.

M. Pilanci and M. J. Wainwright, Newton Sketch: A Linear-time Optimization Algorithm with Linear-Quadratic Convergence, vol.135, p.143, 2015.

R. Pinot, Minimum spanning tree release under differential privacy constraints, vol.155, p.158, 2018.

M. Raginsky and S. Lazebnik, Locality-sensitive binary codes from shift-invariant kernels, NIPS, p.22, 2009.

A. Rahimi and B. Recht, Random Features for Large-Scale Kernel Machines, NIPS, p.135, 2007.

W. Rand, Objective criteria for the evaluation of clustering methods, Journal of the American Statistical Association, vol.66, issue.336, p.120, 1971.

R. Raziperchikolaei and M. Á. Carreira-perpiñán, Optimizing affinity-based binary hashing using auxiliary coordinates, NIPS, p.26, 2016.

K. Rohe, S. Chatterjee, and B. Yu, Spectral clustering and the high-dimensional stochastic blockmodel, Ann. Statist, vol.39, issue.4, p.41, 2011.

P. J. Rousseeuw, Silhouettes: A graphical aid to the interpretation and validation of cluster analysis, Journal of Computational and Applied Mathematics, vol.20, p.120, 1987.

S. T. Roweis and L. K. Saul, Nonlinear dimensionality reduction by locally linear embedding, SCIENCE, vol.290, issue.3, pp.2323-2326, 2000.

T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, Lowrank matrix factorization for Deep Neural Network training with high-dimensional output targets, ICASSP, p.137, 2013.

S. E. Schaeffer, Survey: Graph Clustering, Comput. Sci. Rev, vol.1, issue.1, p.154, 2007.

B. Schölkopf, A. Smola, and K. Müller, Nonlinear Component Analysis As a Kernel Eigenvalue Problem, Neural Comput, vol.10, issue.5, pp.1299-1319, 1998.

A. Sealfon, Shortest Paths and Distances with Differential Privacy, Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems-PODS, p.155, 2016.

J. Shi and J. Malik, Normalized cuts and image segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.8, p.43, 2000.

R. Sibson, SLINK: An optimally efficient algorithm for the single-link cluster method, The Computer Journal, vol.16, issue.1, pp.30-34, 1973.
DOI : 10.1093/comjnl/16.1.30

URL : https://academic.oup.com/comjnl/article-pdf/16/1/30/1196082/160030.pdf

V. Sindhwani, T. N. Sainath, and S. Kumar, Structured Transforms for SmallFootprint Deep Learning, Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems, p.137, 2015.

N. Sundaram, A. Turmukhametova, N. Satish, T. Mostak, P. Indyk et al., Streaming Similarity Search over One Billion Tweets Using Parallel Locality-sensitive Hashing, Proc. VLDB Endow, vol.6, p.179, 2013.
DOI : 10.14778/2556549.2556574

URL : http://www.vldb.org/pvldb/vol6/p1930-sundaram.pdf

J. B. Tenenbaum, V. Silva, and J. C. Langford, A Global Geometric Framework for Nonlinear Dimensionality Reduction, Science, vol.290, issue.5500, p.2319, 2000.
DOI : 10.1126/science.290.5500.2319

K. Terasawa and Y. Tanaka, Spherical LSH for Approximate Nearest Neighbor Search on Unit Hypersphere, WADS, vol.180, p.190, 2007.
DOI : 10.1007/978-3-540-73951-7_4

URL : https://lib-repos.fun.ac.jp/dspace/bitstream/10445/3004/4/kterasaw_2007_01_wads.pdf

C. F. Van-loan, The Ubiquitous Kronecker Product, J. Comput. Appl. Math, vol.123, issue.1-2, pp.85-100, 2000.

J. S. Vitter, Random Sampling with a Reservoir, ACM Trans. Math. Softw, vol.11, issue.1, p.37, 1985.
DOI : 10.1145/3147.3165

URL : http://www.cs.umd.edu/~samir/498/vitter.pdf

J. Vybíral, A variant of the Johnson-Lindenstrauss lemma for circulant matrices, Journal of Functional Analysis, vol.260, issue.4, p.129, 2011.

J. Wang, S. Kumar, and S. Chang, Semi-Supervised Hashing for Large-Scale Search, IEEE Trans. Pattern Anal. Mach. Intell, issue.12, p.26, 2012.

J. Wang, W. Liu, S. Kumar, and S. Chang, Learning to Hash for Indexing Big Data-A Survey, Proceedings of the IEEE, issue.1, p.52, 2016.
DOI : 10.1109/jproc.2015.2487976

J. Wang, T. Zhang, N. Song, H. T. Sebe, and . Shen, A Survey on Learning to Hash, IEEE Trans. on Pattern Anal. and Mach. Intell, vol.26, issue.99, p.74, 2018.
DOI : 10.1109/tpami.2017.2699960

Y. Weiss, A. Torralba, and R. Fergus, Spectral Hashing. In NIPS, p.26, 2008.

E. P. Xing, A. Y. Ng, M. I. Jordan, and S. Russell, Distance Metric Learning, with Application to Clustering with Side-information, Proceedings of the 15th International Conference on Neural Information Processing Systems, NIPS'02, p.34, 2002.

Y. Xu, V. Olman, and D. Xu, Clustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees, vol.18, p.204, 2002.

Z. Yang, M. Moczulski, M. Denil, N. De-freitas, A. Smola et al., Deep Fried Convnets. In ICCV, p.137, 2015.
DOI : 10.1109/iccv.2015.173

F. Yu, S. Kumar, Y. Gong, and S. Chang, Circulant Binary Embedding, ICML, vol.26, p.74, 2014.

C. T. Zahn, Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters, IEEE Trans. Comput, vol.20, issue.1, p.154, 1943.
DOI : 10.1109/t-c.1971.223083

URL : http://www.slac.stanford.edu/pubs/slacpubs/0500/slac-pub-0672.pdf

T. Zhang, R. Ramakrishnan, and M. Livny, BIRCH: An Efficient Data Clustering Method for Very Large Databases, Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, SIGMOD '96, p.38, 1996.

X. Zhang, F. X. Yu, R. Guo, S. Kumar, S. Wang et al., Fast Orthogonal Projection Based on Kronecker Product, IEEE International Conference on Computer Vision (ICCV), vol.129, p.133, 2015.
DOI : 10.1109/iccv.2015.335