P. Absil, Optimization algorithms on matrix manifolds, 2008.

J. Alexander and A. Hirschowitz, Polynomial interpolation in several variables, J. Algebraic Geom, vol.4, pp.201-222, 1995.

A. Almeida, The constrained block-parafac decomposition, Presentation at TRICAP 2006, Chania, Greece, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00417703

G. An, S. Liu, and Q. Ruan, A sparse neighborhood preserving nonnegative tensor factorization algorithm for facial expression recognition, Pattern Anal. Appl, vol.20, issue.2, pp.453-471, 2017.

C. A. Andersson and R. Bro, The n-way toolbox for matlab, Chemometrics and Intelligent Laboratory Systems, vol.52, pp.1-4, 2000.

W. Austin, G. Ballard, and T. G. Kolda, Parallel tensor compression for large-scale scientific data, IEEE International Parallel and Distributed Processing Symposium, 2016.

B. W. Bader and T. G. Kolda, Algorithm 862 : Matlab tensor classes for fast algorithm prototyping, ACM Trans. Math. Software, vol.32, pp.635-653, 2006.

B. W. Bader and T. G. Kolda, Efficient matlab computations with sparse and factored tensors, SIAM J. Sci. Comput, vol.30, pp.205-231, 2007.

B. W. Bader and T. G. Kolda, Matlab tensor toolbox, version 2.2, 2007.

G. Ballard, A. Klinvex, and T. G. Kolda, Tuckermpi : A parallel c++/mpi software package for large-scale data compression via the tucker tensor decomposition, arXiv, 2019.

M. Baskaran, B. Meister, N. Vasilache, and R. Lethin, Efficient and scalable computations with sparse tensors, IEEE Conference on High Performance Extreme Computing, pp.1-6, 2012.

A. Beck and L. Tetruashvili, On the convergence of block coordinate descent type methods, SIAM J. Optim, vol.23, issue.4, pp.2037-2060, 2013.

O. A. Malik and S. Becker, Low-rank tucker decomposition of large tensors using tensorsketch, Advances in Neural Information Processing Systems, pp.10117-10127, 2018.

J. M. F. ten Berge and N. D. Sidiropoulos, On uniqueness in candecomp/parafac, Psychometrika, vol.67, pp.399-409, 2002.

S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, Distributed optimization and statistical learning via the alternating direction method of multipliers, Found. Trends Mach. Learn, vol.3, issue.1, pp.1-122, 2011.

C. F. Caiafa and A. Cichocki, Generalizing the column-row matrix decomposition to multiway arrays, Linear Algebra and its Applications, vol.433, pp.557-573, 2010.

J. D. Carroll and J. J. Chang, Analysis of individual differences in multidimensional scaling via an n-way generalization of "eckart-young" decomposition, Psychometrika, vol.35, pp.283-319, 1970.

J. Casebeer, M. Colomb, and P. Smaragdis, Deep tensor factorization for spatially-aware scene decomposition, arXiv, abs, 1391.

R. B. Cattell, Parallel proportional profiles and other principles for determining the choice of factors by rotation, Psychometrika, vol.9, pp.267-283, 1944.

R. B. Cattell, The three basic factor-analytic research designs - their interrelations and derivatives, Psych. Bull, vol.49, pp.499-520, 1952.

V. T. Chakaravarthy, J. W. Choi, D. J. Joseph, P. Murali et al., On optimizing distributed tucker decomposition for sparse tensors, Proceedings of the 2018 International Conference on Supercomputing, pp.374-384, 2018.

M. Che and Y. Wei, Randomized algorithms for the approximations of tucker and the tensor train decompositions, Advances in Computational Mathematics, pp.1-34, 2018.

D. Choi, J. Jang, and U. Kang, Fast, accurate, and scalable method for sparse coupled matrix-tensor factorization, 2017.

D. Choi and L. Sael, Snect : Scalable network constrained tucker decomposition for integrative multi-platform data analysis, 2017.

A. Cichocki and A.-H. Phan, Fast local algorithms for large scale nonnegative matrix and tensor factorizations, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol.92, pp.708-721, 2009.

A. Cichocki, N. Lee, I. Oseledets, A. Phan, Q. Zhao et al., Tensor networks for dimensionality reduction and large-scale optimization : Part 1 low-rank tensor decompositions, Foundations and Trends in Machine Learning, vol.9, pp.249-429, 2016.

A. Cichocki, N. Lee, I. Oseledets, A. Phan, Q. Zhao et al., Tensor networks for dimensionality reduction and large-scale optimization : Part 2 applications and future perspectives, Foundations and Trends in Machine Learning, vol.9, pp.431-673, 2017.

A. Cichocki, D. Mandic, L. De Lathauwer, G. Zhou, Q. Zhao et al., Tensor Decompositions for Signal Processing Applications : From two-way to multiway component analysis, IEEE Signal Processing Magazine, vol.32, issue.2, pp.145-163, 2015.

P. Comon, Tensors : A brief introduction, IEEE Signal Processing Magazine, vol.31, pp.44-53, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00923279

P. Comon, G. Golub, L. Lim, and B. Mourrain, Symmetric tensors and symmetric tensor rank, SIAM J. Matrix Anal. Appl, vol.30, issue.3, pp.1254-1279, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00327599

P. Comon and M. Sorensen, Tensor diagonalization by orthogonal transforms, 2007.

G. Cormode, S. Muthukrishnan, K. Yi, and Q. Zhang, Continuous sampling from distributed streams, J. ACM, vol.59, issue.2, 2012.

D. Alistarh et al., The convergence of sparsified gradient methods, Advances in Neural Information Processing Systems, pp.5977-5987, 2018.

L. De Lathauwer, A link between the canonical decomposition in multilinear algebra and simultaneous matrix diagonalization, SIAM J. Matrix Anal. Appl, vol.28, issue.3, pp.642-666, 2006.

J. F. Delmas, B. Jourdain, and B. Lapeyre, Processus Aléatoires

P. Drineas, R. Kannan, and M. W. Mahoney, Fast monte carlo algorithms for matrices ii : Computing a low-rank approximation to a matrix, SIAM J. Comput, vol.36, issue.1, pp.158-183, 2006.

P. Drineas and M. W. Mahoney, A randomized algorithm for a tensor-based generalization of the singular value decomposition, Linear Algebra and its Applications, vol.420, pp.553-571, 2007.

D. Erdös and P. Miettinen, Walk 'n' merge : A scalable algorithm for boolean tensor factorization, IEEE 13th International Conference on Data Mining, pp.1037-1042, 2013.

C. Févotte and J. Idier, Algorithms for nonnegative matrix factorization with the β-divergence, Neural Computation, vol.23, issue.9, pp.2421-2456, 2011.

W. Gander, Algorithms for the qr-decomposition, 1980.

J. F. Gemmeke, T. Virtanen, and A. Hurmalainen, Exemplar-based sparse representations for noise robust automatic speech recognition, Trans. Audio, Speech and Lang. Proc, vol.19, issue.7, pp.2067-2080, 2011.

G. H. Golub and C. F. Van Loan, Matrix Computations, 1996.

X. Guo, S. Miron, D. Brie, S. Zhu, and X. Liao, A candecomp/parafac perspective on uniqueness of doa estimation using a vector sensor array, IEEE Transactions on Signal Processing, vol.59, pp.3475-3481, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00574566

H. Van hamme, An on-line nmf model for temporal pattern learning : Theory with application to automatic speech recognition, LVA/ICA, LNCS, vol.7191, pp.306-313, 2012.

F. M. Harper and J. A. Konstan, The movielens datasets : History and context, TiiS, vol.5, 2015.

R. A. Harshman, Foundations of the parafac procedure : Models and conditions for an "explanatory" multi-modal factor analysis. UCLA Working Papers in Phonetics, vol.16, pp.1-84, 1970.

T. Hazan, S. Polak, and A. Shashua, Sparse image coding using a 3d non-negative tensor factorization. ICCV, vol.1, pp.50-57, 2005.

F. L. Hitchcock, The expression of a tensor or a polyadic as a sum of products, J. Math. Phys, vol.6, pp.164-189, 1927.

F. L. Hitchcock, Multiple invariants and generalized rank of a p-way matrix or tensor, J. Math. Phys, vol.7, pp.39-79, 1927.

P. Honeine, Analyzing sparse dictionaries for online learning with kernels, IEEE Transactions on Signal Processing, vol.63, pp.6343-6353, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01965568

P. Honeine, Entropies of overcomplete kernel dictionaries, Bulletin of Mathematical Sciences and Applications, vol.16, pp.1-19, 2016.

I. Jeon, E. E. Papalexakis, U. Kang, and C. Faloutsos, Ha-ten2 : Billion-scale tensor decompositions, IEEE 31st International Conference on Data Engineering, pp.1047-1058, 2015.

A. Kapteyn, H. Neudecker, and T. Wansbeek, An approach to n-mode components analysis, Psychometrika, vol.51, issue.2, pp.269-275, 1986.

A. Karatzoglou, X. Amatriain, L. Baltrunas, and N. Oliver, Multiverse recommendation : n-dimensional tensor factorization for context-aware collaborative filtering, 2010.

H. Kasai and B. Mishra, Low-rank tensor completion : a riemannian manifold preconditioning approach, ICML, vol.48, pp.1012-1021, 2016.

O. Kaya, High performance parallel algorithms for tensor decompositions, 2017.
URL : https://hal.archives-ouvertes.fr/tel-01623523

O. Kaya and B. Uçar, High performance parallel algorithms for the tucker decomposition of sparse tensors, 45th International Conference on Parallel Processing (ICPP), pp.103-112, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01354894

S. Kim, Online kernel dictionary learning, IEEE Global Conference on Signal and Information Processing (GlobalSIP), pp.103-107, 2015.

Y. Kim and S. Choi, Nonnegative tucker decomposition, IEEE Conference on Computer Vision and Pattern Recognition, pp.1-8, 2007.

A. Kodewitz, Methods for large volume image analysis applied to early detection of alzheimer's disease by analysis of fdg-pet scans, 2013.

T. G. Kolda, Multilinear operators for higher-order decompositions, 2006.

T. G. Kolda and B. W. Bader, Tensor decompositions and applications, SIAM Rev, vol.51, issue.3, pp.455-500, 2009.

T. G. Kolda and J. Sun, Scalable tensor decompositions for multi-aspect data mining, Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pp.363-372, 2008.

J. Kossaifi, Y. Panagakis, A. Anandkumar, and M. Pantic, Tensorly : Tensor learning in python, Journal of Machine Learning Research, vol.20, issue.26, pp.1-6, 2019.

P. M. Kroonenberg and J. De Leeuw, Principal component analysis of three-mode data by means of alternating least squares algorithms, Psychometrika, vol.45, issue.1, pp.69-97, 1980.

J. B. Kruskal, Three-way arrays : Rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics, Linear Algebra Appl, vol.18, pp.95-138, 1977.

J. B. Kruskal, Statement of some current results about three-way arrays, Manuscript, 1983.

J. B. Kruskal, Rank, decomposition, and uniqueness for 3-way and N-way arrays, Multiway Data Analysis, pp.7-18, 1989.

A. N. Langville and W. J. Stewart, The kronecker product and stochastic automata networks, J. Comput. Appl. Math, vol.167, issue.2, pp.429-447, 2004.

L. De Lathauwer, B. De Moor, and J. Vandewalle, A multilinear singular value decomposition, SIAM J. Matrix Anal. Appl, vol.21, issue.4, pp.1253-1278, 2000.

L. De Lathauwer, B. De Moor, and J. Vandewalle, On the best rank-1 and rank-(r1,r2,...,rn) approximation of higher-order tensors, SIAM J. Matrix Anal. Appl, vol.21, issue.4, pp.1324-1342, 2000.

D. Lee, J. Lee, and H. Yu, Fast tucker factorization for large-scale tensor completion, IEEE International Conference on Data Mining (ICDM), pp.1098-1103, 2018.

J. Lee, D. Choi, and L. Sael, Ctd : Fast, accurate, and interpretable method for static and dynamic tensor decompositions, PloS one, vol.13, 2018.

N. Li, Variants of als on tensor decompositions and applications, PhD thesis, 2013.

X. Li, H. Zhou, and L. Li, Tucker tensor regression and neuroimaging analysis, Statistics in Biosciences, 2013.

X. Li, T. Zhao, R. Arora, H. Liu, and M. Hong, On faster convergence of cyclic block coordinate descent-type methods for strongly convex minimization, J. Mach. Learn. Res, vol.18, issue.1, pp.6741-6764, 2017.

X. Li, K. S. Candan, and M. L. Sapino, M2td : Multi-task tensor decomposition for sparse ensemble simulations, IEEE 34th International Conference on Data Engineering (ICDE), pp.1144-1155, 2018.

X. Lian, Y. Huang, Y. Li, and J. Liu, Asynchronous parallel stochastic gradient for nonconvex optimization, Proceedings of the 28th International Conference on Neural Information Processing Systems, vol.2, pp.2737-2745, 2015.

E. Liberty, Simple and deterministic matrix sketching, Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '13, pp.581-588, 2013.

M. W. Mahoney, M. Maggioni, and P. Drineas, Tensor-cur decompositions for tensor-based data, SIGKDD, pp.327-336, 2006.

J. Mairal, F. Bach, J. Ponce, and G. Sapiro, Online dictionary learning for sparse coding. ICML '09, pp.689-696, 2009.

C. Navasca, L. De Lathauwer, and S. Kindermann, Swamp reducing technique for tensor decomposition, Proc. 16th European Signal Processing Conference, 2008.

C. Navasca and D. N. Pompey, Random projections for low multilinear rank tensors, Visualization and Processing of Higher Order Descriptors for Multi-Valued Data, pp.93-106, 2015.

D. K. Nguyen and T. B. Ho, Fast parallel randomized algorithm for nonnegative matrix factorization with kl divergence for large sparse datasets, 2016.

J. Oh, K. Shin, E. E. Papalexakis, C. Faloutsos, and H. Yu, S-hot : Scalable high-order tucker decomposition, Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, pp.761-770, 2017.

S. Oh, N. Park, L. Sael, and U. Kang, Scalable tucker factorization for sparse tensors - algorithms and discoveries, IEEE 34th International Conference on Data Engineering (ICDE), pp.1120-1131, 2018.

G. Olikier, Tensor approximation by block term decomposition. Dissertation, Supervisors : Pierre-Antoine Absil, Reader : Yurii Nesterov, 2017.

G. Olikier, P.-A. Absil, and L. De Lathauwer, Variable projection applied to block term decomposition of higher-order tensors, LVA/ICA, pp.139-148, 2018.

M. Park, J. Jang, and L. Sael, Vest : Very sparse tucker factorization of large-scale tensors, ArXiv, 2019.

N. Park, S. Oh, and U. Kang, Fast and scalable method for distributed boolean tensor factorization, The VLDB Journal, pp.1-26, 2019.

P. Comon, J. M. F. ten Berge, L. De Lathauwer, and J. Castaing, Generic and typical ranks of multi-way arrays, Linear Algebra and its Applications, vol.430, pp.2997-3007, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00410058

J. Penot, Elements of differential calculus (chapter 2 of the book : Calculus without derivatives), 2013.

I. Perros, R. Chen, R. Vuduc, and J. Sun, Sparse hierarchical tucker factorization and its application to healthcare, pp.943-948, 2015.

A.-H. Phan and A. Cichocki, Extended hals algorithm for nonnegative tucker decomposition and its applications for multiway analysis and classification, Neurocomput, vol.74, issue.11, pp.1956-1969, 2011.

N. G. Polson, J. G. Scott, and B. T. Willard, Proximal algorithms in statistics and machine learning, Statistical Science, vol.30, issue.4, pp.559-581, 2015.

A. Rakotomamonjy, Supervised Representation Learning for Audio Scene Classification, IEEE/ACM Transactions on ASLP, vol.25, pp.1253-1265, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01354115

F. Roemer, G. D. Galdo, and M. Haardt, Tensor-based algorithms for learning multidimensional separable dictionaries, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.3963-3967, 2014.

Q. Shi, Y.-M. Cheung, and Q. Zhao, Feature extraction for incomplete data via low-rank tucker decomposition, ECML PKDD, pp.564-581, 2017.

K. Shin, L. Sael, and U. Kang, Fully scalable methods for distributed tensor factorization, IEEE Trans. on Knowl. and Data Eng, vol.29, issue.1, pp.100-113, 2017.

N. D. Sidiropoulos and R. Bro, On the uniqueness of multilinear decomposition of n-way arrays, J. Chemometrics, vol.14, pp.229-239, 2000.

N. D. Sidiropoulos, E. E. Papalexakis, and C. Faloutsos, Parallel randomly compressed cubes (paracomp) : A scalable distributed architecture for big tensor decomposition, 2014.

N. D. Sidiropoulos, L. De Lathauwer, X. Fu, K. Huang, E. E. Papalexakis, and C. Faloutsos, Tensor decomposition for signal processing and machine learning, IEEE Transactions on Signal Processing, vol.65, pp.3551-3582, 2017.

S. Smith and G. Karypis, Accelerating the tucker decomposition with compressed sparse tensors, Euro-Par, 2017.

S. Soltani, M. E. Kilmer, and P. Hansen, A tensor-based dictionary learning approach to tomographic image reconstruction, BIT Numerical Mathematics, vol.56, issue.4, pp.1425-1454, 2016.

A. Stevens, Y. Pu, Y. Sun, G. Spell, and L. Carin, Tensor-dictionary learning with deep kruskal-factor analysis, AISTATS, 2017.

D. Sun and C. Févotte, Alternating direction method of multipliers for non-negative matrix factorization with the beta-divergence, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6201-6205, 2014.

J. Sun, S. Papadimitriou, C. Lin, N. Cao, M. Liu et al., Multivis : Content-based social network exploration through multiway visual analysis, SDM, 2009.

J. Sun, D. Tao, S. Papadimitriou, P. S. Yu, and C. Faloutsos, Incremental tensor analysis : Theory and applications. TKDD, vol.2, 2008.

S. Tan, Y. Zhang, G. Wang, X. Mou, G. Cao et al., Tensor-based dictionary learning for dynamic tomographic reconstruction, Physics in Medicine and Biology, vol.60, pp.2803-2818, 2015.

P. Tseng and S. Yun, A coordinate gradient descent method for nonsmooth separable minimization, Mathematical Programming, vol.117, pp.387-423, 2007.

C. E. Tsourakakis, Mach : Fast randomized tensor decompositions, SDM, 2009.

L. R. Tucker, Implications of factor analysis of three-way matrices for measurement of change, Problems in Measuring Change, pp.122-137, 1963.

T. Variddhisa and D. P. Mandic, Online multilinear dictionary learning for sequential compressive sensing, 2017.

N. Vervliet, O. Debals, and L. De Lathauwer, Tensorlab 3.0 - numerical optimization strategies for large-scale constrained and coupled matrix/tensor factorization, 50th Asilomar Conference on Signals, Systems and Computers, pp.1733-1738, 2016.

X. Vu, C. Chaux, N. Thirion-moreau, S. Maire, and E. M. Carstea, A new penalized nonnegative third-order tensor decomposition using a block coordinate proximal gradient approach : Application to 3d fluorescence spectroscopy, Journal of Chemometrics, vol.31, issue.4, p.2859, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01387439

X. T. Vu, S. Maire, C. Chaux, and N. Thirion-moreau, A new stochastic optimization algorithm to decompose large nonnegative tensors, IEEE Signal Processing Letters, vol.22, issue.10, pp.1713-1717, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01146443

F. Wu, X. Jing, W. Zuo, R. Wang, and X. Zhu, Discriminant tensor dictionary learning with neighbor uncorrelation for image set based classification, Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp.3069-3075, 2017.

J. Xu, J. Zhou, P. Tan, X. Liu, and L. Luo, Wisdom : Weighted incremental spatio-temporal multi-task learning via tensor decomposition, International Conference on Big Data, pp.522-531, 2016.

Y. Xu, On the convergence of higher-order orthogonal iteration, Linear and Multilinear Algebra, pp.2247-2265, 2017.

K. Yoshii, R. Tomioka, D. Mochihashi, and M. Goto, Infinite positive semidefinite tensor factorization for source separation of mixture signals, Proceedings of the 30th International Conference on Machine Learning, vol.28, pp.576-584, 2013.

R. Yu, D. Cheng, and Y. Liu, Accelerated online low-rank tensor learning for multivariate spatio-temporal streams, Proceedings of the 32nd International Conference on Machine Learning, pp.238-247, 2015.

R. Zdunek and A. Cichocki, Fast nonnegative matrix factorization algorithms using projected gradient approaches for large-scale problems, Intell. Neuroscience, vol.3, issue.13, pp.1-3, 2008.

Q. Zhao, L. Zhang, and A. Cichocki, Bayesian sparse tucker models for dimension reduction and tensor completion, 2015.

S. Zhe, Y. Qi, Y. Park, Z. Xu, I. Molloy et al., Dintucker : Scaling up gaussian process models on large multidimensional arrays, AAAI, 2016.

S. Zhe, Z. Xu, X. Chu, Y. Qi, and Y. Park, Scalable nonparametric multiway data analysis, AISTATS, 2015.

S. Zhe, K. Zhang, P. Wang, K. Lee, Z. Xu et al., Distributed flexible nonlinear tensor factorization, Proceedings of the 30th International Conference on Neural Information Processing Systems, NIPS'16, pp.928-936, 2016.

N. Zheng, Q. Li, S. Liao, and L. Zhang, Flickr group recommendation based on tensor decomposition, Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.737-738, 2010.

G. Zhou, A. Cichocki, Q. Zhao, and S. Xie, Efficient nonnegative tucker decompositions : Algorithms and uniqueness, IEEE Transactions on Image Processing, vol.24, issue.12, pp.4990-5003, 2015.

G. Zhou, A. Cichocki, and S. Xie, Decomposition of big tensors with low multilinear rank, 2014.

S. Zubair and W. Wang, Tensor dictionary learning with sparse tucker decomposition, International conference on digital signal processing(DSP), pp.1-6, 2013.