Computer aided fault tree analysis system

Graphiques interactifs pour la fiabilité

JEDEC global standards for the microelectronics industry, Arrhenius equation for reliability

Markov analysis software

ON Semiconductor quality and reliability handbook

Safety tools development

Vishay semiconductor reliability

User's guide for the UNIRAM version 4.1 for Windows availability assessment methodology, Availability Systems, 1996.

ASHRAE, Thermal guidelines for data processing environment, 2000.

Petri Net Analysis (PNA), chapter 17, pp.307-316, 2005.

Preliminary Hazard Analysis, pp.73-93, 2005.

Open Queueing Networks, issue.6, pp.169-195, 2008.

Risk and Safety of Engineered Systems, issue.1, pp.1-14

Introduction to Risk Assessment, pp.1-12, 2012.

INCOSE Systems Engineering Handbook: A Guide for System Life Cycle Processes and Activities, 2015.

Principles and Concepts of Cloud Computing, pp.1-32, 2016.

M. Al-Fares, A. Loukissas, and A. Vahdat, A scalable, commodity data center network architecture, SIGCOMM Comput. Commun. Rev, vol.38, issue.4, pp.63-74, 2008.

R. Alshahrani and H. Peyravi, Modeling and simulation of data center networks, SIGSIM-PADS 2014 -Proceedings of the 2014 ACM Conference on SIGSIM Principles of Advanced Discrete Simulation, 2014.

J. D. Andrews and T. R. Moss, Reliability and risk assessment, 1993.

C.-W. Ang and C.-K. Tham, Analysis and optimization of service availability in a high availability cluster with load-dependent machine availability, IEEE Transactions on Parallel and Distributed Systems, vol.18, pp.1307-1319, 2007.

M. Antal, T. Cioara, I. Anghel, R. Gorzenski, R. Januszewski et al., Reuse of Data Center Waste Heat in Nearby Neighborhoods: A Neural Networks-Based Prediction Model, Energies, vol.12, issue.5, p.814, 2019.

ASHRAE, ASHRAE Handbook, 2018.

J. Athavale, M. Yoda, and Y. Joshi, Comparison of data driven modeling approaches for temperature prediction in data centers, International Journal of Heat and Mass Transfer, vol.135, pp.1039-1052, 2019.

B. Wei, C. Lin, and X. Kong, Dependability modeling and analysis for the virtual data center of cloud computing, IEEE 13th International Conference on High Performance Computing and Communications (HPCC), 2011.

B. Beihoff, C. Oster, S. Friedenthal, C. Paredis, D. Kemp et al., A World in Motion Systems Engineering Vision 2025, vol.01, 2014.

A. H. Beitelmal and C. D. Patel, Thermo-fluids provisioning of a high performance high density, 2006.

W. M. Bennaceur and L. Kloul, Electrical and thermal system impact on the availability of a data center's system, 3rd International Conference on System Reliability and Safety (ICSRS), pp.142-148, 2018.

W. M. Bennaceur and L. Kloul, Reliability and performance analysis of a data center's network architecture, 2018 IEEE 37th International Performance Computing and Communications Conference (IPCCC), pp.1-8, 2018.

W. M. Bennaceur and L. Kloul, Safety analysis of a Data Center system, Maîtrise des risques et transformation numérique : opportunités et menaces, 2018.

W. M. Bennaceur, L. Kloul, and A. Rauzy, Safety analysis of a data center's electrical system using production trees, in Marco Bozzano and Yiannis Papadopoulos (eds.), pp.82-96, 2017.

W. M. Bennaceur and L. Kloul, Formal models for safety and performance analysis of a data center system, Reliability Engineering & System Safety, vol.193, p.106643, 2020.

S. M. Biemer and A. P. Sage, Systems Engineering: Basic Concepts and Life Cycle, pp.145-171, 2010.

M. Bouissou, H. Bouhadana, M. Bannelier, and N. Villatte, Knowledge modelling and reliability processing: Presentation of the figaro language and associated tools, IFAC Proceedings Volumes, vol.24, pp.69-75, 1991.

R. C. Burk, Systems Engineering in Professional Practice, pp.197-226, 2010.

C. Sauer and H. Daduna, Availability formulas and performance measures for separable degradable networks, Stochastics and Quality Control, vol.18, pp.165-194, 2003.

G. Callou, J. Ferreira, P. Maciel, D. Tutsch, and R. Souza, An integrated modeling approach to evaluate and optimize data center sustainability, dependability and cost, Energies, vol.7, pp.238-277, 2014.

P. Carer, J. Bellvis, M. Bouissou, J. Domergue, and J. Pestourie, A new method for reliability assessment of electrical power supplies with standby redundancies, Proceedings of the 7th International Conference on Probabilistic Methods Applied to Power Systems (PMAPS), 2002.

W. Cazzola, Domain-specific languages in few steps, Software Composition, pp.162-177, 2012.

C. Chen, G. Wang, J. Sun, and W. Xu, Detecting data center cooling problems using a data-driven approach, pp.1-8, 2018.

J. Cho, J. Yang, and W. Park, Evaluation of air distribution system's airflow performance for cooling energy savings in high-density data centers, Energy and Buildings, vol.68, pp.270-279, 2014.

R. Couto, S. Secci, M. E. M. Campista, and L. H. M. K. Costa, Reliability and survivability analysis of data center network topologies, 2015.

C. Dabrowski and F. Hunt, Using Markov chain and graph theory concepts to analyze behavior in complex distributed systems, The 23rd European Modeling and Simulation Symposium, 2011.

J. B. Dugan, Automated analysis of phased-mission reliability, IEEE Transactions on Reliability, vol.40, issue.1, pp.45-52, 1991.

J. B. Dugan, S. J. Bavuso, and M. A. Boyd, Dynamic fault-tree models for fault-tolerant computer systems, IEEE Transactions on Reliability, vol.41, issue.3, pp.363-377, 1992.

Facebook, Engineering Facebook's notes, 2017.

N. Farrington, G. Porter, S. Radhakrishnan, H. Bazzaz, V. Subramanya et al., Helios: A hybrid electrical/optical switch architecture for modular data centers, vol.41, pp.339-350, 2010.

K. Caldwell, G. Arrhenius, and S. Wold, A tribute to the memory of Svante Arrhenius, 2008.

P. Maciel, D. Tutsch, G. Callou, and J. Araujo, Models for dependability and sustainability analysis of data center cooling architectures, Dependable Systems and Networks Workshops, pp.1-6, 2012.

P. Gill, N. Jain, and N. Nagappan, Understanding network failures in data centers: Measurement, analysis, and implications, SIGCOMM Comput. Commun. Rev, vol.41, issue.4, pp.350-361, 2011.

A. Greenberg, J. Hamilton, N. Jain, S. Kandula, C. Kim et al., VL2: A scalable and flexible data center network, Communications of the ACM, vol.54, pp.95-104, 2009.

M. Gudemann and F. Ortmeier, A framework for qualitative and quantitative formal model-based safety analysis, IEEE 12th International Symposium on High Assurance Systems Engineering, pp.132-141, 2010.

C. Guo, G. Lu, D. Li, H. Wu, X. Zhang et al., BCube: A high performance, server-centric network architecture for modular data centers, ACM SIGCOMM, 2009.

C. Guo, H. Wu, K. Tan, L. Shi, Y. Zhang et al., DCell: A scalable and fault-tolerant network structure for data centers, vol.38, pp.75-86, 2008.

L. A. Barroso and U. Hölzle, The Datacenter as a Computer, 2009.

J. Hillston and N. Thomas, Product form solution for a class of PEPA models, Proceedings of the IEEE International Computer Performance and Dependability Symposium (IPDS'98), pp.152-161, 1998.

N. Jiang and M. Parashar, Enabling autonomic power-aware management of instrumented data centers, Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Processing, IPDPS '09, pp.1-8, 2009.

C. Kehren, Motifs formels d'architectures de systèmes pour la sûreté de fonctionnement, 2005.
URL : https://hal.archives-ouvertes.fr/tel-00011496

L. Kloul and A. Rauzy, Production trees: a new modeling methodology for production availability analyses, Reliability Engineering & System Safety, vol.167, pp.561-571, 2017.

S. Kounev, K. Sachs, J. Bacon, and A. Buchmann, A methodology for performance modeling of distributed event-based systems, 11th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC), pp.13-22, 2008.

S. Kounev, K. Bender, F. Brosig, N. Huber, and R. Okamoto, Automated simulation-based capacity planning for enterprise data fabrics, Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques, SIMUTools '11, pp.27-36, 2011.

N. Larrieu and A. Varet, Developing Model-Based Design Methods in Software Engineering, pp.1-22, 2014.

H.-C. Liu, J.-X. You, Z. Li, and G. Tian, Fuzzy Petri nets for knowledge representation and reasoning, Eng. Appl. Artif. Intell, vol.60, issue.C, pp.45-56, 2017.

D. Long and Z. Scott, A Primer for Model-Based Systems Engineering, 2011.

M. Patterson, The effect of data center temperature on energy efficiency, Proceedings of the ITHERM08, 11th Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems, 2008.

D. S. Marcon, R. R. Oliveira, L. P. Gaspary, and M. P. Barcellos, Datacenter Networks and Relevant Standards, chapter 4, pp.73-104, 2015.

M. Marwah, P. Maciel, A. Shah, R. Sharma, T. Christian et al., Quantifying the sustainability impact of data center availability, ACM SIGMETRICS Performance Evaluation Review, vol.37, issue.4, pp.64-68, 2010.

G. Merle, J. Roussel, J. Lesage, and A. Bobbio, Probabilistic algebraic analysis of fault trees with priority dynamic gates and repeated events, IEEE Transactions on Reliability, vol.59, pp.250-261, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00480014

Microsoft, Creating a greener data center, 2013.

W. A. Montgomery, pp.143-149, 1979.

J. Noble, A. Taivalsaari, and I. Moore, Prototype-based programming: Concepts, languages and applications, 1999.

P. D. O'Connor, Reliability and risk assessment, J. D. Andrews and T. R. Moss, Longman Scientific and Technical, vol.11, pp.74-74, 1993.

Y. Papadopoulos, M. Walker, D. Parker, E. Rüde, R. Hamann et al., Engineering failure analysis and design optimisation with HiP-HOPS, The Fourth International Conference on Engineering Failure Analysis, vol.18, pp.590-608, 2011.

G. S. Parnell, Introduction to Systems Engineering, pp.183-195, 2010.

C. Patel, C. Bash, R. Sharma, M. Beitelmal, and R. Friedrich, Smart cooling of data centers, vol.01, 2003.

M. Pedram and S. Nazarian, Thermal modeling, analysis, and management in VLSI circuits: Principles and methods, Proceedings of the IEEE, vol.94, pp.1487-1501, 2006.

K. Camboim, J. Ferreira, P. Maciel, R. Souza, and G. Callou, The effects of temperature variation on data center IT systems, IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2013.

T. Rak, Performance modeling using queueing Petri nets, Computer Networks, pp.321-335, 2017.

K. Rao, V. Gopika, V. V. Sanyasi-Rao, H. S. Kushwaha, A. K. Verma et al., Dynamic fault tree analysis using Monte Carlo simulation in probabilistic safety assessment, Reliability Engineering & System Safety, vol.94, issue.4, pp.872-883, 2009.

N. Rasmussen, Calculating total cooling requirements for data centers, 2015.

A. Rauzy, Guarded transition systems: a new states/events formalism for reliability studies, Journal of Risk and Reliability, vol.222, pp.495-505, 2008.

A. B. Rauzy and C. Haskins, Foundations for model-based systems engineering and model-based safety assessment, Systems Engineering, vol.22, issue.2, pp.146-155, 2019.

R. Robidoux, H. Xu, L. Xing, and M. Zhou, Automated modeling of dynamic reliability block diagrams using colored Petri nets, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.40, issue.2, pp.337-351, 2010.

P. Rygielski and S. Kounev, Data center network throughput analysis using queueing Petri nets, IEEE 34th International Conference on Distributed Computing Systems Workshops (ICDCSW), pp.100-105, 2014.

A. Sharma and R. G. Sangeetha, Reliability Analysis of Data Center Network, vol.99, p.99

J. Shin, B. Wong, and E. Sirer, Small-world datacenters, Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC '11, vol.2, pp.1-2, 2011.

S. Silva, B. Silva, P. R. M. Maciel, and A. Zimmermann, Dependability evaluation of data center power infrastructures considering substation switching operations, Probabilistic Safety Assessment and Management Conference, 2014.

W. J. Stewart, Introduction to the Numerical Solution of Markov Chains, 1994.

T. Wang, Towards cost-effective and low latency data center network architecture, Computer Communications, vol.82, pp.1-12, 2016.

W. P. Turner, J. H. Seader, and K. G. Brill, Tier classifications define site infrastructure performance, 2013.

V. Sharma and J. T. Virtamo, A finite buffer queue with priorities, Performance Evaluation, vol.47, pp.1-21, 2002.

P. V. Varde and M. G. Pecht, System Reliability Modeling, pp.71-113, 2018.

G. Wang, L. Zhang, and W. Xu, What can we learn from four years of data center hardware failures, 2017 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), pp.25-36, 2017.

G. Wang, D. G. Andersen, M. Kaminsky, K. Papagiannaki, T. S. Ng et al., c-through: part-time optics in data centers, SIGCOMM Comput. Commun. Rev, vol.41, issue.4, 2010.

J. X. Wang and M. L. Roush, What Every Engineer Should Know About Risk Engineering and Management. What Every Engineer Should Know, 2000.

M. Wiboonrat, An empirical study on data center system failure diagnosis, Internet Monitoring and Protection, 2008.

X. Wang, Y. Yao, X. Wang, K. Lu, and Q. Cao, CARPO: Correlation-aware power optimization in data center networks, Proceedings of the 2012 IEEE INFOCOM, 2012.

J. Xiang, F. Machida, K. Tadano, K. Yanoo, W. Sun et al., A static analysis of dynamic fault trees with priority-AND gates, Sixth Latin-American Symposium on Dependable Computing, pp.58-67, 2013.

J. Xie, Y. Deng, and K. Zhou, Totoro: A scalable and fault-tolerant data center network by using backup port, Network and Parallel Computing, pp.94-105, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01513881

J. Xu, L. Tang, and T. Li, System situation ticket identification using SVMs ensemble, Expert Syst. Appl, vol.60, issue.C, pp.130-140, 2016.

D.-Y. Yang, K.-H. Wang, and Y.-T. Kuo, Economic application in a finite capacity multi-channel queue with second optional channel, Applied Mathematics and Computation, vol.217, pp.7412-7419, 2011.

J. Zhang, H. Kim, Y. Liu, and M. A. Lundteigen, Combining system-theoretic process analysis and availability assessment: A subsea case study, Proceedings of the Institution of Mechanical Engineers, vol.233, pp.520-536, 2019.

C. Zhou, C. Yang, C. Wang, and X. Zhang, Numerical simulation on a thermal management system for a small data center, International Journal of Heat and Mass Transfer, vol.124, pp.677-692, 2018.