Fault Tolerance Detection, Mitigation and Performance Improvement in Cloud

International Journal of Computer Science and Engineering
© 2017 by SSRG - IJCSE Journal
Volume 4 Issue 4
Year of Publication : 2017
Authors : S. Veerapandi, Dr. K. Alagarsamy

How to Cite?

S. Veerapandi, Dr. K. Alagarsamy, "Fault Tolerance Detection, Mitigation and Performance Improvement in Cloud," SSRG International Journal of Computer Science and Engineering, vol. 4, no. 4, pp. 1-4, 2017. Crossref, https://doi.org/10.14445/23488387/IJCSE-V4I4P101


Computers have become a necessity in the modern world. A computer system needs both software and hardware components. With continued technological development, software and hardware prices have risen sharply, and purchasing new hardware and software has become a tedious and expensive process. Research has found that cloud computing can overcome this problem through usage-based payment, which is growing as a popular technical solution. Many problems persist in cloud computing; this paper addresses fault tolerance and sustainability. We propose a novel idea to improve fault tolerance and performance in the cloud.


Cloud Computing, MPI, Indexing Table.

