Options for building highly available systems on AWS. Overcoming disruptions. Part 1

Even cloud-industry giants like Amazon have problems with their hardware. In light of the recent outages in the US East-1 region, this article may prove helpful.


Fault tolerance is one of the key characteristics of any cloud system. Yet every day many applications are designed and deployed on AWS without it being taken into account. The reasons range from a lack of technical knowledge about designing fault-tolerant systems to the high cost of building a full-fledged high-availability setup on AWS. This article highlights several solutions that help overcome provider-side hardware failures and build more resilient systems on the AWS infrastructure.
A typical web application consists of the following levels: DNS, load balancer, web server, application server, database, and cache. Let's take this stack and look in detail at the main points to consider when building highly available systems:
  • Building highly available systems on AWS
  • High availability at the web server / application server level
  • High availability at the load balancing / DNS level
  • High availability at the database level
  • Building highly available systems across AWS availability zones
  • Building highly available systems across AWS regions
  • Building highly available systems across cloud and hosting providers

The first three topics are covered in this part; the remaining ones are left for Part 2.

High availability at the web server / application server level

To avoid having a component that is a single point of failure (SPOF), it is common practice to run a web application on two or more EC2 instances. This provides higher fault tolerance than a single server. Web and application servers can be configured with or without health checks. Below are the most common architectures for highly available systems that use health checks (a minimal launch sketch follows the figure):

image
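As an illustration of the first step, running the application on more than one server, here is a minimal sketch using boto3 (the AWS SDK for Python, which is not part of the original article); the AMI ID, key pair, and security group are placeholders chosen for the example.

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    def launch_web_servers(zones=("us-east-1a", "us-east-1b")):
        """Launch one web server per availability zone so the application
        survives the loss of a single instance or zone."""
        instance_ids = []
        for zone in zones:
            response = ec2.run_instances(
                ImageId="ami-0123456789abcdef0",            # placeholder: image with the web server baked in
                InstanceType="t3.small",
                MinCount=1,
                MaxCount=1,
                KeyName="web-key",                          # placeholder key pair
                SecurityGroupIds=["sg-0123456789abcdef0"],  # placeholder security group
                Placement={"AvailabilityZone": zone},
            )
            instance_ids.append(response["Instances"][0]["InstanceId"])
        return instance_ids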


Key points to pay attention to when building such a system:

  • Because AWS currently does not support multicast, application-level data must be synchronized over ordinary unicast TCP. For Java applications you can use JGroups, Terracotta, or similar software to synchronize data between servers. In the simplest case you can use one-way synchronization with rsync; a more versatile and reliable solution is a distributed network file system such as GlusterFS.
  • To store user data and session information you can use Memcached on EC2, Amazon ElastiCache, or Amazon DynamoDB (a small sketch follows this list). For greater reliability an ElastiCache cluster can be deployed across several AWS availability zones.
  • Using an Elastic IP to switch between servers is not recommended for highly critical systems, since reassigning the address can take up to two minutes.
  • User data and sessions can also be stored in the database. Use this mechanism with caution and evaluate the read/write latency it adds.
  • Files and documents uploaded by users should be kept on shared storage such as NFS, a Gluster storage pool, or Amazon S3.
  • Sticky sessions should be enabled at the Amazon ELB or reverse-proxy level if sessions are not synchronized through a shared store, database, or similar mechanism. This approach provides high availability, but it does not provide fault tolerance at the application level.
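As an example of keeping session state in a shared store rather than on individual web servers, here is a minimal sketch using boto3 and DynamoDB; the table name user_sessions and its session_id partition key are assumptions made for this illustration, not something prescribed by the article.

    import time
    import boto3

    dynamodb = boto3.resource("dynamodb", region_name="us-east-1")
    sessions = dynamodb.Table("user_sessions")   # assumed table with partition key "session_id"

    def save_session(session_id, data, ttl_seconds=3600):
        """Write session data to DynamoDB so any web server behind the balancer can read it."""
        sessions.put_item(Item={
            "session_id": session_id,
            "data": data,
            "expires_at": int(time.time()) + ttl_seconds,  # usable as a TTL attribute if enabled on the table
        })

    def load_session(session_id):
        """Return the stored session item, or None if it does not exist."""
        return sessions.get_item(Key={"session_id": session_id}).get("Item")

The same pattern applies to Memcached or ElastiCache: the point is simply that session reads and writes go to a store shared by all web servers.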

High availability at the load balancing / DNS level

The DNS / load-balancing level is the main entry point to a web application. There is no point in building complex clusters and heavily replicated web farms at the application and database levels without making the DNS / load-balancer level highly available as well. If the load balancer is a single point of failure, its failure takes down the entire system. Below are the most common solutions for high availability at the load-balancer level:

    image

1) Use Amazon Elastic Load Balancer as a highly available load balancer. Amazon ELB automatically distributes application load across multiple EC2 servers. The service gives you more than ordinary fault tolerance: it smoothly grows the capacity across which the load is distributed depending on the intensity of the incoming traffic, so it can serve many thousands of simultaneous connections and expand flexibly as the load increases. ELB is itself a fault-tolerant component that recovers from failures in its own infrastructure: as load grows, additional ELB EC2 virtual machines are added automatically, which removes the single point of failure and keeps load balancing working even if some of the ELB EC2 machines fail. Amazon ELB also checks the availability of the services it balances and, in case of problems, automatically routes requests to the servers that remain available. ELB can be configured either for simple round-robin distribution without health checks or with sticky sessions and health checks enabled. Keep in mind that if session synchronization is not implemented, even sticky sessions cannot guarantee the absence of application errors when one of the servers fails and users are redirected to another available server.
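For completeness, here is a hedged sketch of creating a classic ELB with a health check and attaching two instances, using boto3 rather than the tools the original 2012 article assumed; the balancer name, health-check path, and instance IDs are placeholders.

    import boto3

    elb = boto3.client("elb", region_name="us-east-1")

    # Create a classic load balancer listening on HTTP port 80.
    elb.create_load_balancer(
        LoadBalancerName="web-lb",
        Listeners=[{"Protocol": "HTTP", "LoadBalancerPort": 80,
                    "InstanceProtocol": "HTTP", "InstancePort": 80}],
        AvailabilityZones=["us-east-1a", "us-east-1b"],
    )

    # Health check: an instance is taken out of rotation after two failed probes.
    elb.configure_health_check(
        LoadBalancerName="web-lb",
        HealthCheck={"Target": "HTTP:80/health", "Interval": 30, "Timeout": 5,
                     "UnhealthyThreshold": 2, "HealthyThreshold": 2},
    )

    # Register the web servers launched earlier (instance IDs are placeholders).
    elb.register_instances_with_load_balancer(
        LoadBalancerName="web-lb",
        Instances=[{"InstanceId": "i-0aaa111122223333a"},
                   {"InstanceId": "i-0bbb111122223333b"}],
    )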

2) Sometimes applications require:
  • complex load balancing with caching (e.g. Varnish);
  • specific load-distribution algorithms:
      • least connections: servers with fewer active connections receive more requests;
      • weighted least connections: servers with fewer active connections and greater capacity receive more requests;
      • destination hash scheduling: requests are distributed based on the destination address;
      • source hash scheduling: requests are distributed based on the source address;
      • locality-based least-connection scheduling: requests go to the servers with fewer active connections, taking the destination IP address into account;
  • handling large short-term traffic spikes;
  • a fixed IP address for the load balancer.

In all of these cases Amazon ELB is not suitable. It is better to use third-party load balancers or reverse proxies such as Nginx, Zeus, HAProxy, or Varnish, but then you have to make sure they do not become a single point of failure themselves; the simplest solution is to run several load balancers. The Zeus reverse proxy already has built-in clustering functionality; for the other services round-robin DNS has to be used. Let's look at this mechanism in more detail, but first let's note a few key points to consider when building a reliable load-distribution system on AWS:

  • Several Nginx or HAProxy instances can be configured for high availability on AWS; these services can check backend availability and distribute requests among the available servers.
  • Prefer horizontal scaling of load balancers over vertical scaling. Horizontal scaling increases the number of individual machines performing the balancing function and eliminates the single point of failure. Scaling load balancers such as Nginx and HAProxy requires your own scripts and machine images; using Amazon Auto Scaling is not recommended in this case.
  • To detect a failed load balancer you can use Amazon CloudWatch or third-party monitoring services such as Nagios, Zabbix, or Icinga; if one of the servers becomes unavailable, scripts built on the EC2 command-line tools can launch a replacement load-balancer instance within a few minutes (see the sketch after this list).
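Here is a minimal sketch, referenced in the last point above, of the kind of script that launches a replacement balancer from a prepared image; it uses boto3 instead of the EC2 command-line tools mentioned in the article, and the AMI and security group IDs are placeholders.

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    def replace_failed_balancer():
        """Launch a fresh balancer from a pre-baked image and wait until it is running,
        so a monitoring script can then put it back into the DNS rotation."""
        response = ec2.run_instances(
            ImageId="ami-0feedfacefeedface0",           # placeholder image with Nginx/HAProxy configured
            InstanceType="t3.small",
            MinCount=1,
            MaxCount=1,
            SecurityGroupIds=["sg-0123456789abcdef0"],  # placeholder security group
        )
        instance_id = response["Instances"][0]["InstanceId"]
        ec2.get_waiter("instance_running").wait(InstanceIds=[instance_id])
        return instance_id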


Now let's discuss the level above the balancer: DNS. Amazon Route 53 is a highly available, reliable, and scalable DNS service. It can effectively route user queries to Amazon services such as EC2, S3, and ELB, as well as to resources outside the AWS infrastructure. Route 53 is essentially a managed, highly available DNS server that can be configured both from the CLI and from the web console. The service supports both round-robin and weighted load distribution and can spread requests across individual EC2 servers behind a load balancer as well as across Amazon ELBs. With plain round-robin distribution, health checking and redirection of requests to available servers are not performed and must be handled at the application level.
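A hedged sketch of registering two balancer addresses as weighted A records in Route 53 with boto3; the hosted zone ID, record name, and IP addresses are placeholders, and equal weights simply give a round-robin-like split.

    import boto3

    route53 = boto3.client("route53")

    def register_balancers(zone_id, name, addresses):
        """Create one weighted A record per balancer so traffic is split between them."""
        changes = [{
            "Action": "UPSERT",
            "ResourceRecordSet": {
                "Name": name,
                "Type": "A",
                "SetIdentifier": f"balancer-{i}",   # distinguishes the records of one name
                "Weight": 10,
                "TTL": 60,
                "ResourceRecords": [{"Value": ip}],
            },
        } for i, ip in enumerate(addresses)]
        route53.change_resource_record_sets(
            HostedZoneId=zone_id,
            ChangeBatch={"Changes": changes},
        )

    # Example call with placeholder values:
    # register_balancers("Z123EXAMPLE", "www.example.com.", ["203.0.113.10", "203.0.113.11"])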

High availability at the database level

Data is the most valuable part of any application, and designing high availability at the database level is the highest priority in any highly available system. To eliminate a single point of failure at the database level, it is common practice to use multiple database servers with data replication between them; this can be either a cluster or a Master-Slave scheme. Let's look at the most popular solutions to this problem on AWS:

    image

1) Master-Slave replication.
We can use one EC2 server as the primary (master) and one or more servers as secondaries (slaves). If these servers are in the public cloud, Elastic IPs should be used; in a private cloud (VPC) the servers can reach each other through private IP addresses. In this mode the database servers can use asynchronous replication. When the primary database server fails, we can switch a secondary server into Master mode with scripts, thereby maintaining availability. Replication can be arranged as Active-Active or Active-Passive. In the first case, writes and any reads that must immediately see freshly written data go to the primary server, while other read operations can be served by the secondary server. In the second case, all reads and writes go only to the primary server, and the secondary server is used only if the primary becomes unavailable, after it has been switched into Master mode. It is recommended to use EBS-backed storage for the EC2 database servers to ensure reliability and durability at the disk level. For additional performance and data integrity, the database servers can be configured with various RAID options over EBS volumes.
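A minimal sketch of the failover step described above: re-pointing the database Elastic IP at the standby server with boto3. The allocation ID is a placeholder, and promoting the standby to read-write mode (for MySQL, stopping replication and clearing read_only) is a separate step not shown here.

    import boto3

    ec2 = boto3.client("ec2", region_name="us-east-1")

    def fail_over_database(standby_instance_id, eip_allocation_id):
        """Move the Elastic IP used by the application to the standby database server,
        so clients keep connecting to the same address after the master fails."""
        ec2.associate_address(
            InstanceId=standby_instance_id,
            AllocationId=eip_allocation_id,   # placeholder allocation ID of the database Elastic IP
            AllowReassociation=True,          # take the address away from the failed master
        )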

2) MySQL NDB Cluster.
We can configure two or more MySQL EC2 servers as SQL and data nodes for storing data, plus one management server, to form the cluster. The data nodes replicate data among themselves, so reads and writes can be distributed across all storage nodes simultaneously. When one of the storage nodes fails, the remaining ones stay active and handle all incoming requests. In the public cloud, Elastic IP addresses are used for each server in the cluster; in a private cloud, internal IP addresses can be used. As before, EBS-backed storage is recommended for the EC2 database servers, and various RAID configurations over EBS volumes can provide additional performance and data integrity.
3) Availability zones in conjunction with RDS.
If we use Amazon RDS for MySQL at the database level, we can create a Master server in one availability zone and a Hot Standby server in another. In addition, we can run several Read Replica servers in multiple availability zones. The primary and standby RDS nodes use synchronous replication, while Read Replicas are updated asynchronously. When the Master RDS server becomes unavailable, the Hot Standby automatically takes over at the same address within a few minutes. All writes, and reads that must immediately see freshly written data, should go to the Master server, while other reads can be served by the Read Replicas. All RDS instances use EBS storage. RDS provides automated backups and point-in-time recovery, and it can also run inside a private cloud (VPC).
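A hedged sketch of provisioning such a setup with boto3: a Multi-AZ MySQL instance (the hot standby is managed by RDS itself) plus one read replica; identifiers and credentials are placeholders.

    import boto3

    rds = boto3.client("rds", region_name="us-east-1")

    # Primary instance with a synchronously replicated standby in another AZ.
    rds.create_db_instance(
        DBInstanceIdentifier="app-db",
        Engine="mysql",
        DBInstanceClass="db.t3.small",
        AllocatedStorage=20,
        MasterUsername="admin",
        MasterUserPassword="change-me",   # placeholder credentials
        MultiAZ=True,
    )

    # Asynchronously updated read replica for scaling out read traffic.
    rds.create_db_instance_read_replica(
        DBInstanceIdentifier="app-db-replica-1",
        SourceDBInstanceIdentifier="app-db",
        DBInstanceClass="db.t3.small",
    )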

The remaining topics will be covered in the second part:
  • Building highly available systems across AWS availability zones
  • Building highly available systems across AWS regions
  • Building highly available systems across cloud and hosting providers


Original article: harish11g.blogspot.in/2012/06/aws-high-availability-outage.html
Author: Harish Ganesan