COS20019 Cloud Computing Architecture - Assignment 02 Scalable Cloud Computing Architecture (30%)
1. Lab Exercises (5% in total, 5 Lab
Exercises, 1% each)
1. Complete ACF and ACA Lab Exercises as per the instructions
2. Demo your work in the presence of your co-teacher for each Lab
Marking Criteria for Lab Exercises
Below is the marking criteria of each Lab exercise
-
Complete all work required (e.g. be able to show the relevant configuration information on the cloud environment and can explain how the relevant components work together) - 1%
-
Partially complete the work required (e.g. can show partial information but not fully understanding how the relevant components work together) - 0.5%
-
Cannot complete the work required (e.g. do not understand what to do) - 0%
2. Scalable Cloud Project - Highly Available
Photo Album Web Application (22% in total)
In the cloud project of Assignment 1, you have learnt how to deploy a Photo Album Web Application on a VPC, actually in a EC2 running as a Web Server. However, this is not a highly available environment. In case there are some hardware failure related to that particular EC2 instance running the Web Server. The whole Web Applicaiton will be down and hence cannot be used by the public users.
This is what we want to overcome in this assignment. We will configure this solution to make it highly available. In other words, we try to make the Photo Album Web Application (1) highly available and (2) a bit scalable. Yes, we want a bit scalable because there are many levels of scalability as you may have known by now. In this assignment, we can only experience one or two aspects of the scalability. Our approach in this assignment is to have multiple EC2 instances running the same Web Application so that together they share the load and hence can handle higher loading.
Furthermore, in this project, we allow users to upload photos using a web browser via the internet. When a photo is uploaded from the internet, the application will also create a thumb nail image for ease of reference.
Similar to the cloud project in Assignment 1, this project has two parts, namely infrastructure deployment and functionality requirements. They are designed to test whether you can integrate the skills learnt in the ACF and ACA Lab exercises in this subject in deploying a highly available Web application on the cloud. You are not required to program the Web Application. You will be given the code of the Web Application and instructions to revise the code.
Since we are going to make the web application scalable by having more EC2 instances running the same web application at the same time, we need to have a working system first and then we replicate the system to different EC2 instances. Hence, in this assignment, we need to deal with the functional requirements first. Once we have a working system, we can then configure our cloud infrastructure to make it highly available.
Unlike the cloud project in Assignment 1, you will only be given the requirements of each part with some hints on the steps. You need to think about the steps carefully before you start to attempt this project. Probably a revision of the relevant ACF and ACA labs may help.
Prerequisite requirements
-
Completed ACF Labs 01 - 05 (Assignment 1 Part 1)
-
Completed ACF Lab 06 and ACA Modules 9 and 13 Guided Labs (included in Part 1 Lab Exercises)
-
Completed Assignment 1
-
Know how to use AWS PHP SDK
Possible AWS Accounts for cloud project
You have a choice of two accounts / environments you can use to complete the cloud project of this assignment.
-
AWS Academy Learner Lab (recommended): accessible through AWS Canvas. Note that this is NOT the sandbox in ACA/ACF courses that you use for your ACA/ACF Labs. This is a managed environment that allows your co-teacher to gain access to your AWS console so your work can be marked/troubleshooted. This class gives you US$100 credit. Use it carefully. This account is deleted at the end of the semester.
-
Regular AWS account (NOT recommended): new AWS accounts are eligible for a free tier. This gives you more freedom, but you need to be careful as you will be charged for the services if you go outside the free tier offering. Make sure to keep track of your AWS services usage (using Billing & Cost Management Dashboard) throughout the entire learning to avoid paying fees. This account is on-going, but some services are no longer free after 12 months. If you choose this
3
option, you will need to create a (read-only) IAM user and provide its credentials to your co- teacher so your assignment can be marked properly.
Getting the files ready
Download the following files from Cloud Campus
-
EC2-setup-script-2023-ami.txt (same as the one in Assignment 1)
-
Remote_Access_to_an_EC2.pdf (MS Windows) or Remote_Access_to_an_EC2_from_a_Mac.pdf (Mac OS or Linux) (same as the one in Assignment 1)
-
Install_phpMyAdmin_on_EC2.pdf (same as the one in Assignment 1)
-
photoalbum.zip (a totally new file for Assignment 2)
This zip file contains the full source code of the Photo Album Web Application. It is different from the one in Assignment 1. Hence the set up script will be different. You need to customize it yourself.
Furthermore, after you unzip this file, there is a folder named photoalbum by default. This may conflict with your Assignment 1’s photoalbum folder. Be careful. Please be remember to put all files inside /var/www/html/photoalbum. This is different from /var/www/html/cos20019/photoalbum in Assignment 1.
Last, but not least, you need to read the file constants.php carefully. There are some extra steps in the file telling you to install the AWS SDK inside /var/www/html/.
-
photoalbum-setup-script.txt (same as the one in Assignment 1 but you need to modify this file for Assignment 2)
-
lambda-deployment-package-0.1.zip (a totally new file for Assignment 2)
This zip file contains the full source code of the CreateThumbnail Lambda function.
2.1. Part 1 Functional Requirements
Figure 1. Cloud Architecture for Highly Available Photo Album Cloud Project
Figure 1 shows the cloud infrastructure and services of the Highly Available Photo Album Cloud Project. First, we need to make sure that we have a working system in the EC2 instance named Dev Server. This EC2 instance serves two purposes:
1. as a development machine - allows developers to ensure that a working system is ready (e.g. make changes to the code and test the code properly before putting it into production use)
2. as a template image - allows us to create new EC2 instances based on the image of this machine (e.g. software installed and programming code)
The Photo Album Web Application is to be hosted on your EC2 web servers. The full source code has been provided to you in photoalbum.zip.
You need to modify the constants.php file in the provided code (carefully read the comments in the file) using available information from your S3 bucket, RDS database, and Lambda function.
The web site should be accessible through http://[your.elb.dns]/photoalbum/album.php if the directory structure in your web server is as specified in the constants.php file.
Please be very careful about this. In Assignment 1, we use the public ip v4 address to access the Photo Album Web Application. However, after we have made the Auto Scaling Group available, we use the public ip v4 address of the Elastic Load Balancer to access the Photo Album Web Application. While we prepare the Web Application (the extra one is the Photo Uploading feature see Part 1 Requirement 2 below), we need to use the public ip v4 address of the Dev Server to test our Web
Application first. Once it is OK, we then use it as the machine image to launch the Web Servers in the Auto Scaling Group.
Furthermore, this project requires AWS SDK installation on your EC2 web server instances. Read and follow the provided instructions in the constants.php file carefully.
Below are the individual functional requirements of this Photo Album Web Application.
2.1.1. Requirement 1 Photo Album (album.php)
This page lists all the photos whose meta-data are stored in the database. Programmatically, this page performs the following actions:
• Establish a connection to the RDS instance
• Request to retrieve all the records in the database table in the RDS instance
• The RDS instance will then send back all the records to the EC2 server hosting this web page
2.1.2. Requirement 2 Photo Uploading (photouploader.php)
This page allows you to upload a photo to an S3 bucket and insert its meta-data into the RDS database. In the meantime, a Lambda function called CreateThumbnail will create a resized verion of the photo that was just uploaded to S3.
Programmatically, this page performs the following actions (assuming you have all information filled and photo been selected):
-
When you click the [ Upload ] button on the page, the selected photo and its relevant meta-data will be uploaded from your local computer to an EC2 web server
-
The selected photo is then uploaded from the EC2 web server to the S3 bucket
-
The EC2 web server inserts the photo’s meta-data (title, description, creation date, keywords and reference) into the database in the RDS instance
-
The EC2 web server invokes the CreateThumbnail Lambda function with the bucket name and the photo name in the payload
-
The Lambda function downloads the photo in the bucket specified in the payload sent above, resizes it, and uploads the resized image to the same S3 bucket. The resized image is name "resized-<nameOfTheOriginalPhoto>.png"
For more details, please inspect the supplied source code.
You need to set the S3 policy properly to allow write access to your S3 bucket before you can test this upload feature. Please see Part 2 Requirement 3 for some examples in setting up the S3 policy.
2.2. Part 2 Cloud Infrastructure
This part of the assignment is to set up the infrastructure, which involves the following infrastructure as shown in Figure 1:
-
A Virtual Private Cloud (VPC) with appropriate availability zones, public and private subnets
-
A Load Balancer which is internet facing
-
A Network Address Translation, NAT, device that routes traffic to the private subnets
-
Several Security Groups to appropriately secure the access of the resources
-
An Auto Scaling Group to launch the Web Servers that run the Photo Album Web Application
-
An EC2 instance taking two roles:- (1) a Development Server and (2) a launch image for the EC2 instance in the Auto Scaling Group
-
An RDS database server instance
-
A S3 bucket that stores the photos uploaded by users
For this assignment, you have the following two options:
-
Modify your Assignment 1’s VPC to suit the requirements for this assignment, or
-
Start from scratch and build your VPC accordingly
It is totally your choice. Either way, I advise you to make a plan first and enact your plan step by step.
-
2.2.1. Requirement 1 Your VPC
The VPC used in Highly Available Photo Album Cloud Project is similar to that in Assignment 1. The following points should be noted:
-
Name: [FirstNameInitial][LastName]-vpc. For example, if your name is Man Lau, your VPC would be noted mlau-vpc, using all lower case is much easier for me. If you want, you can use a mix of the upper and lower case letters here, your choice.
-
Region: US East (N. Virginia) us-east-1
-
Two availability zones, each with a public and private subnet with suitable CIDR ranges (see
Figure 1)
-
Associate public subnets with a route table that routes to an Internet Gateway
-
Associate private subnets with a private route table that routes to a NAT device (see Part 2 Requirement 2 for details)
1. Due to some incompatibility issues on AWS’s management console, it is recommended that you create your VPC manually using the "Create VPC" button in the VPC tab. Please DO NOT use the "Start VPC Wizard" button in AWS dashboard
2. Do not use the default VPC. All services should be in your custom VPC.
2.2.2. Requirement 2 Network Address Translation (NAT)
For private EC2 instances (i.e.\ those EC2 instances in private subnets) to be able to communicate with the public internet, their private IP addresses need to be translated by a NAT device. You need to create this NAT device in a public subnet in your VPC. Please see Figure 1 for the location of this NAT device.
A NAT device is either a NAT gateway or a NAT instance (an EC2 instance taking the role of a NAT gateway).
So, you have the following two options
-
use a NAT gateway - A NAT gateway is easier to set up but very expensive. Please observe the routing from Figure 1 to configure your NAT gateway properly. Also, you need to check your costs on your Learner Labs.
-
use a NAT instance - A NAT instance is free but requires some set up. This means that you need to create an EC2 instance and configure it as a NAT server.
AWS has deprecated the relevant NAT AMI (Amazon Machine Image) for a NAT instance. Hence, you may need to create your own AMI if you choose this option.
Please see https://docs.aws.amazon.com/vpc/latest/userguide/ VPC_NAT_Instance.html for the relevant instructions on setting up a NAT instance and the relevant permissions (e.g. Security Group required).
2.2.3. Requirement 3 S3 Photo Storage
Photos are to be stored in an S3 bucket, which has been created from Assignment 1, ensuring objects stored in this S3 bucket are correctly accessible by applying the appropriate permissions and policies.
In AWS Learner Lab environment, you may not be able to create your own IAM roles due to AWS Academy’s restrictions. Nonetheless, an IAM role named "LabRole" or "LabinstanceProfile" with required permissions already exists in your management console that can be used for this assignment.
Since you will be uploading new photos to your S3 bucket, you need to set up appropriate S3 bucket policy to allow users to write to the S3 bucket.
You need to make sure that the uploading of new photos must be via your web application or done by you via the S3 Management Console, meaning that you cannot directly upload any photos to your S3 bucket using the URL of your S3 bucket.
Please see https://docs.aws.amazon.com/AmazonS3/latest/userguide/example-bucket-policies.html for the appropriate examples. You may need to modify the examples to suit your needs. There are so many options in this part. Here are two options (choose one for your assignment, you do not need to do both):
-
Allow write access via specific HTTP referrer
Please see https://docs.aws.amazon.com/AmazonS3/latest/userguide/example- bucket-policies.html#example-bucket-policies-HTTP-HTTPS-2 for an example.
-
Allow write access (e.g. s3:PutObject) from the IAM role "LabRole" or "LabinstanceProfile" to the S3 bucket
There is no single example in https://docs.aws.amazon.com/AmazonS3/latest/ userguide/example-bucket-policies.html that matches this requirement. You may need to read through some of them and modify them to suit your need.
2.2.4. Requirement 4 Load Balancing
Web request load needs to be distributed across the web servers in the auto-scaling group using an Application Load Balancer. Ensure that your Elastic Load Balancer (ELB) is running health checks on all instances.
Your ELB needs to be internet facing and routes the traffic to appropriate Web Servers, in the private subnets, running the Photo Album Web Application.
The health check path must be correctly configured (e.g. "/photoalbum/album.php", see the Functional Requirements for details). Otherwise, the health checks would fail.
2.2.5. Requirement 5 Auto Scaling
You need to create an Auto Scaling Group (ASG) that scales your Web Server’s EC2 instances automatically.
You need to define a scaling policy for your auto scaling group with at least the following rules:
-
The minimum number of servers is 2. The maximum number of servers is 4.
-
Configure a target tracking scaling policy to keep the request count per target of your ELB target group at 30 for your Auto Scaling group.
The ASG should launch instances into the private subnets.
2.2.6. Requirement 6 EC2 Web Server instance
Your EC2 Web Server instances should be based on Amazon Linux 2023 AMI, which is similar to the one used in Assignment 1. They need to be given proper permissions through an IAM role to be able to put objects into the S3 bucket and invoke the CreateThumbnail Lambda function (see Part 2 Requirement 7). The role must follow the least-privilege principle.
These instances should be automatically launched by the auto scaling group, and only accept 9 incoming traffic from the load balancer. Once launched, they should be ready to serve Photo Album users without any further human intervention. In other words, you should not have to do any configurations once the instances have been launched.
The Development (Dev) Server does not receive traffic from the ELB. The Dev server can be used to develop the custom AMI, which would contain everything needed to run the Photo Album Web Site (AWS PHP SDK, Apache Web Server, source code of the Web Application, etc.). It can also be used to manage your database (through phpMyAdmin - similar to Assignment 1).
2.2.7. Requirement 7 The CreateThumbnail Lambda function to scale the image
The CreateThumbnail Lambda function is used to scale the image in this project. This Lambda function has the following configurations:
• Name: CreateThumbnail
• Runtime: Python 3.11
• Architecture: arm64
• Execution role: LabRole
Since you cannot create your own role in the Learner Lab, we use the LabRole, which has been created for you in the Learner Lab environment. In real production environment, you need to create an IAM role with policies that allow this Lambda function to get objects from and put objects to the S3 bucket. The role must follow the least-privilege principle.
• Timeout : 30 seconds
The default is 3 seconds. It is better to change it to 30 seconds to avoid unexpected timing error.
Once the Lambda function has been created, you can upload a deployment package to add functionality to this function. The deployment package has been provided to you in lambda- deployment-package.zip. This package contains the library and full source code to resize images and dowload/upload images to S3 (for best result, please use PNG images). The package is ready to work without any modification.
In order to test this function, you can create a test event with the following input in AWS Lambda Console (via the Test tab):
{"bucketName" : "your-photo-bucket", "fileName": "your-image.png"}
An example could be
{"bucketName" : "mlau-photos-bucket", "fileName": "swinburnelogo.png"}
Since we are putting new objects to the same S3 bucket, please DO NOT set the trigger of this lambda function to S3 : All object create events. Doing so will end up having an infinite loop.
You are encourage to inspect the source code and understand the logic of this Lambda function. However, you will not be required to answer coding questions about Lambda functions. Having said that, general concepts on how Lambda functions work will be asked in the exam.
2.2.8. Rquirement 8 Database with RDS
Same RDS database created in Assignment 1.
You may think that since the Web Server EC2 instances are now in private subnets, access to phpMyAdmin from these servers would require some further configurations. That is correct. However, it is not necessary to go through phpMyAdmin from these servers. It is because we still have a Dev Server in the public subnet. And, it is acceptable to manage your DB through the Dev Server. Hence, there is no need to further configure the EC2 Web Server instances. That is also the reason why we keep the Dev Server instance in the public subnet.
2.2.9. Requirement 9 Security Groups
You need to create four to five security groups, depending on whether you choose to use a NAT Gateway or a NAT instance. Each security group is associated with a tier shown in the architecture diagram as in Figure 1:
• ELBSG: for the ELB created above
• WebServerSG : for all the web server EC2 instances in private subnets
• DBServerSG : for the RDS instance
• DevSG : for the Dev server
• NATSG : for the NAT instance
You need this NATSG if you use a NAT instance. In other words, if you use NAT gateway, you do not need this security group.
ELBSG, WebServerSG, DBServerSG and NATSG, if any, must follow the least-privilege principle, i.e., allowing all traffic from anywhere is NOT acceptable. DevSG does not have to follow the least- privilege principle.
Your RDS instance needs to be in a private subnet. Only WebServerSG security group can access it.
Security groups are stateful. See https://docs.aws.amazon.com/vpc/latest/userguide/ VPC_SecurityGroups.html for details.
If unsure about how to set up security groups and IAM roles, or unsure if your security groups and IAM roles are causing problems, you can make them wide open (allowing all traffic from anywhere, full permissions first), then tighten them later once your web application is fully functional[1]
Testing
The Photo Album Web Application should be accessible through http://[your.elb.dns]/photoalbum/album.php.
Using your Photo Album Web Application (http://[your.elb.dns]/photoalbum/photouploader.php), upload a few photos along with their meta-data
-
Check the S3 bucket to see if photos are actually uploaded and if their resized versions are created
-
Check the database to see if their meta-data is recorded
-
The Photo Album Web Application is accessible through the load balancer only
-
Terminate servers then check to see if replacement EC2 instances are automatically deployed by the ASG. Thoroughly test the functionality of the web application again once new instances have been launched
-
All EC2 targets are healthy
-
Test direct write access to your S3 bucket, which should not be publicly accessible
-
Double check all security groups and IAM roles, make sure they follow the least-privilege principle.