How to Configure Nginx as a Load Balancer: A Step-by-Step Guide
In this guide, we demonstrate how to configure Nginx as a load balancer. A load balancer is a crucial component that acts as a reverse proxy, intelligently distributing network or application traffic across a cluster of servers. This distribution ensures no single server is overwhelmed, leading to improved performance and reliability.
Load balancers are primarily deployed to enhance the capacity (supporting a larger number of concurrent users) and improve the reliability of applications. They optimize the overall application performance by offloading the burden on individual servers, which are otherwise responsible for managing and maintaining application and network sessions, as well as performing application-specific tasks. Essentially, they make applications more resilient and scalable.
Nginx, a widely-used web server software, can be readily configured as a simple yet highly effective load balancer. This allows you to significantly improve your server’s resource availability and efficiency. To follow along with the steps detailed in this guide, ensure you have access to a server and can log in as a non-root user with sudo privileges. Furthermore, you must have Nginx installed on your server.
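If Nginx is not yet installed, it can typically be installed from the default repositories on a Debian or Ubuntu system (package names may differ on other distributions):
sudo apt update
sudo apt install nginx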

1. Use Nginx Upstream Module as a Load Balancer
To set up Nginx as a load balancer, you will primarily use the Nginx upstream module. For this example, we’ll modify the default configuration file located in the /etc/nginx/sites-available directory.
sudo vi /etc/nginx/sites-available/default
The upstream block in Nginx configuration is where you define all the backend servers that form a cluster, working together to support a specific application or microservice.
Add an upstream block like the one shown below:
upstream backend {
    server backend1.example.com;
    server backend2.example.com;
    server backend3.example.com;
}
Next, reference this upstream group within your server configuration:
server {
    location / {
        proxy_pass http://backend;
    }
}
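In practice, you will often also want to pass client information through to the backends. Below is a minimal sketch of a fuller server block, reusing the backend upstream defined above; the proxy_set_header lines are optional additions, not part of the basic setup:
server {
    listen 80;

    location / {
        proxy_pass http://backend;
        # Forward the original host header and client address to the backends;
        # commonly needed by application logs and frameworks behind a proxy.
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    }
}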
Finally, restart the Nginx service to apply the changes:
sudo service nginx restart
After restarting, Nginx will distribute incoming requests across the servers defined in the upstream block, using the round-robin method by default.
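Tip: whenever you edit the configuration, you can check it for syntax errors before restarting:
sudo nginx -t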
Furthermore, Nginx provides several directives that allow you to refine the load balancing behavior for optimal performance.
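For instance, the least_conn method (shown below as a brief sketch, reusing the servers from the example above) sends each new request to the server with the fewest active connections, which helps when request durations vary widely:
upstream backend {
    least_conn;
    server backend1.example.com;
    server backend2.example.com;
    server backend3.example.com;
}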
2. Weight in Nginx Load Balancer
One common configuration for Nginx load balancers is assigning weights to individual servers. If one server possesses more computational power than others, you can configure it to handle a larger portion of the traffic by assigning it a higher weight.
A load-balanced configuration with server weights might resemble this:
upstream backend {
server backend1.example.com weight=1;
server backend2.example.com weight=2;
server backend3.example.com weight=4;
}
The default weight is 1. With a weight of 2, backend2.example.com will receive twice the traffic of backend1.example.com, and backend3.example.com, with a weight of 4, will handle twice the traffic of backend2.example.com and four times that of backend1.example.com.
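To observe the weighted distribution, one rough check is to send a batch of requests and tally the responses. This assumes each backend serves a one-line page identifying itself (such as its hostname), and your-load-balancer is a placeholder for your Nginx server's address:
# Send 70 requests through the load balancer and count how many each backend answered.
# With weights 1, 2, and 4, expect roughly a 10/20/40 split.
for i in $(seq 1 70); do curl -s http://your-load-balancer/; done | sort | uniq -c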
3. Hash Load Balancing
IP hash load balancing uses the client's IP address as a hashing key (for IPv4, Nginx hashes the first three octets), so requests from a given client are consistently directed to the same backend server. This makes it useful for session persistence.
The following configuration illustrates an example. The down parameter marks a server as temporarily unavailable; with ip_hash, marking it down (rather than removing it) preserves the current hashing of client addresses:
upstream backend {
    ip_hash;
    server backend1.example.com;
    server backend2.example.com;
    server backend3.example.com down;
}
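Beyond ip_hash, newer Nginx versions also provide a generic hash directive that hashes on an arbitrary key. Below is a brief sketch that hashes on the request URI; the consistent parameter enables consistent (ketama) hashing, which minimizes remapping when servers are added or removed:
upstream backend {
    # Requests for the same URI always reach the same server.
    hash $request_uri consistent;
    server backend1.example.com;
    server backend2.example.com;
    server backend3.example.com;
}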
4. Max Fails in Nginx Load Balancing
Two key parameters are associated with failure detection: max_fails and fail_timeout. max_fails specifies the number of failed connection attempts to a server, within the fail_timeout window, before it's considered unavailable. fail_timeout also defines how long the server is then considered unavailable; after this time elapses, Nginx will attempt to connect to the server again. The default fail_timeout is 10 seconds.
A sample configuration might look like this:
upstream backend {
    server backend1.example.com max_fails=3 fail_timeout=15s;
    server backend2.example.com weight=2;
    server backend3.example.com weight=4;
}
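Related to failure handling, the backup parameter marks a server that receives traffic only when the primary servers are unavailable. A minimal sketch (backup1.example.com is an illustrative hostname, not part of the setup above):
upstream backend {
    server backend1.example.com max_fails=3 fail_timeout=15s;
    server backend2.example.com max_fails=3 fail_timeout=15s;
    # Receives requests only when both primary servers are considered unavailable.
    server backup1.example.com backup;
}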
Conclusion
At this point, you should have a solid understanding of how to configure Nginx as a load balancer. Nginx load balancing distributes incoming network traffic across multiple servers to ensure high availability, reliability, and optimal performance. It supports various load-balancing methods such as round-robin, least connections, and IP hash. This guide provides a fundamental understanding of how to implement these strategies.
Hope you enjoy it. You may also like to read the following articles:
Nginx Reverse Proxy Setup on Ubuntu 24.04
Enable Brotli Compression in Nginx on AlmaLinux 9
Resolve Nginx 404 Not Found Error
Alternative Solutions for Load Balancing
While Nginx provides a robust and versatile solution for load balancing, other alternatives exist, each with its own strengths and weaknesses. Here are two distinct approaches:
1. Using HAProxy:
HAProxy (High Availability Proxy) is another popular open-source load balancer known for its speed and reliability. While Nginx excels as both a web server and a load balancer, HAProxy is solely focused on load balancing, allowing it to potentially offer more fine-grained control and optimization for specific load balancing scenarios.
Explanation: HAProxy operates at both Layer 4 (TCP) and Layer 7 (HTTP). Layer 4 load balancing is faster and simpler, routing traffic based on TCP connections. Layer 7 load balancing, on the other hand, allows for more complex routing decisions based on HTTP headers, cookies, and URL patterns. This allows for advanced features like content-based routing, session persistence based on cookies, and SSL termination.
Code Example:
First, install HAProxy:
sudo apt update
sudo apt install haproxy
Then, configure HAProxy by editing the /etc/haproxy/haproxy.cfg file. A simple configuration for load balancing across three backend servers might look like this:
frontend http_frontend
    bind *:80
    mode http
    default_backend http_backends

backend http_backends
    mode http
    balance roundrobin
    server backend1 backend1.example.com:80 check
    server backend2 backend2.example.com:80 check
    server backend3 backend3.example.com:80 check
In this example:
- frontend http_frontend: defines a frontend named http_frontend that listens on port 80.
- mode http: specifies that the frontend handles HTTP traffic.
- default_backend http_backends: directs traffic to the http_backends backend by default.
- backend http_backends: defines a backend named http_backends.
- balance roundrobin: configures the round-robin load balancing algorithm.
- server backend1 backend1.example.com:80 check: defines a backend server named backend1 at backend1.example.com on port 80. The check option makes HAProxy periodically verify the health of that backend server.
Finally, restart HAProxy to apply the changes:
sudo systemctl restart haproxy
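Since the explanation above distinguishes Layer 4 from Layer 7, here is a minimal sketch of a Layer 4 (TCP mode) configuration; the section name, port, and server labels are illustrative:
# A combined frontend/backend ("listen") section that balances raw TCP
# connections, e.g. for a database, instead of parsed HTTP requests.
listen tcp_app
    bind *:5000
    mode tcp
    balance roundrobin
    server app1 backend1.example.com:5000 check
    server app2 backend2.example.com:5000 check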
2. Cloud-Based Load Balancers (e.g., AWS ELB, Google Cloud Load Balancing, Azure Load Balancer):
Cloud providers offer managed load balancing services that simplify the process of distributing traffic and managing server infrastructure. These services are highly scalable, reliable, and often include advanced features like automatic health checks, SSL termination, and integration with other cloud services.
Explanation: Cloud-based load balancers abstract away much of the complexity associated with managing load balancing infrastructure. They handle the underlying hardware and software, allowing you to focus on configuring the load balancing behavior and deploying your applications. These services typically offer various load balancing algorithms, health check options, and integration with other cloud services such as auto-scaling groups. They often provide global load balancing capabilities, distributing traffic across multiple regions for improved availability and performance. Configuring Nginx as a load balancer is good practice, but consider a managed cloud load balancer as an alternative when you would rather not operate the load balancing layer yourself.
Code Example (AWS ELB – via AWS CLI):
While a direct code example isn’t applicable in the same way as with Nginx or HAProxy (because these are managed services), the following AWS CLI commands illustrate the steps involved in creating a basic Application Load Balancer (ALB):
- Create a Target Group:
aws elbv2 create-target-group --name my-target-group --protocol HTTP --port 80 --vpc-id vpc-xxxxxxxxxxxxxxxxx
- Register Targets (Instances) with the Target Group:
aws elbv2 register-targets --target-group-arn arn:aws:elasticloadbalancing:region:account-id:targetgroup/my-target-group/xxxxxxxxxxxxxxxx --targets Id=i-xxxxxxxxxxxxxxxxx,Port=80 Id=i-yyyyyyyyyyyyyyyyy,Port=80
- Create an Application Load Balancer:
aws elbv2 create-load-balancer --name my-load-balancer --subnets subnet-xxxxxxxxxxxxxxxxx subnet-yyyyyyyyyyyyyyyyy --security-groups sg-xxxxxxxxxxxxxxxxx
- Create a Listener:
aws elbv2 create-listener --load-balancer-arn arn:aws:elasticloadbalancing:region:account-id:loadbalancer/app/my-load-balancer/xxxxxxxxxxxxxxxx --protocol HTTP --port 80 --default-actions Type=forward,TargetGroupArn=arn:aws:elasticloadbalancing:region:account-id:targetgroup/my-target-group/xxxxxxxxxxxxxxxx
These commands demonstrate the process of creating a target group, registering backend instances, creating the load balancer itself, and configuring a listener to forward traffic to the target group. Cloud providers typically also offer graphical interfaces (web consoles) for managing these resources.
Choosing the right load balancing solution depends on your specific requirements, infrastructure, and expertise. Nginx provides a solid foundation and great flexibility, HAProxy offers specialized performance and control, and cloud-based load balancers provide ease of management and scalability. Understanding how to configure Nginx as a load balancer will help you choose the best method for your own needs.