What is Nginx? Evolution and Architecture of Web Servers

What is Nginx?
#

Nginx (engine-x) is a lightweight, high-performance web server software. It functions not only as a web server but also as a reverse proxy, load balancer, and HTTP cache.

Nginx was designed to handle high concurrent connections and is currently used by numerous large-scale websites worldwide.

Why Was Nginx Needed?
#

In the past, the Apache HTTP Server was the industry standard. In the early 2000s, however, as the number of internet users grew exponentially, a bottleneck known as the C10k problem emerged.

The C10k Problem
#

The name C10k stands for “10,000 concurrent connections”: the challenge of handling ten thousand simultaneous client connections on a single server.

Important Concept Distinction

  • Concurrent Processing: Maintaining and managing many connections simultaneously
  • Throughput: Number of requests that can be processed per second

Concurrent connection handling focuses on efficient resource management and scheduling rather than raw speed.

Apache's thread-based architecture - one thread per connection

Apache’s Architectural Limitations
#

Traditional Apache had the following structural issues:

1. Process-Based Processing
#

  • Creates a new process or thread for each incoming connection
  • The number of processes grows in proportion to the number of concurrent users
  • Under heavy load, this can exhaust available memory

2. High Resource Consumption
#

  • Apache’s powerful extensibility allows various module additions
  • However, each process loads all modules into memory
  • Memory usage per process increases

3. Context-Switching Overhead
#

  • CPU cores alternate between multiple processes
  • Context-switching costs occur during process transitions
  • CPU overhead increases with more requests

Due to these issues, Apache was unsuitable for large-scale concurrent connection environments.

The Birth of Nginx
#

Igor Sysoev - Nginx Developer

In 2002, Russian developer Igor Sysoev began developing Nginx to solve this problem, releasing the first version in 2004.

Nginx’s Core Goals
#

  1. High concurrent connection handling
  2. Low memory footprint
  3. High performance and stability

Nginx’s Primary Roles
#

  • HTTP Server: Quickly serves static files (HTML, CSS, JS, images)
  • Reverse Proxy Server: Relays requests to backend application servers
  • Load Balancer: Distributes traffic across multiple servers
  • Mail Proxy Server: Proxies mail protocols (IMAP, POP3, SMTP)
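
As a minimal sketch of the first role above (the server name and paths are placeholders, not taken from the original article):

# Minimal static-file server block
server {
    listen 80;
    server_name static.example.com;

    root /var/www/html;   # serve files directly from this directory
    index index.html;
}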

Nginx Internal Architecture
#

Event-Driven Nginx Architecture

Nginx consists of 1 Master Process and multiple Worker Processes.

Master Process Responsibilities
#

The Master Process handles:

  • Reading and validating configuration files
  • Creating and managing Worker Processes
  • Restarting Worker Processes on configuration changes
# Check Master Process
ps aux | grep nginx
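
For illustration, the Master Process can also be driven directly with signals; the pid file path below is a common default and may differ on your system:

# Send signals to the Master Process (pid file path is an assumption)
kill -HUP  $(cat /var/run/nginx.pid)   # re-read configuration and start new Workers
kill -USR1 $(cat /var/run/nginx.pid)   # reopen log files
kill -QUIT $(cat /var/run/nginx.pid)   # shut down gracefully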

Worker Process Responsibilities
#

Worker Processes handle actual client requests:

1. Connection Management
#

  • Receives listen socket from Master Process
  • Forms connections with clients
  • Maintains connections for Keep-Alive duration
  • One Worker handles thousands of connections simultaneously

2. Non-blocking I/O
#

  • Works on other connections while a given connection has no pending requests
  • Responds immediately when a request arrives
  • Achieves efficient processing through an asynchronous, event-driven approach

3. Thread Pool
#

  • Delegates potentially blocking operations (chiefly heavy file I/O) to a Thread Pool
  • The Worker Process continues handling other requests in the meantime
  • Minimizes the impact of blocking operations on the event loop
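
A rough sketch of this mechanism, using the named thread-pool support available since Nginx 1.7.11 (the pool name and location are illustrative):

# Main context: define a named thread pool
thread_pool downloads threads=16 max_queue=65536;

http {
    server {
        location /downloads/ {
            # Offload blocking file reads to the "downloads" thread pool
            aio threads=downloads;
            sendfile on;
        }
    }
}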

4. CPU Core Optimization
#

  • Worker Processes are typically created equal to CPU core count
  • Each Worker pinned to specific CPU core (CPU Affinity)
  • Minimizes Context-Switching for performance improvement
# nginx.conf configuration example
worker_processes auto;  # Auto-create based on CPU core count
worker_cpu_affinity auto;  # Auto-set CPU affinity

Event-Driven Architecture
#

Nginx operates with a multi-process, single-thread-per-worker, event-driven model:

  1. Event Handler manages multiple connections
  2. Processes via asynchronous Non-blocking method
  3. Executes ready events sequentially
  4. Maximizes resource efficiency without idle processes

This allows efficient use of memory and CPU, since no process sits idle waiting on a single connection the way Apache’s per-connection processes do.

Nginx Advantages and Disadvantages
#

Advantages
#

1. High Concurrent Connection Capability
#

  • Roughly 10x more concurrent connections than Apache in commonly cited benchmarks
  • About 2x the processing speed for the same connection count

2. Low Resource Usage
#

  • Operates with fewer processes
  • Minimized memory usage
  • Fast response times with lightweight structure

3. Zero-Downtime Configuration Reload
#

nginx -s reload  # Apply configuration without service interruption
  • Master Process reads new configuration
  • Existing Workers finish current requests then terminate
  • New Workers handle requests with new configuration
  • Configuration changes without service interruption
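
In practice, it is common to validate the new configuration first and reload only if the check passes:

# Test the configuration, then reload only when the test succeeds
nginx -t && nginx -s reload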

4. Superior Static File Handling
#

  • Quickly serves static content like images, CSS, JS
  • Better static file performance than Apache

Disadvantages
#

1. Difficult Dynamic Module Development
#

  • Adding a module traditionally means recompiling Nginx (dynamic modules, where supported, still require a reload)
  • Module development is harder than with Apache
  • Partially compensated for by Lua scripting (e.g., the OpenResty ecosystem)

2. Windows Environment Limitations
#

  • Optimized for Linux/Unix environments
  • Performance and stability degraded on Windows
  • Linux recommended for production environments

3. No .htaccess Support
#

  • Cannot use Apache’s .htaccess files
  • All configuration managed in central config file
  • May lack flexibility in hosting environments

Key Nginx Features
#

Nginx Key Features Diagram

1. Reverse Proxy
#

A reverse proxy acts as an intermediary between clients and backend servers.

Reverse Proxy Architecture

Key Benefits
#

  • Enhanced Security: Hides actual server IP
  • Caching: Caches frequently requested responses
  • Compression: Saves bandwidth by compressing response data
  • SSL Processing: Handles HTTPS encryption/decryption
# Reverse proxy configuration example
location / {
    proxy_pass http://backend_server;
    proxy_set_header Host $host;
    proxy_set_header X-Real-IP $remote_addr;
}

Practical Usage Patterns
#

  • Nginx + Apache: Nginx handles static files, Apache handles dynamic processing
  • Nginx + Node.js/Python/Java: Nginx protects frontend and backend applications
  • Nginx + Nginx: Hierarchical configuration of multiple Nginx servers
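
A minimal sketch of the first pattern above (the static path, document root, and backend port are assumptions chosen for illustration):

server {
    listen 80;
    server_name example.com;

    # Nginx serves static assets directly from disk
    location /static/ {
        root /var/www;
    }

    # Everything else is relayed to the backend (Apache, Node.js, ...)
    location / {
        proxy_pass http://127.0.0.1:8080;
        proxy_set_header Host $host;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
    }
}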

2. Load Balancing
#

Distributes traffic across multiple backend servers to balance load evenly.

Load Balancing Architecture

Load Balancing Algorithms
#

Round Robin (Default)
#
  • Distributes requests sequentially to each server
  • Simplest and most fair approach
upstream backend {
    server backend1.example.com;
    server backend2.example.com;
    server backend3.example.com;
}
Least Connections
#
  • Sends to server with fewest current connections
  • Suitable for requests with varying processing times
upstream backend {
    least_conn;
    server backend1.example.com;
    server backend2.example.com;
}
IP Hash
#
  • Determines server based on client IP hash
  • Useful for Session Persistence
upstream backend {
    ip_hash;
    server backend1.example.com;
    server backend2.example.com;
}
Weight
#
  • Assigns weight based on server performance
  • Sends more requests to high-performance servers
upstream backend {
    server backend1.example.com weight=3;
    server backend2.example.com weight=2;
    server backend3.example.com weight=1;
}

Health Check
#

upstream backend {
    server backend1.example.com max_fails=3 fail_timeout=30s;
    server backend2.example.com max_fails=3 fail_timeout=30s;
}
  • max_fails: Number of failed attempts tolerated within fail_timeout
  • fail_timeout: The window for counting failures, and the time the server is then considered unavailable
  • Improves availability by automatically excluding failed servers
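
Two related upstream parameters, shown here as a sketch: temporarily taking a server out of rotation and keeping a spare as a backup:

upstream backend {
    server backend1.example.com max_fails=3 fail_timeout=30s;
    server backend2.example.com down;     # temporarily excluded from load balancing
    server backup1.example.com backup;    # used only when the other servers are unavailable
}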

3. SSL/TLS Termination
#

Nginx handles HTTPS communication with clients and plain HTTP communication with the backend.

SSL Termination Architecture

Key Benefits
#

  • Removes SSL processing burden from backend servers
  • Centralized certificate management
  • Backend focuses on business logic
  • Nginx and the backend communicate over plain HTTP, which is acceptable when they share a trusted internal network
server {
    listen 443 ssl http2;
    server_name example.com;

    ssl_certificate /path/to/cert.pem;
    ssl_certificate_key /path/to/key.pem;
    ssl_protocols TLSv1.2 TLSv1.3;
    ssl_ciphers HIGH:!aNULL:!MD5;

    location / {
        proxy_pass http://backend;
    }
}
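
A common companion to the block above (standard practice, though not part of the original example) is a plain-HTTP server that redirects everything to HTTPS:

server {
    listen 80;
    server_name example.com;
    return 301 https://$host$request_uri;
}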

HTTP/2 Support
#

Nginx supports HTTP/2:

  • Multiplexing: Multiple requests simultaneously over one connection
  • Header Compression: Saves bandwidth
  • Server Push: Sends resources before the client requests them

4. Caching
#

Stores server responses in memory or on disk so repeated requests can be answered quickly.

# Cache path configuration
proxy_cache_path /var/cache/nginx levels=1:2 keys_zone=my_cache:10m max_size=1g;

server {
    location / {
        proxy_cache my_cache;
        proxy_cache_valid 200 60m;  # Cache 200 responses for 60 minutes
        proxy_cache_valid 404 10m;  # Cache 404 responses for 10 minutes
        proxy_pass http://backend;
    }
}
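
To verify cache behavior during testing, one option (assuming the my_cache zone defined above) is to expose the cache status in a response header:

location / {
    proxy_cache my_cache;
    proxy_pass http://backend;
    # Reports HIT, MISS, EXPIRED, BYPASS, and similar states
    add_header X-Cache-Status $upstream_cache_status;
}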

Caching Strategies
#

  • Proxy Caching: Cache backend responses
  • FastCGI Caching: Cache dynamic content like PHP-FPM
  • Static File Caching: Set browser cache headers
# Static file cache header configuration
location ~* \.(jpg|jpeg|png|gif|ico|css|js)$ {
    expires 1y;
    add_header Cache-Control "public, immutable";
}

5. Compression (Gzip)
#

Compresses response data to save network bandwidth.

gzip on;
gzip_vary on;
gzip_min_length 1024;
gzip_types text/plain text/css text/xml text/javascript
           application/x-javascript application/xml+rss
           application/json application/javascript;
  • Typically shrinks text-based content by 60-80%
  • Improves user experience by reducing transfer time

6. Rate Limiting
#

Limits request rates to protect servers from abusive traffic and simple denial-of-service attacks.

# Define zone
limit_req_zone $binary_remote_addr zone=mylimit:10m rate=10r/s;

server {
    location /api/ {
        limit_req zone=mylimit burst=20 nodelay;
        proxy_pass http://backend;
    }
}
  • Limits requests per second per client IP
  • burst: Number of requests allowed to exceed the rate momentarily (queued, or served immediately with nodelay)
  • Essential for API server protection
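
Two related directives worth knowing, sketched here against the mylimit zone defined above: capping concurrent connections per IP and returning 429 instead of the default 503 when a request is rejected:

limit_conn_zone $binary_remote_addr zone=peraddr:10m;

server {
    location /api/ {
        limit_req zone=mylimit burst=20 nodelay;
        limit_req_status 429;      # respond with 429 Too Many Requests instead of 503
        limit_conn peraddr 10;     # at most 10 concurrent connections per client IP
        limit_conn_status 429;
        proxy_pass http://backend;
    }
}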

Nginx vs Apache: Which Should You Choose?
#

Choose Nginx When
#

  • High concurrent connection handling is needed
  • Static file service is primary purpose
  • Reverse proxy/load balancer is needed
  • Resource efficiency is important
  • Modern protocol support needed (HTTP/2, HTTP/3)

Choose Apache When
#

  • .htaccess file-based configuration is needed
  • Various third-party modules are needed
  • Must use in Windows environment
  • Legacy application compatibility is important
  • Frequent dynamic module development

Optimal Combination: Nginx + Apache
#

Many companies use Nginx as the frontend and Apache as the backend:

[Client] → [Nginx] → [Apache] → [Application]
           Static    Dynamic
           SSL       PHP/Python
           Caching   Modules

Production Tips
#

1. Worker Connections Configuration
#

events {
    worker_connections 1024;  # Connections per Worker
    use epoll;  # Optimal event model for Linux
}

2. Keepalive Optimization
#

http {
    keepalive_timeout 65;
    keepalive_requests 100;
}

3. Buffer Size Tuning
#

http {
    client_body_buffer_size 16K;
    client_header_buffer_size 1k;
    client_max_body_size 8m;
    large_client_header_buffers 4 8k;
}

4. Log Optimization
#

http {
    access_log /var/log/nginx/access.log combined buffer=32k;
    error_log /var/log/nginx/error.log warn;
}

5. Security Hardening
#

# Hide version information
server_tokens off;

# Add security headers
add_header X-Frame-Options "SAMEORIGIN" always;
add_header X-Content-Type-Options "nosniff" always;
add_header X-XSS-Protection "1; mode=block" always;

Conclusion
#

Nginx has established itself as a core component of modern web infrastructure. Thanks to the high performance and efficiency of its event-driven architecture, it is used by large-scale services such as Netflix, Airbnb, and GitHub.

While Apache’s stability and extensibility remain valuable, in modern web environments where large-scale traffic handling and resource efficiency are crucial, Nginx is the more suitable choice.

Recommended Learning Path

  1. Install Nginx in local environment and practice basic configuration
  2. Set up reverse proxy
  3. Configure and test load balancing
  4. Apply SSL certificates (Let’s Encrypt)
  5. Performance monitoring and optimization


Important Note: Nginx shows limited performance and compatibility on Windows environments, so it’s strongly recommended to use Linux/Unix systems in production!
