dev #3365
open · [BE] API performance optimization
Description
API Performance Optimization
Recommendation API
Description:
This issue tracks the performance improvements to the Recommendation API's response times after tuning the PostgreSQL connection pool configuration.
Problem:
Before optimization, the Recommendation API's response time under load was high, with a p(95) latency of 1086.48 ms. After experimenting with various configurations, particularly the connection pool, response times improved significantly.
Optimization Strategy:
- Parallel Query Execution using Promise.all():
  - In the Recommendation Controller, the main query (fetching recommendations) and the count query (total record count) are executed in parallel using Promise.all(). Both queries run concurrently, which reduces the overall time spent waiting for the database to return results.
- Setting max_parallel_workers_per_gather to 1:
  - By setting max_parallel_workers_per_gather to 1, we limited the number of parallel workers used per query, reducing CPU contention and the overhead of parallel execution, which improved overall response time and stability under load.
- Pool Configuration was the key area of focus. Initially, we did not use any pooling strategy: each database call created a new connection, which increased API response latency.
- Previous Pool Config (before optimization):
  - No configuration
- Optimized Pool Config:
  - Max connections: 10
  - Idle timeout: 30 seconds
  - Connection timeout: 2 seconds
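The Promise.all() pattern described above can be sketched as follows. The function names and timings here are illustrative stand-ins, not the project's actual code; the real controller issues the two PostgreSQL queries instead of timers.

```javascript
// Stub "queries" with timers so the effect on wall-clock time is
// visible without a database.
const delay = (ms, value) =>
  new Promise((resolve) => setTimeout(() => resolve(value), ms));

// Hypothetical stand-ins for the data query and the count query.
const fetchRecommendations = () => delay(100, [{ id: 1 }, { id: 2 }]);
const countRecommendations = () => delay(100, 2);

// Sequential: total wait is the sum of both query times (~200 ms).
async function getRecommendationsSequential() {
  const start = Date.now();
  const rows = await fetchRecommendations();
  const total = await countRecommendations();
  return { rows, total, elapsedMs: Date.now() - start };
}

// Parallel: both queries start immediately, so the total wait is only
// the slower of the two (~100 ms).
async function getRecommendationsParallel() {
  const start = Date.now();
  const [rows, total] = await Promise.all([
    fetchRecommendations(),
    countRecommendations(),
  ]);
  return { rows, total, elapsedMs: Date.now() - start };
}
```

With real queries the saving is bounded by the slower of the two calls, which is why the count query is a good candidate to run alongside the main fetch.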
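Assuming the backend uses node-postgres (`pg`), the optimized settings above map onto the Pool options roughly like this. The module layout and the on-connect SET are a sketch, not the project's confirmed code:

```javascript
const { Pool } = require('pg');

const pool = new Pool({
  max: 10,                       // Max connections: 10
  idleTimeoutMillis: 30000,      // Idle timeout: 30 seconds
  connectionTimeoutMillis: 2000, // Connection timeout: 2 seconds
});

// Apply the planner setting on every new physical connection, so
// queries through this pool use at most one parallel worker.
pool.on('connect', (client) => {
  client.query('SET max_parallel_workers_per_gather = 1');
});

module.exports = pool;
```

Sharing one pool module across the app (rather than creating connections per request) is what eliminates the per-call connection setup cost described above.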
Performance Improvement (before and after optimization):
| Metric | Before Optimization | After Optimization | Improvement (%) |
|---|---|---|---|
| HTTP Request Duration (avg) | 439.55 ms | 302.31 ms | 31.2% improvement |
| HTTP Request Duration (p90) | 979.86 ms | 644.02 ms | 34.2% improvement |
| HTTP Request Duration (p95) | 1086.48 ms | 795.9 ms | 26.8% improvement |
| HTTP Request Waiting (avg) | 439.13 ms | 302.15 ms | 31.2% improvement |
| HTTP Request Waiting (p90) | 979.55 ms | 643.88 ms | 34.2% improvement |
| HTTP Request Waiting (p95) | 1085.6 ms | 795.38 ms | 26.8% improvement |
| Total Requests | 6290 | 6953 | 10.5% increase in requests |
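The improvement column is the plain relative change between the two runs; for example:

```javascript
// Relative improvement: how much lower the second value is, as a
// percentage of the first.
const relativeChangePct = (before, after) =>
  ((before - after) / before) * 100;

// Average HTTP Request Duration: 439.55 ms -> 302.31 ms
const avgImprovement = relativeChangePct(439.55, 302.31); // ≈ 31.2%

// Total Requests went up, so the "reduction" is negative:
// (6290 - 6953) / 6290 ≈ -10.5%, i.e. a 10.5% increase in throughput.
const throughputIncrease = -relativeChangePct(6290, 6953); // ≈ 10.5%
```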
Detailed Explanation of Performance Gains:
- Reduced Max Connections:
  - Capping max connections in the pool at 10 reduced contention for database connections and allowed the application to process requests without waiting for unused connections to be freed.
- Lower Latency:
  - HTTP Request Duration saw a significant reduction, with a 31.2% improvement in average latency and 26.8% in p95 response time. This improvement was due to better database connection management, preventing connection saturation.
- Improved Throughput:
  - Total Requests increased by 10.5%, indicating that the application could handle more concurrent requests due to the optimized pool size and better resource management.
- Efficient Resource Utilization:
  - By reducing max connections, backend resources (CPU, memory) were used more efficiently, leading to faster request processing and reduced load on the server.
Additional Observations:
- Blocked Time (HTTP_Request_Blocked) remained very low, indicating no significant blocking in request processing even as the load increased.
- Connection Handling: With the optimized connection pool, the time spent establishing connections (HTTP_Request_Connecting) was minimal, ensuring smooth and efficient database communication.