Incident Summary
On June 5, 2025, users were temporarily unable to view the Series Listing Page across the platform.
The root cause was a misconfiguration in the infrastructure setup of a new caching service (Amazon DAX) that had been introduced to improve system performance. The configuration error blocked communication between the application services and the DAX cluster, which disrupted service. The issue was promptly identified and resolved by rolling back the change.
Impact Area
The following functionality was impacted during the incident: the Series Listing Page, which users were unable to view across the platform.
Incident Timeline
Root Cause Analysis
The issue was caused by a misconfiguration in the security settings of the newly introduced Amazon DAX cluster in the production environment. The security group associated with the cluster did not allow the required inbound traffic from the application services. This caused service pods to fail and led to errors on the Series Listing Page.
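As an illustration only, the class of misconfiguration described above could be caught by a pre-deployment check along the following lines. This is a minimal sketch, not the check used by the team. It assumes the cluster's ingress rules are available in the shape returned by the EC2 `describe_security_groups` API (the `IpPermissions` list), that the application services are identified by a security group ID, and that the cluster listens on the default unencrypted DAX port 8111 (clusters with encryption in transit use 9111). The function name `allows_inbound` and the ID `sg-app-example` are hypothetical. The sketch only inspects security group references and ignores CIDR-based rules.

```python
from typing import Any

# Assumption: default unencrypted DAX port. TLS-enabled clusters use 9111.
DAX_PORT = 8111


def allows_inbound(
    ingress_rules: list[dict[str, Any]],
    source_sg_id: str,
    port: int = DAX_PORT,
) -> bool:
    """Return True if any rule admits TCP traffic on `port` from `source_sg_id`.

    `ingress_rules` is expected to look like the `IpPermissions` list of an
    EC2 security group. Only security group references are checked.
    """
    for rule in ingress_rules:
        protocol = rule.get("IpProtocol")
        if protocol == "-1":
            # "-1" means all protocols and all ports.
            port_ok = True
        elif protocol == "tcp":
            port_ok = rule.get("FromPort", 0) <= port <= rule.get("ToPort", 65535)
        else:
            port_ok = False
        if not port_ok:
            continue
        pairs = rule.get("UserIdGroupPairs", [])
        if any(pair.get("GroupId") == source_sg_id for pair in pairs):
            return True
    return False


if __name__ == "__main__":
    # Hypothetical rule set that only opens HTTPS to the application group,
    # so DAX traffic from the application services would be blocked.
    rules = [
        {
            "IpProtocol": "tcp",
            "FromPort": 443,
            "ToPort": 443,
            "UserIdGroupPairs": [{"GroupId": "sg-app-example"}],
        }
    ]
    if not allows_inbound(rules, "sg-app-example"):
        print("DAX port is not reachable from the application security group")
```

A check of this kind could run against the planned infrastructure configuration before a change reaches production, so that a missing inbound rule fails the deployment pipeline instead of the service pods.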
Next Steps and Preventive Actions
We apologize for the inconvenience caused by this incident.