Impact: The series details page on the learning site was inaccessible for 22 minutes, from 23:24 PT to 23:46 PT on 28th November, 2023.
Why it happened:
Two applications were deployed serially, one after the other. Until both deployments had finished, the applications were in mismatched states, and GraphQL queries failed.
Incident timeline (PT):
What we did to fix it:
Corrective steps: The state mismatch was resolved once the deployments were completed, and no further corrective action was necessary.
Preventive steps:
We defined an explicit order of operations in the deployment scripts so that the two applications are always deployed in a known sequence, preventing a recurrence of this issue.
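
As an illustration only, an explicit order of operations could be sketched as follows. This is not the actual deployment script: the application names ("graphql-api", "web-frontend"), the order shown, and the DeployStep and run_in_order helpers are all hypothetical. The idea is that each application is deployed in a fixed sequence and the next one starts only after the previous one reports healthy.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DeployStep:
    name: str                        # hypothetical application name
    deploy: Callable[[], None]       # rolls out the application
    is_healthy: Callable[[], bool]   # reports whether the new version is serving


def run_in_order(steps: List[DeployStep]) -> List[str]:
    """Deploy each step in list order and stop at the first unhealthy step."""
    completed: List[str] = []
    for step in steps:
        step.deploy()
        if not step.is_healthy():
            # Halt so later applications are not deployed against a bad state.
            raise RuntimeError(f"{step.name} unhealthy; halted after {completed}")
        completed.append(step.name)
    return completed


if __name__ == "__main__":
    # Assumed order for illustration: API first, then the frontend that queries it.
    rollout_log: List[str] = []
    steps = [
        DeployStep("graphql-api", lambda: rollout_log.append("graphql-api"), lambda: True),
        DeployStep("web-frontend", lambda: rollout_log.append("web-frontend"), lambda: True),
    ]
    print(run_in_order(steps))
```

In a real script the deploy and is_healthy callables would wrap whatever tooling performs the rollout and health check; here they are stubs so the ordering logic can be read and run on its own.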