Users were unable to log into the SimpliGov production portal because of communication issues between User Management and Authentication services
2. Root Cause Analysis:
Preliminary Findings:
The User Management Service was unable to communicate with the authentication service, causing it to restart and reprovision, preventing users from logging in or accessing authenticated forms. Error messages were presented to users as they attempted to login and 500 status code errors were shown.
SimpliGov internal employees received alerts and began addressing the underlying cause of the communication issues.
3. Mitigation
Additional resources were allocated to User Management services and Authentication services and the service cluster was reprovisioned. SimpliGov’s QA team tested the system and monitored logs to ensure that performance returned to expected levels.
4. Ongoing Preventive Measures
Service resources were evaluated and updated for the current issue and for the long-term.