After extensive consultation with relevant stakeholders, our DevOps team decided to employ a comprehensive health analytics dashboard. This platform consists of several microservices developed by our engineers that can gather metrics, review Splunk logs, and produce analytics on application environment user access.
Our engineers developed a set of custom Prometheus exporters to adapt the health analytics dashboard to the organization’s systems. These exporters monitor application health and supply the information to Prometheus. Our team also developed a loader and other microservices that grabs data from Prometheus and pushes it to a PostgreSQL database.
We then implemented analytics that offers product tracking that the organization's internal teams require in each environment. The health analytics dashboard was integrated with Splunk and a custom alert script, allowing it to push accurate and timely notifications to the teams via email, Jira, Slack, and PagerDuty.
Key features of the solution include:
- Proven health dashboard system: Our team integrated a well-proven set of microservices that form the health dashboard. This approach saved the company time and money as our system has already been extensively tested.
- Quality analytics data: The new dashboard eliminates duplication and filters out bad data, resulting in high-quality analytics. This significantly reduced the number of false positives and enabled company engineers to respond to real issues quickly.
- Automation and scalability: The new system can automatically add newly generated URLs to the Jenkins job without manual intervention. This means the system easily scales to monitor numerous URLs and applications.