These are the standards in which Banno is able to reliably assess performance.
- Server and service metrics are available at the Grafana dashboards on the links supplied in URLS. These are available 24/7 to all Banno associates and management.
- Alerting for those services and servers is done by the Prometheus monitoring system and critical alerts are sent to Pagerduty. Service alerts are routed to the development team’s Firefighter based on escalation and schedule. Server alerts and any other system alerts are routed to the current Infrastructure team Firefighter.
- Scale Meetings are held monthly. They are attended by Engineering managers and interested team leads. Upcoming launches and features, recent performance issues, and other scaling issues are discussed. Action items are then added to roadmaps.
- The Enterprise Incident Response process provides an immediate escalation path for critical issues and communication to stakeholders.