Covered in this article
Related articles

Monitoring and Alerting

Guides and principles used to monitor the platform and send alerts when necessary.

We use Prometheus for collecting metrics, Alertmanager for alerting and Grafana to monitor and diagnose issues.

Table of contents

  1. Data Collection Document guides through methods used to collect data from platform services.
  2. Data Rendering Document provides an overview of data rendering principles and suggests dashboards to use.
  3. Alerting Rules Document provides suggestions on alerting rules to implement.