A well-designed metric monitoring and alerting system plays a key role in providing clear visibility into the health of the infrastructure to ensure high availability and reliability. The diagram below explains how it works at a high level. Metrics source: This can be application servers, SQL databases, message queues, etc.
basic monitoring system