Prometheus
Prometheus is an open-source monitoring and alerting toolkit designed for reliability and scalability. It was originally developed at SoundCloud and later open-sourced as part of the Cloud Native Computing Foundation (CNCF). Prometheus is widely used in the field of DevOps and cloud-native computing to monitor systems and applications.
This key competency area includes an understanding of the Prometheus architecture, metrics, and monitoring and alerting.
Key Competencies:
-
Metrics and Monitoring - Understanding the basics of monitoring and metrics. Knowledge of what metrics are, how they are collected, and their importance in maintaining system health.
-
Prometheus Architecture - Knowledge of the architecture of Prometheus, including how it collects, stores, and queries time-series data.
-
Alerting - Ability to create alerts based on metric thresholds and conditions.