Contents

What is Prometheus?

Features

What are metrics?

Components

Architecture

When does it fit?

When does it not fit?


What is Prometheus?

Prometheus is an open-source systems monitoring and alerting toolkit originally built at SoundCloud. Since its inception in 2012, many companies and organizations have adopted Prometheus, and the project has a very active developer and user community. It is now a standalone open source project and maintained independently of any company. To emphasize this, and to clarify the project's governance structure, Prometheus joined the Cloud Native Computing Foundation in 2016 as the second hosted project, after Kubernetes.

Prometheus collects and stores its metrics as time series data, i.e. metrics information is stored with the timestamp at which it was recorded, alongside optional key-value pairs called labels.
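
To make the data model concrete, here is a minimal sketch of one sample and of how a series is identified. The `Sample` class, the helper functions, and the metric and label names are illustrative assumptions, not Prometheus's internal types.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """One recorded value of a metric (illustrative, not a Prometheus type)."""
    metric: str        # metric name, e.g. "http_requests_total" (hypothetical)
    labels: tuple      # sorted (key, value) pairs; optional, may be empty
    timestamp_ms: int  # when the value was recorded, in milliseconds
    value: float


def make_sample(metric, value, timestamp_ms, **labels):
    # Sort the labels so the same label set always gives the same identity.
    return Sample(metric, tuple(sorted(labels.items())), timestamp_ms, float(value))


def series_key(sample):
    # A time series is identified by the metric name plus its label set.
    return (sample.metric, sample.labels)


# Two samples of the same series, recorded 15 seconds apart, and one sample
# of a different series that shares the metric name but not the labels.
a = make_sample("http_requests_total", 1027, 1_700_000_000_000, method="GET", code="200")
b = make_sample("http_requests_total", 1031, 1_700_000_015_000, code="200", method="GET")
c = make_sample("http_requests_total", 3, 1_700_000_000_000, method="POST", code="500")
```

Here `a` and `b` belong to the same time series because their name and labels match, while `c` starts a separate series.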

For more elaborate overviews of Prometheus, see the resources linked from the media section.

Features

Prometheus's main features are:

  • a multi-dimensional data model with time series data identified by metric name and key/value pairs
  • PromQL, a flexible query language to leverage this dimensionality
  • no reliance on distributed storage; single server nodes are autonomous
  • time series collection happens via a pull model over HTTP
  • pushing time series is supported via an intermediary gateway
  • targets are discovered via service discovery or static configuration
  • multiple modes of graphing and dashboarding support
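
The pull model can be sketched with the Python standard library alone: a target exposes its current values over HTTP, and a collector fetches them. The metric name `demo_requests_total`, the `job` label value, and the counter value are hypothetical; real applications would normally use a Prometheus client library rather than this hand-rolled handler.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical in-process counter that the target exposes.
REQUESTS_SERVED = {"value": 42}


def render_metrics():
    # One line per series: name, labels in braces, then the current value.
    return 'demo_requests_total{job="demo"} %d\n' % REQUESTS_SERVED["value"]


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path != "/metrics":
            self.send_error(404)
            return
        payload = render_metrics().encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

    def log_message(self, *args):
        pass  # keep the example quiet


def scrape(url):
    """Pull the target's current metrics, as a collector would on each interval."""
    # Ignore proxy environment variables; this demo only talks to localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(url, timeout=5) as resp:
        return resp.read().decode("utf-8")


def demo():
    server = HTTPServer(("127.0.0.1", 0), MetricsHandler)  # port 0: any free port
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        return scrape("http://127.0.0.1:%d/metrics" % server.server_address[1])
    finally:
        server.shutdown()
        server.server_close()
```

Calling `demo()` starts the target, scrapes it once, and returns the exposed text. The point of the design is that the target only publishes its current state, while the collector decides when to fetch it.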

What are metrics?

In layperson's terms, metrics are numeric measurements, and "time series" means that changes are recorded over time. What users want to measure differs from application to application: for a web server it might be request times; for a database it might be the number of active connections or active queries.

Metrics play an important role in understanding why your application behaves the way it does. Suppose you are running a web application and find that it is slow. You need some information to find out what is happening. For example, the application can become slow when the number of requests is high. If you have a request count metric, you can spot the cause and increase the number of servers to handle the load.
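
As a rough illustration of the request count example, the sketch below turns two readings of a cumulative request counter into a per-second rate. The sample values and the function name are made up for the example, and the calculation deliberately ignores counter resets.

```python
def per_second_rate(samples):
    """Average request rate over a window.

    samples: list of (timestamp_seconds, cumulative_request_count), oldest first.
    """
    (t0, v0), (t1, v1) = samples[0], samples[-1]
    if t1 <= t0:
        raise ValueError("need at least two samples with increasing timestamps")
    return (v1 - v0) / (t1 - t0)


# Hypothetical readings taken one minute apart.
quiet = [(0, 1000), (60, 1600)]     # 600 requests in 60 s
busy = [(60, 1600), (120, 13600)]   # 12000 requests in 60 s

quiet_rate = per_second_rate(quiet)
busy_rate = per_second_rate(busy)
```

With these numbers the rate rises from 10 to 200 requests per second, which is the kind of jump that would explain the slowdown and justify adding servers.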

Components

The Prometheus ecosystem consists of multiple components, many of which are optional:

  • the main Prometheus server which scrapes and stores time series data
  • client libraries for instrumenting application code
  • a push gateway for supporting short-lived jobs
  • special-purpose exporters for services like HAProxy, StatsD, Graphite, etc.
  • an alertmanager to handle alerts
  • various support tools

Most Prometheus components are written in Go, making them easy to build and deploy as static binaries.

Architecture

[Figure: architecture of Prometheus and some of its ecosystem components]

Prometheus scrapes metrics from instrumented jobs, either directly or via an intermediary push gateway for short-lived jobs. It stores all scraped samples locally and runs rules over this data to either aggregate and record new time series from existing data or generate alerts. Grafana or other API consumers can be used to visualize the collected data.
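
The two kinds of rules mentioned above can be sketched over an in-memory list of samples. Everything here is an assumption made for illustration: the stored values, the metric and label names, the threshold, and the function names. Real rules are written in PromQL and evaluated by the Prometheus server.

```python
from collections import defaultdict

# Hypothetical latest values per series: (metric name, labels, value).
STORED = [
    ("http_requests_in_flight", {"service": "api", "instance": "a"}, 40.0),
    ("http_requests_in_flight", {"service": "api", "instance": "b"}, 75.0),
    ("http_requests_in_flight", {"service": "web", "instance": "c"}, 12.0),
]


def record_sum_by(samples, metric, label, new_metric):
    """Recording-style rule: aggregate one metric by a label into new series."""
    totals = defaultdict(float)
    for name, labels, value in samples:
        if name == metric:
            totals[labels[label]] += value
    return [(new_metric, {label: key}, total) for key, total in sorted(totals.items())]


def firing_alerts(samples, metric, threshold):
    """Alerting-style rule: label sets whose value exceeds the threshold."""
    return [labels for name, labels, value in samples
            if name == metric and value > threshold]


# Aggregate per service, then alert on the aggregated series.
recorded = record_sum_by(STORED, "http_requests_in_flight", "service",
                         "service:http_requests_in_flight:sum")
alerts = firing_alerts(recorded, "service:http_requests_in_flight:sum", 100.0)
```

In this sketch the `api` service sums to 115 in-flight requests and is the only one above the threshold of 100, so it is the only entry in `alerts`.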

When does it fit?

Prometheus works well for recording any purely numeric time series. It fits both machine-centric monitoring as well as monitoring of highly dynamic service-oriented architectures. In a world of microservices, its support for multi-dimensional data collection and querying is a particular strength.

Prometheus is designed for reliability, to be the system you go to during an outage to allow you to quickly diagnose problems. Each Prometheus server is standalone, not depending on network storage or other remote services. You can rely on it when other parts of your infrastructure are broken, and you do not need to set up extensive infrastructure to use it.

When does it not fit?

Prometheus values reliability. You can always view what statistics are available about your system, even under failure conditions. If you need 100% accuracy, such as for per-request billing, Prometheus is not a good choice as the collected data will likely not be detailed and complete enough. In such a case you would be best off using some other system to collect and analyze the data for billing, and Prometheus for the rest of your monitoring.
