prometheus+grafana应用监控配置
创始人
2025-01-09 13:36:14
0

配置Prometheus

官方地址:Download | Prometheus

(wegt下载压缩包,解压并重命名prometheus,文件放于/data/prometheus即可)

配置 service方法(文件放于 /etc/systemd/system/prometheus.service):

[Unit]  Description=Prometheus Server  Documentation=https://prometheus.io/docs/introduction/overview/  After=network-online.target  [Service]  Type=simple  User=prometheus Group=prometheus  ExecStart=/data/prometheus/prometheus/prometheus \   --config.file=/data/prometheus/prometheus/prometheus.yml \   --storage.tsdb.path=/data/prometheus/prometheus/data \   --storage.tsdb.retention.time=60d \   --web.enable-lifecycle   Restart=on-failure  [Install]  WantedBy=multi-user.target 

授权给用户 prometheus

# 创建用户 -M 不创建源目录 -s 指定 shell useradd -M -s /usr/sbin/nologin prometheus  # 赋权 chown prometheus:prometheus -R /data/prometheus

启动运行并检查状态

# 启动 systemctl start prometheus.service  # 检查 systemctl status prometheus.service  # 开启自启动 systemctl enable prometheus.service  # 修改单元文件,重新加载systemd配置  sudo systemctl daemon-reload  # 检查日志方法 journalctl -u prometheus.service -f

重新加载 prometheus

# -X  指定 POST 请求 curl -X POST http://localhost:9090/-/reload  

访问web地址9090:

# ip:9090 访问 prometheus    http://xxx.xxx.xxx.xxx:9090   监控指标       http://xxx.xxx.xxx.xxx:9090/metrics

配置 alertmanager

 配置文件

[Unit]  Description=Alert Manager  Wants=network-online.target After=network-online.target  [Service]  Type=simple  User=prometheus Group=prometheus  ExecStart=/data/prometheus/alertmanager/alertmanager \   --config.file=/data/prometheus/alertmanager/alertmanager.yml \   --storage.path=/data/prometheus/prometheus/data \   Restart=always  [Install]  WantedBy=multi-user.target

访问:web端口9093

配置alert服务器告警提示:

root@pass:~# cat /data/prometheus/prometheus/alert.yml  groups: - name: Prometheus alert   rules:   - alert: 服务告警     expr: up == 0     for: 30s     labels:         severity: critical     annotations:         instance: "{{ $labels.instance }}"         description: "{{ $labels.job }} 服务器"

访问web检查:

检查yml文件配置方法promtool工具:

root@pass:~# cd /data/prometheus/prometheus/ root@pass:/data/prometheus/prometheus# ./promtool check config prometheus.yml Checking prometheus.yml   SUCCESS: 1 rule files found  SUCCESS: prometheus.yml is valid prometheus config file syntax  Checking alert.yml   SUCCESS: 1 rules found

配置grafana

下载地址:Download Grafana | Grafana Labs

配置service文件:

[Unit] Description=Grafana server  Documentation=http://docs.grafana.org   [Service] Type=simple User=prometheus Group=prometheus Restart=on-failure ExecStart= /data/prometheus/grafana/bin/grafana-server \ 	--config=/data/prometheus/grafana/conf/defaults.ini \ 	--homepath=/data/prometheus/grafana  [Install] WantedBy=multi-user.target 

访问web端口3000:(默认账号及密码admin/admin)


 

配置node_exporter

下载地址同Prometheus

配置service文件:

[Unit] Description=node_exporter  Documentation=http://prometheus.io/  After=network.target   [Service] User=prometheus Group=prometheus Restart=on-failure ExecStart= /data/prometheus/node_exporter/node_exporter  [Install] WantedBy=multi-user.target 

访问web端口9100

 最后注意记得配置时将需要的配置到Prometheus.yml中:

# my global config global:   scrape_interval: 15s # Set the scrape interval to every 15 seconds. Default is every 1 minute.   evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.   # scrape_timeout is set to the global default (10s).  # Alertmanager configuration alerting:   alertmanagers:     - static_configs:         - targets:             - localhost:9093            # - alertmanager:9093  # Load rules once and periodically evaluate them according to the global 'evaluation_interval'. rule_files:   - "alert.yml"   # - "first_rules.yml"   # - "second_rules.yml"  # A scrape configuration containing exactly one endpoint to scrape: # Here it's Prometheus itself. scrape_configs:   # The job name is added as a label `job=` to any timeseries scraped from this config.   - job_name: "prometheus"     # metrics_path defaults to '/metrics'     # scheme defaults to 'http'.      static_configs:       - targets: ["localhost:9090"]           - job_name: "node-exporter"     scrape_interval: 15s     # metrics_path defaults to '/metrics'     # scheme defaults to 'http'.      static_configs:       - targets: ["localhost:9090"]         labels:              instance: Prometheus服务器

web页面展示

手动创建仪表板:官方地址  Grafana dashboards | Grafana Labs

复制需要的ID ,然后导入到granafa图表中即可

第一步:

第二步:

第三步:

展示结果:

相关内容

热门资讯

【数据结构】手写堆 HEAP heap【堆】掌握手写上浮、下沉、建堆函数对一组数进行堆排序直接使用接口函数heapq什么是堆&#x...
C语言基础and数据结构 C语言程序和程序设计概述 程序:可以连续执行的一条条指令的集合开发过程:C源程序(.c文件) -->...
docker 常用命令 dockerimages 获取本地所有镜像docker pull xxx 拉取镜像docker rm...
视频使用操作说明书-T8000... 视频使用操作说明书-T80005系列视频编码器如何对接海康NVR硬盘录像机,包括T80...
【Go系列】 Go的错误处理 承上启下        上一篇文章中介绍了struct和interface,在Go语言...
数据结构(4.1)——树的性质 结点数=总度数+1结点的度——结点有几个孩子(分支)度为m的树、m叉树的区别 度为m...
Mac Dock栏多屏幕漂移固... 记录一下我目前的版本是 14.5多个屏幕,Dock栏切换的方式: 把鼠标...
初识Java 乐观学习,乐观生活,才能不断前进啊!!&#x...
Java面试题--JVM大厂篇... 目录引言:正文:一、Serial GC概述二、Serial GC的优点三、Serial...
可灵重大升级!新增Web端上线... 快手视频生成大模型“可灵”(Kling),作为全球首个真正...