一、简介
1、Consul简介
-
Consul 是基于 GO 语言开发的开源工具,主要面向分布式,服务化的系统提供服务注册、服务发现和配置管理的功能。Consul 提供服务注册/发现、健康检查、Key/Value存储、多数据中心和分布式一致性保证等功能。
-
在没有使用 consul 服务自动发现的时候,我们需要频繁对 Prometheus 配置文件进行修改,无疑给运维人员带来很大的负担。引入consul之后,只需要在consul中维护监控组件配置,prometheus就能够动态发现配置
2、实验环境
IP | 操作系统 | 安装服务 |
---|---|---|
172.18.200.52 | ubuntu 22.04.1 | Docker、Prometheus、Grafana、Consul |
172.18.200.53 | ubuntu 22.04.1 | node-exporter |
二、安装Consul
1、配置docker-compose.yml
# cat docker-compose.yml
version : '3'
services:
consul:
image: consul:1.15
container_name: consul
hostname: consul
volumes:
- ./consul/config:/consul/config
- ./consul/data:/consul/data/
ports:
- 8500:8500
command: ["consul","agent","-dev","-bootstrap","-config-dir","/consul/config","-data-dir","/consul/data","-ui","-log-level","INFO","-bind","127.0.0.1","-client","0.0.0.0"]
2、启动consul
# docker-compose up -d
3、浏览器访问
http://172.18.200.52:8500
三、配置Ansible
1、安装
# apt-get install ansible
2、修改配置
# cat /etc/ansible/ansible.cfg
[defaults]
#host_key_checking = False
#error_on_undefined_vars = True
#timeout = 60
#inventory = inventory.tmp
#roles_path = /conjurinc
#remote_tmp = /tmp
host_key_checking = False
log_path = /var/log/ansible.log
四、ansible-playbook编写
1、查看目录结构
# tree ./
# tree ./
./
├── inventory
│ └── hosts
├── node_exporter_roles.yml
└── roles
├── node-exporter
│ ├── files
│ │ └── node_exporter-1.6.1.linux-amd64.tar.gz
│ ├── handlers
│ │ └── main.yml
│ ├── tasks
│ │ └── main.yml
│ └── templates
│ └── node_exporter.service.j2
└── register
├── files
│ └── consul_register.sh
└── tasks
├── main.yml
└── register.yml
2、配置hosts
# cat inventory/hosts
[linux]
172.18.200.53 name=linux-172.18.200.53
[linux:vars]
consul_ip=172.18.200.52
consul_port=8500
node_exporter_port=9100
3、配置node_exporter_roles.yml
# cat node_exporter_roles.yml
- hosts: linux
gather_facts: no
roles:
- role: node-exporte
4、配置roles/node-exporter
(1)下载exporter
下载地址:https://github.com/prometheus/node_exporter/releases/tag/v1.6.1
(2)配置handlers
# cat roles/node-exporter/handlers/main.yml
- name: restart node exporter service
systemd:
name: node_exporter
state: restarted
daemon-reload: yes
- include: roles/register/tasks/register.ymlr
(3)配置tasks
# cat roles/node-exporter/tasks/main.yml
- name: push node_exporter
unarchive:
src: node_exporter-1.6.1.linux-amd64.tar.gz
dest: /usr/local
- name: rename
shell: |
cd /usr/local
if [ ! -d node_exporter ]
then mv node_exporter-1.6.1.linux-amd64 node_exporter
fi
- name: copy node_exporter systemd
template:
src: node_exporter.service.j2
dest: /usr/lib/systemd/system/node_exporter.service
notify: restart node exporter service
- name: start node_exporter
systemd:
name: node_exporter
state: started
enabled: yes
daemon-reload: yes
- include: roles/register/tasks/main.yml
(4)配置templates
node_exporter_port:端口可以进行配置
# cat roles/node-exporter/templates/node_exporter.service.j2
[Unit]
Description=node_exporter
[Service]
ExecStart=/usr/local/node_exporter/node_exporter --web.listen-address=:{{ node_exporter_port }}
ExecReload=/bin/kill -HUP $MAINPID
KillMode=process
Restart=on-failure
[Install]
WantedBy=multi-user.target
5、配置roles/register
(1)配置files
# cat roles/register/files/consul_register.sh
#!/bin/bash
instance_id=$1
service_name=$2
ip=$3
port=$4
consul_ip=$5
consul_port=$6
curl -X PUT -d '{"id": "'"$instance_id"'","name": "'"$service_name"'","address": "'"$ip"'","port": '"$port"',"tags": ["'"$service_name"'"],"checks": [{"http": "http://'"$ip"':'"$port"'","interval": "5s"}]}' http://$consul_ip:$consul_port/v1/agent/service/register
(2)配置tasks
name:ansible hosts的name
group_names[0]:组名,如果属于children,那么就是group_names[1]
inventory_hostname:ansible hosts的ip
node_exporter_port:node_exporter的端口,默认9100
consul_ip:consule服务的ip
consul_port:consule服务的端口
# cat roles/register/tasks/main.yml
- name: push consul_register.sh
copy:
src: roles/register/files/consul_register.sh
dest: /usr/local/bin
- include: roles/register/tasks/register.yml
# cat roles/register/tasks/register.yml
- name: register nodes into consul
shell: /bin/bash /usr/local/bin/consul_register.sh {{ name }} {{ group_names[0] }} {{ inventory_hostname }} {{ node_exporter_port }} {{ consul_ip }} {{ consul_port }}
五、修改Prometheus配置
1、配置prometheus.yml
services中的linux:ansible hosts文件中的group名字
这里的services为列表,所有可以添加多个不同组的服务器进来,也实现了分组
# cat prometheus/conf/prometheus.yml
...
scrape_configs:
# The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
- job_name: "linux"
consul_sd_configs:
- server: 172.18.200.52:8500
services: ['linux']
2、重启
# docker restart prometheus
六、执行并添加Grafana
1、执行ansible-playbook命令
# ansible-playbook -i inventory/hosts node_exporter_roles.yml
2、查看Consul
3、添加Grafana
模板id:9276