【Posted】: 2022-01-04 23:36:44
【Question】:
I am trying to deploy Prometheus on GKE to monitor an existing Strimzi Kafka cluster on GKE, but I am running into issues. (Reference: https://strimzi.io/docs/operators/latest/deploying.html#proc-metrics-deploying-prometheus-operator-str)
Here is what I have done so far:
- Created the namespace monitoring; Kafka is deployed in the namespace kafka.
- Modified kafka-deployment.yaml to include metricsConfig and kafkaExporter, as specified in the file https://github.com/strimzi/strimzi-kafka-operator/tree/0.26.0/examples/metrics/kafka-metrics.yaml
Here are the changes:
apiVersion: kafka.strimzi.io/v1beta2
kind: Kafka
metadata:
  name: prom-graf
spec:
  kafka:
    version: 3.0.0
    ........
    metricsConfig:
      type: jmxPrometheusExporter
      valueFrom:
        configMapKeyRef:
          name: kafka-metrics
          key: kafka-metrics-config.yml
  zookeeper:
    ......
    metricsConfig:
      type: jmxPrometheusExporter
      valueFrom:
        configMapKeyRef:
          name: kafka-metrics
          key: zookeeper-metrics-config.yml
  entityOperator:
    topicOperator: {}
    userOperator: {}
  kafkaExporter:
    topicRegex: ".*"
    groupRegex: ".*"
Command used to apply the changes to the Kafka cluster:
kubectl apply -f kafka-deployment.yaml -n kafka
- Created the Prometheus Operator.
Command:
curl -s https://raw.githubusercontent.com/coreos/prometheus-operator/master/bundle.yaml > bundle.yaml
Changed the namespace to 'monitoring' in bundle.yaml, then created the Prometheus Operator:
kubectl create -f bundle.yaml -n monitoring
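The namespace change in bundle.yaml can be scripted rather than edited by hand. The following is a minimal sketch, assuming every namespace field in the bundle sits on its own line as `namespace: <value>`; the function name `rewrite_namespace` and the inline sample are illustrative and not part of the original post or of the operator bundle.

```shell
# Sketch: rewrite every "namespace: <value>" line read from stdin to a target namespace.
# rewrite_namespace is a hypothetical helper name, not part of kubectl or the operator.
rewrite_namespace() {
  target_ns="$1"
  # Keep the indentation and the "namespace:" key, replace only the value.
  sed -E 's/^([[:space:]]*namespace:)[[:space:]]*[A-Za-z0-9-]+[[:space:]]*$/\1 '"$target_ns"'/'
}

# Demo on an inline sample shaped like one entry of the bundle.
rewrite_namespace monitoring <<'EOF'
kind: ServiceAccount
metadata:
  name: prometheus-operator
  namespace: default
EOF
```

Against the real file this could be run as `rewrite_namespace monitoring < bundle.yaml > bundle-monitoring.yaml` (the output file name is only an example), followed by `kubectl create -f` on the result. Writing to a new file avoids the differences between GNU and BSD `sed -i`.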
Ran the following commands to create Prometheus and Grafana on GKE. The files are the same as those on GitHub (https://github.com/strimzi/strimzi-kafka-operator/tree/0.26.0/examples/metrics), apart from the namespace changes I made:
kubectl apply -f strimzi-pod-monitor.yaml
(changed the namespace to kafka, since Kafka is deployed in the kafka namespace)
kubectl apply -f prometheus-rules.yaml -n monitoring
kubectl apply -f prometheus.yaml -n monitoring
kubectl apply -f grafana.yaml -n monitoring
- Created a secret from the file prometheus-additional.yaml:
kubectl create secret generic additional-scrape-configs --from-file=prometheus-additional.yaml -n monitoring
When I list the objects created in the monitoring namespace, I see the following:
Karans-MacBook-Pro:prometheus-yamls karanalang$ kc get all,prometheus -n monitoring
NAME                                       READY   STATUS    RESTARTS   AGE
pod/grafana-5ccfb68647-55mzk               0/1     Running   0          21s
pod/prometheus-operator-85d84bb848-57lrn   1/1     Running   0          46m

NAME                          TYPE        CLUSTER-IP    EXTERNAL-IP   PORT(S)    AGE
service/grafana               ClusterIP   10.100.3.10   <none>        3000/TCP   21s
service/prometheus-operator   ClusterIP   None          <none>        8080/TCP   46m

NAME                                  READY   UP-TO-DATE   AVAILABLE   AGE
deployment.apps/grafana               0/1     1            0           24s
deployment.apps/prometheus-operator   1/1     1            1           46m

NAME                                             DESIRED   CURRENT   READY   AGE
replicaset.apps/grafana-5ccfb68647               1         1         0       23s
replicaset.apps/prometheus-operator-85d84bb848   1         1         1       46m

NAME                                          VERSION   REPLICAS   AGE
prometheus.monitoring.coreos.com/prometheus             3          44m
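My question: how do I get the hostname of the Prometheus installation (I need to add it as a data source in Grafana)? Also, I don't see a Prometheus pod in the listing; shouldn't one have been created?
Any ideas on how to debug or fix this? TIA!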
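For reference, a few read-only checks might help narrow this down. This is a minimal sketch, assuming the Prometheus resource is named prometheus and lives in the monitoring namespace, as in the listing above; the function name `diagnose_prometheus` is illustrative and not part of any tool. The Prometheus Operator normally turns a Prometheus resource into a StatefulSet named `prometheus-<name>` and a headless Service named `prometheus-operated`, so if both are missing, the operator has probably not reconciled the resource yet.

```shell
# Sketch: inspect why a Prometheus custom resource has no running pod.
# diagnose_prometheus is a hypothetical helper; the defaults mirror the names used in this post.
diagnose_prometheus() {
  diag_ns="${1:-monitoring}"    # namespace holding the Prometheus resource
  diag_name="${2:-prometheus}"  # name of the Prometheus custom resource

  # The operator should create a StatefulSet (prometheus-<name>) and its pods.
  kubectl get statefulset,pods -n "$diag_ns"

  # Spec and any status recorded on the custom resource.
  kubectl describe prometheus "$diag_name" -n "$diag_ns"

  # Recent events, e.g. a missing ServiceAccount or Secret.
  kubectl get events -n "$diag_ns" --sort-by=.lastTimestamp

  # Operator logs usually say why reconciliation failed.
  kubectl logs deployment/prometheus-operator -n "$diag_ns" --tail=50

  # The Service that fronts the Prometheus pods once they exist.
  kubectl get service prometheus-operated -n "$diag_ns"
}
```

It could be called as `diagnose_prometheus monitoring prometheus`. If the `prometheus-operated` Service does exist, Grafana running in the same namespace can typically reach Prometheus at http://prometheus-operated:9090 (9090 is the default Prometheus port). This only works once the operator has actually created the pod.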
【Discussion】:
Tags: kubernetes google-kubernetes-engine prometheus monitoring grafana