【Posted】: 2016-05-14 10:44:25
【Question】:
I have an Elasticsearch server:
{
  "cluster_name" : "elasticsearch",
  "status" : "yellow",
  "timed_out" : false,
  "number_of_nodes" : 76,
  "number_of_data_nodes" : 1,
  "active_primary_shards" : 297,
  "active_shards" : 297,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 297,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0
}
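It runs on a dual-core machine with 30 GB of RAM and receives logs from 3 to 4 Logstash servers, with 30 inputs in total across all Logstash servers. For most inputs, logs are being dropped: I get no logs for 30-40 minutes at a time, and the Logstash servers show warnings such as retrying-failed-action-with-response-code-429. Memory usage on the Elasticsearch server is also very high. The heartbeat for the log files is very high. I checked the grok patterns and they are correct.
Here is one of my conf files: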
input {
  exec {
    command => "/usr/bin/touch /var/run/logstash-monit/input.touch && /bin/echo OK."
    interval => 60
    type => "heartbeat"
  }
  file {
    type => 'seller-forever'
    path => '/var/log/seller/seller.log'
    sincedb_path => "/opt/logstash/sincedb-access1"
  }
}
filter {
  grok {
    type => "seller-forever"
    match => [ "message", "%{GREEDYDATA:logline} %{GREEDYDATA:extra_fields}" ]
  }
  geoip {
    add_tag => [ "GeoIP" ]
    database => "/opt/logstash/GeoLiteCity.dat"
    source => "clientip"
  }
  if [useragent] != "-" and [useragent] != "" {
    useragent {
      add_tag => [ "UA" ]
      source => "useragent"
    }
  }
  if [bytes] == 0 { mutate { remove => "[bytes]" } }
  if [geoip][city_name] == "" { mutate { remove => "[geoip][city_name]" } }
  if [geoip][continent_code] == "" { mutate { remove => "[geoip][continent_code]" } }
  if [geoip][country_code2] == "" { mutate { remove => "[geoip][country_code2]" } }
  if [geoip][country_code3] == "" { mutate { remove => "[geoip][country_code3]" } }
  if [geoip][country_name] == "" { mutate { remove => "[geoip][country_name]" } }
  if [geoip][latitude] == "" { mutate { remove => "[geoip][latitude]" } }
  if [geoip][longitude] == "" { mutate { remove => "[geoip][longitude]" } }
  if [geoip][postal_code] == "" { mutate { remove => "[geoip][postal_code]" } }
  if [geoip][region_name] == "" { mutate { remove => "[geoip][region_name]" } }
  if [geoip][time_zone] == "" { mutate { remove => "[geoip][time_zone]" } }
  if [urlquery] == "" { mutate { remove => "urlquery" } }
  if "apache_json" in [tags] {
    if [method] =~ "(HEAD|OPTIONS)" { mutate { remove => "method" } }
    if [useragent] == "-" { mutate { remove => "useragent" } }
    if [referer] == "-" { mutate { remove => "referer" } }
  }
  if "UA" in [tags] {
    if [device] == "Other" { mutate { remove => "device" } }
    if [name] == "Other" { mutate { remove => "name" } }
    if [os] == "Other" { mutate { remove => "os" } }
  }
}
output {
  stdout { codec => rubydebug }
  elasticsearch {
    type => "seller-forever"
    index => "seller-forever"
    host => "10.0.0.89"
    protocol => "node"
  }
}
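I am using Kibana for visualization. How should I troubleshoot this, and what should I check? Any help would be appreciated; I can't figure out what to do.
【Comments】: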
-
Check the Logstash error logs in /var/log/logstash/.
-
@NishantSingh There is nothing in logstash.log. This is what I found in logstash.err:
May 14, 2016 10:28:35 AM org.elasticsearch.cluster.service.InternalClusterService$UpdateTask run
INFO: [logstash-ip-10-0-0-105-8431-13990] added {[logstash-ip-10-0-0-105-6096-13970][M-skGUKgQXC-_Zt9kHma6w][ip-10-0-0-105][inet[/10.0.0.105:9303]]{client=true, data=false},}, reason: zen-disco-receive(from master [[Hammer Harrison][GPRbvuZ4RJW_Cq_iPW1i7A][ip-10-0-0-89][inet[/10.0.0.89:9300]]])
-
A 429 response code means your Logstash clients are sending more requests than the node can handle.
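To put numbers on that, the cluster health output from the question can be summarized with a few lines of Python. This is only a rough sketch: the names `HEALTH_JSON` and `summarize_health` are made up for illustration, and the JSON is copied from the question rather than fetched from a live cluster.

```python
import json

# Cluster health response as posted in the question.
HEALTH_JSON = """
{
  "cluster_name": "elasticsearch",
  "status": "yellow",
  "timed_out": false,
  "number_of_nodes": 76,
  "number_of_data_nodes": 1,
  "active_primary_shards": 297,
  "active_shards": 297,
  "relocating_shards": 0,
  "initializing_shards": 0,
  "unassigned_shards": 297,
  "delayed_unassigned_shards": 0,
  "number_of_pending_tasks": 0,
  "number_of_in_flight_fetch": 0
}
"""


def summarize_health(health):
    """Derive a few diagnostic numbers from a cluster health response (hypothetical helper)."""
    data_nodes = health["number_of_data_nodes"]
    primaries = health["active_primary_shards"]
    return {
        # Nodes that joined the cluster but hold no data (client or master-only nodes).
        "non_data_nodes": health["number_of_nodes"] - data_nodes,
        # Active shards that are not primaries, i.e. allocated replicas.
        "active_replicas": health["active_shards"] - primaries,
        # "yellow" means every primary is allocated, so unassigned shards are replicas.
        "unassigned_are_replicas": health["status"] == "yellow"
        and health["unassigned_shards"] > 0,
        # Average number of primary shards each data node has to serve.
        "primaries_per_data_node": primaries / data_nodes if data_nodes else None,
    }


if __name__ == "__main__":
    summary = summarize_health(json.loads(HEALTH_JSON))
    for key, value in summary.items():
        print(f"{key}: {value}")
```

For the posted output this gives 75 non-data nodes, 0 active replicas, and 297 primary shards on the single data node. The 75 non-data nodes are consistent with the logstash.err line above, where a Logstash instance joins the cluster with client=true, data=false, which is what `protocol => "node"` in the output section does. The 297 unassigned shards are replicas, which cannot be allocated while there is only one data node, hence the yellow status. Whether this load fully explains the 429 responses is a guess, but it is a reasonable place to start looking.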
Tags: elasticsearch logstash kibana elastic-stack logstash-configuration