Unified log collection with logstash + elasticsearch + kafka

Environment preparation

The related software below can be downloaded from (http://pan.baidu.com/s/1c1Dj41U).

  1. Install the JDK
  2. Install elasticsearch
    Download the latest version from (https://www.elastic.co/downloads) and unpack it; running bin/elasticsearch.bat is enough. In the elasticsearch.yml configuration file, set network.host to an externally reachable IP; 0.0.0.0 is recommended (see the snippet after this list).
  3. Install kibana
    Download the latest version from (https://www.elastic.co/downloads), unpack it, and run bin/kibana.bat to start it.
  4. Install logstash
    Download the latest version from (https://www.elastic.co/downloads) and unpack it to a directory of your choice.
  5. Install kafka
    Download the latest binary release from (http://kafka.apache.org/downloads.html).
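
For step 2, the relevant elasticsearch.yml change is a single line. A minimal sketch (0.0.0.0 binds to all interfaces; prefer a specific address in production):

# elasticsearch.yml: listen on all interfaces so logstash and kibana on other hosts can connect
network.host: 0.0.0.0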

Configuration

  1. Create a file named log4j.conf under the logstash/config directory with the content below, then start the logstash service (logstash.bat -f ../config/log4j.conf):
input {
  log4j {
    mode => "server"
    type => "log4j-json"
    port => 4712
  }
}
filter {}
output {
  stdout { codec => rubydebug }
  elasticsearch { hosts => ["127.0.0.1"] }
}
  2. In the application, configure log4j with a SocketAppender; the log4j properties configuration is as follows (a verification sketch follows the properties):
log4j.rootLogger=info,logstash
# socket appender that ships events to the logstash server
log4j.appender.logstash=org.apache.log4j.net.SocketAppender
log4j.appender.logstash.Port=4712
log4j.appender.logstash.RemoteHost=localhost
log4j.appender.logstash.ReconnectionDelay=60000
log4j.appender.logstash.LocationInfo=true
# application name
log4j.appender.logstash.application=stage
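
To verify the pipeline end to end, a log statement from any class is enough. A minimal sketch, assuming log4j 1.x and the properties above are on the classpath (the class name is hypothetical):

import org.apache.log4j.Logger;

public class LogstashDemo {
    private static final Logger log = Logger.getLogger(LogstashDemo.class);

    public static void main(String[] args) {
        // each event is serialized by the SocketAppender and sent to logstash on port 4712
        log.info("hello logstash");
        log.error("something failed", new RuntimeException("demo"));
    }
}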

Other notes

  1. logstash's elasticsearch plugin stores events by default in indices named logstash-%{+YYYY.MM.dd}. The first time you open kibana you are asked to create an index pattern; the default logstash-* is fine.
  2. elasticsearch is best deployed as a cluster so that log collection continues uninterrupted.
  3. This example has log4j's SocketAppender write directly to logstash; if logstash's socket service goes down, log events are lost. Having log4j write to kafka, and logstash read from kafka, avoids this problem.

Collecting logs via Kafka

Add the third-party dependency kafka-log4j-appender

<dependency>
    <groupId>org.apache.kafka</groupId>
    <artifactId>kafka-log4j-appender</artifactId>
    <version>0.10.0.0</version>
</dependency>
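
As a rule of thumb, keep the appender's version aligned with the Kafka brokers you run; the 0.10.0.0 artifact above assumes a 0.10.x broker.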

Configure the Kafka appender

## appender kafka
log4j.appender.kafka=org.apache.kafka.log4jappender.KafkaLog4jAppender
log4j.appender.kafka.topic=logstash
log4j.appender.kafka.brokerList=brokerNode1:9091,brokerNode2:9092
log4j.appender.kafka.compressionType=none
log4j.appender.kafka.syncSend=true
log4j.appender.kafka.layout=org.apache.log4j.PatternLayout
log4j.appender.kafka.layout.ConversionPattern=%d{yyyy-MM-dd HH:mm:ss} %-5p %c{1}:%L %% - %m%n
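
An appender only emits once it is attached to a logger. A minimal sketch, assuming the Kafka appender runs alongside the socket appender from the earlier properties file:

# attach both the socket appender and the kafka appender to the root logger
log4j.rootLogger=info,logstash,kafka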

Configure the Kafka input in logstash

input {
  kafka {
    bootstrap_servers => "192.168.1.202:9092,192.168.1.202:9093"
    topics => ["logstash"]
    type => "log4j-json"
  }
}
filter {
  grok {
    # use GREEDYDATA for the trailing field: DATA is non-greedy and would capture an empty msg
    match => { "message" => "%{TIMESTAMP_ISO8601:logtime} \[%{DATA:javaThread}\] %{WORD:project} %{LOGLEVEL:loglevel} %{GREEDYDATA:msg}" }
  }
}
output {
  stdout { codec => rubydebug }
  elasticsearch { hosts => ["127.0.0.1"] }
}

Notes:

  1. The Kafka topic must be configured identically in the log4j appender and the logstash input;
  2. logstash's elasticsearch plugin defaults to port 9200; multiple addresses must be written as ["127.0.0.1:9200","127.0.0.2:9200"];
  3. logstash filter plugins (such as the grok filter above) can use regular expressions to extract key strings into separate, searchable fields;
  4. The current version of KafkaLog4jAppender has a bug that prevents exception stack traces from being output; the source must be modified by rewriting the subAppend method as follows (see the note after the method for why this works):
private String subAppend(LoggingEvent event) {
    StringBuffer msg = new StringBuffer();
    // use the layout if one is set, otherwise fall back to the rendered message
    msg.append((this.layout == null) ? event.getRenderedMessage() : this.layout.format(event));

    // the layout may ignore throwables, so append the stack trace manually
    // (also guard against a null layout, which the original check did not)
    if (this.layout == null || this.layout.ignoresThrowable()) {
        String[] s = event.getThrowableStrRep();
        if (s != null) {
            for (int i = 0; i < s.length; i++) {
                msg.append(s[i]);
                msg.append(Layout.LINE_SEP);
            }
        }
    }
    return msg.toString();
}
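
The reason this works: the stock subAppend appends only the layout's formatted output, and PatternLayout's ignoresThrowable() returns true, so stack traces were silently dropped. The added branch appends event.getThrowableStrRep() line by line, mirroring what log4j's own WriterAppender does for layouts that ignore throwables.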

Reposted from blog.csdn.net/wlddhj/article/details/83902356