How to migrate MySQL data to Elasticsearch using Logstash

Date: 2019-04-26 06:34:59

Tags: mysql elasticsearch logstash kibana

I need a brief explanation of how to move MySQL data into Elasticsearch using Logstash. Can anyone explain the step-by-step process?

3 Answers:

Answer 0 (score: 1)

You can use the jdbc input plugin for Logstash.

Here is an example configuration.
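In case the link changes, here is a minimal sketch of such a pipeline for MySQL (the driver path, database, credentials, and index name are illustrative placeholders, not values from the linked example):

input {
    jdbc {
        # Path to the MySQL Connector/J driver JAR (downloaded separately)
        jdbc_driver_library => "/path/to/mysql-connector-java-5.1.47.jar"
        jdbc_driver_class => "com.mysql.jdbc.Driver"
        jdbc_connection_string => "jdbc:mysql://localhost:3306/mydb"
        jdbc_user => "user"
        jdbc_password => "password"
        # Every row returned by the query becomes one Elasticsearch document
        statement => "SELECT * FROM users"
    }
}

output {
    elasticsearch {
        hosts => ["http://localhost:9200"]
        index => "users"
    }
}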

Answer 1 (score: 0)

This is a broad question, and I don't know how much you know about MySQL and ES. Say you have a table user: you can simply dump it to CSV and load it into ES. But if your data is dynamic (e.g., the MySQL table behaves like a pipeline), you will need to write a script to do that work. In any case, you can consult the links below to build up the basics, and then ask how to do it; a sketch of doing the CSV import with Logstash itself follows the links.

How to dump mysql?

How to load data to ES

Additionally, you may want to know how to convert CSV into a JSON file, which is the format best suited for ES:

How to convert CSV to JSON
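If you take the CSV route, Logstash itself can also read the dump and index it into ES; here is a rough sketch (the file path and column names are assumptions for illustration):

input {
    file {
        path => "/path/to/users.csv"
        start_position => "beginning"
        # Do not persist the read position, so the file is re-read on each run (handy for one-off imports)
        sincedb_path => "/dev/null"
    }
}

filter {
    csv {
        # Assumed column layout of the dumped user table
        columns => ["id", "name", "email"]
        separator => ","
    }
}

output {
    elasticsearch {
        hosts => ["http://localhost:9200"]
        index => "users"
    }
}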

Answer 2 (score: 0)

Let me give you some high-level guidance.

  • Install Logstash and Elasticsearch.
  • Copy the JAR ojdbc7.jar (the Oracle JDBC driver; see the MySQL note after the config) into the Logstash bin folder.
  • For Logstash, create a configuration file, e.g. config.yml:
input {
    # Get the data from database, configure fields to get data incrementally
    jdbc {
        jdbc_driver_library => "./ojdbc7.jar"
        jdbc_driver_class => "Java::oracle.jdbc.driver.OracleDriver"
        jdbc_connection_string => "jdbc:oracle:thin:@db:1521:instance"
        jdbc_user => "user"
        jdbc_password => "pwd"

        id => "some_id"

        jdbc_validate_connection => true
        jdbc_validation_timeout => 1800
        connection_retry_attempts => 10
        connection_retry_attempts_wait_time => 10

        #fetch the db logs using logid
        statement => "select * from customer.table where logid > :sql_last_value order by logid asc"

        #limit how many results are pre-fetched at a time from the cursor into the client’s cache before retrieving more results from the result-set
        jdbc_fetch_size => 500
        jdbc_default_timezone => "America/New_York"

        use_column_value => true
        tracking_column => "logid"
        tracking_column_type => "numeric"
        record_last_run => true

        schedule => "*/2 * * * *"

        type => "log.customer.table"
        add_field => {"source" => "customer.table"}
        add_field => {"tags" => "customer.table" } 
        add_field => {"logLevel" => "ERROR" }

        last_run_metadata_path => "last_run_metadata_path_table.txt"
    }

}

# Massage the data to store in index
filter {
    if [type] == 'log.customer.table' {
        #assign values from db column to custom fields of index
        ruby {
            code => "event.set( 'errorid', event.get('ssoerrorid') );
                    event.set( 'msg', event.get('errormessage') );
                    event.set( 'logTimeStamp', event.get('date_created'));
                    event.set( '@timestamp', event.get('date_created'));
                    "
        }
        #remove the db columns that were mapped to custom fields of index
        mutate {
            remove_field => ["ssoerrorid","errormessage","date_created" ]
        }
    }#end of [type] == 'log.customer.table' 
} #end of filter

# Insert into index
output {
    if [type] == 'log.customer.table' {
        amazon_es {
            hosts => ["vpc-xxx-es-yyyyyyyyyyyy.us-east-1.es.amazonaws.com"]
            region => "us-east-1"
            aws_access_key_id => '<access key>'
            aws_secret_access_key => '<secret password>'
            index => "production-logs-table-%{+YYYY.MM.dd}"
        }
    }
}
  • Go to the bin folder and run: logstash -f config.yml
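Note that the configuration above connects to Oracle (ojdbc7.jar and the oracle.jdbc.driver.OracleDriver class), while the question is about MySQL. A hedged sketch of the substitution in the jdbc block (driver version, host, port, and schema are illustrative assumptions):

    jdbc {
        # MySQL Connector/J in place of the Oracle driver
        jdbc_driver_library => "./mysql-connector-java-5.1.47.jar"
        jdbc_driver_class => "com.mysql.jdbc.Driver"
        jdbc_connection_string => "jdbc:mysql://db-host:3306/customer"
        jdbc_user => "user"
        jdbc_password => "pwd"
        # ...keep the remaining settings (statement, tracking_column, schedule, etc.) as above
    }

The rest of the pipeline (filter and output) is database-agnostic and can stay unchanged.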