基于上一节搭建的 Spark 环境,这一节我们继续搭建 Flume 相关的环境。在开始之前我们先介绍一个什么是 Flume?引用官网对于 Flume 的阐述:
Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.