.Using.Flume.Flexible.Scalable.and.Reliable.Data.Streaming
文件大小:
3856k
资源说明:Title: Using Flume: Flexible, Scalable, and Reliable Data Streaming
Author: Hari Shreedharan
Length: 238 pages
Edition: 1
Language: English
Publisher: O'Reilly Media
Publication Date: 2014-10-02
ISBN-10: 1449368301
ISBN-13: 9781449368302
How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, you’ll learn Flume’s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems.
Using Flume shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components for their specific use-cases. You’ll learn about Flume’s design and implementation, as well as various features that make it highly scalable, flexible, and reliable. Code examples and exercises are available on GitHub.
Learn how Flume provides a steady rate of flow by acting as a buffer between data producers and consumers
Dive into key Flume components, including sources that accept data and sinks that write and deliver it
Write custom plugins to customize the way Flume receives, modifies, formats, and writes data
Explore APIs for sending data to Flume agents from your own applications
Plan and deploy Flume in a scalable and flexible way—and monitor your cluster once it’s running
Table of Contents
Chapter 1. Apache Hadoop and Apache HBase: An Introduction
Chapter 2. Streaming Data Using Apache Flume
Chapter 3. Sources
Chapter 4. Channels
Chapter 5. Sinks
Chapter 6. Interceptors, Channel Selectors, Sink Groups, and Sink Processors
Chapter 7. Getting Data into Flume*
Chapter 8. Planning, Deploying, and Monitoring Flume
本源码包内暂不包含可直接显示的源代码文件,请下载源码包。