Practical Hadoop Ecosystem

English, Ebook
Vohra, Deepak
APress
EAN: 9781484221990
Available online
CZK 1,539
Regular price CZK 1,710
Discount 10%

Detailed information

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.

What You Will Learn:
- Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
- Run a MapReduce job
- Store data with Apache Hive and Apache HBase
- Index data in HDFS with Apache Solr
- Develop a Kafka messaging system
- Stream logs to HDFS with Apache Flume
- Transfer data from a MySQL database to Hive, HDFS, and HBase with Sqoop
- Create a Hive table over Apache Solr
- Develop a Mahout User Recommender System

Who This Book Is For:
Apache Hadoop developers. Prerequisite knowledge of Linux and some knowledge of Hadoop is required.
EAN 9781484221990
ISBN 1484221990
Binding Ebook
Publisher APress
Publication date September 30, 2016
Language English
Country United States
Authors Vohra, Deepak