Learning Apache Spark 2 | Megabooks CZ

Learning Apache Spark 2

AngličtinaEbook

Muhammad Asif Abbasi, Abbasi

Packt Publishing

EAN: 9781785889585

Dostupné online

836 Kč

Běžná cena: 929 Kč

Sleva 10 %

ks

Dostupné formáty

Ebook DRM PDF 836 Kč Ebook DRM EPUB 836 Kč

Podrobné informace

Learn about the fastest-growing open source project in the world, and find out how it revolutionizes big data analyticsAbout This BookExclusive guide that covers how to get up and running with fast data processing using Apache SparkExplore and exploit various possibilities with Apache Spark using real-world use cases in this bookWant to perform efficient data processing at real time? This book will be your one-stop solution.Who This Book Is ForThis guide appeals to big data engineers, analysts, architects, software engineers, even technical managers who need to perform efficient data processing on Hadoop at real time. Basic familiarity with Java or Scala will be helpful.The assumption is that readers will be from a mixed background, but would be typically people with background in engineering/data science with no prior Spark experience and want to understand how Spark can help them on their analytics journey.What You Will LearnGet an overview of big data analytics and its importance for organizations and data professionalsDelve into Spark to see how it is different from existing processing platformsUnderstand the intricacies of various file formats, and how to process them with Apache Spark.Realize how to deploy Spark with YARN, MESOS or a Stand-alone cluster manager.Learn the concepts of Spark SQL, SchemaRDD, Caching and working with Hive and Parquet file formatsUnderstand the architecture of Spark MLLib while discussing some of the off-the-shelf algorithms that come with Spark.Introduce yourself to the deployment and usage of SparkR.Walk through the importance of Graph computation and the graph processing systems available in the marketCheck the real world example of Spark by building a recommendation engine with Spark using ALS.Use a Telco data set, to predict customer churn using Random Forests.In DetailSpark juggernaut keeps on rolling and getting more and more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML and Graph X all accessible via Java, Scala, Python and R. Deploying the key capabilities is crucial whether it is on a Standalone framework or as a part of existing Hadoop installation and configuring with Yarn and Mesos.The next part of the journey after installation is using key components, APIs, Clustering, machine learning APIs, data pipelines, parallel programming. It is important to understand why each framework component is key, how widely it is being used, its stability and pertinent use cases.Once we understand the individual components, we will take a couple of real life advanced analytics examples such as 'Building a Recommendation system', 'Predicting customer churn' and so on.The objective of these real life examples is to give the reader confidence of using Spark for real-world problems.Style and approachWith the help of practical examples and real-world use cases, this guide will take you from scratch to building efficient data applications using Apache Spark.You will learn all about this excellent data processing engine in a step-by-step manner, taking one aspect of it at a time.This highly practical guide will include how to work with data pipelines, dataframes, clustering, SparkSQL, parallel programming, and such insightful topics with the help of real-world use cases.

EAN 9781785889585

ISBN 1785889583

Typ produktu Ebook

Vydavatel Packt Publishing

Datum vydání 28. března 2017

Stránky 356

Jazyk English

Země Uruguay

Autoři Muhammad Asif Abbasi, Abbasi

25 718 130 knih a 4 738 326 e-knih právě v nabídce

Osobní odběr zdarma
na 9 místech po celé České republice

Doprava zdarma
při nákupu nad 1000 Kč

Knihy z celého světa

ISO 9001 Certification - Bureau Veritas

ISO 27001 Certification - Bureau Veritas

Asociace pro elektronickou komerci

CZECH Stability Award

Naše společnost

Visa

Visa Electron

Mastercard

Maestro

Endered Card

Sodexo Pluxee

© 2022 Megabooks CZ spol. s r.o. - zahraniční literatura. Výhradní distributor Oxford University Press

Všechny ceny jsou uvedeny s DPH

25 718 130 knih a 4 738 326 e-knih právě v nabídce

Osobní odběr zdarma
na 9 místech po celé České republice

Doprava zdarma
při nákupu nad 1000 Kč

Knihy z celého světa

Visa

Visa Electron

Mastercard

Maestro

Endered Card

Sodexo Pluxee

ISO 9001 Certification - Bureau Veritas

ISO 27001 Certification - Bureau Veritas

CZECH Stability Award

Asociace pro elektronickou komerci

© 2022 Megabooks CZ spol. s r.o. - zahraniční literatura. Výhradní distributor Oxford University Press

Všechny ceny jsou uvedeny s DPH