Learning PySpark (eBook, ePUB)

Sofort per Download lieferbar
26,95 €
inkl. MwSt.
Weitere Ausgaben:
PAYBACK Punkte
13 °P sammeln!
Apache Spark is an open source framework for efficient cluster computing with a strong interface for data parallelism and fault tolerance. This book will show you how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Spark 2.0 architecture and how to set up a Python environment for Spark. You will get familiar with the modules available in PySpark. You will learn how to abstract data with RDDs and DataFrames and understand the streaming capabilities of PySpark. Also, you will get a thorough overview of machine learn...