Data Analytics with Spark Using Python, 1st edition

Published by Addison-Wesley Professional (June 6, 2018) © 2018

  • Jeffrey Aven

Title overview

Spark is at the heart of today's Big Data revolution, helping data professionals supercharge efficiency and performance in a wide range of data processing and analytics tasks. In this guide, Big Data expert Jeffrey Aven covers all you need to know to leverage Spark, together with its extensions, subprojects, and wider ecosystem.

Aven combines a language-agnostic introduction to foundational Spark concepts with extensive programming examples using the popular and intuitive PySpark development environment. This guide's focus on Python makes it accessible to a broad audience of data professionals, analysts, and developers, even those with little Hadoop or Spark experience.

Aven's broad coverage ranges from basic to advanced Spark programming, and from Spark SQL to machine learning. You'll learn how to efficiently manage all forms of data with Spark: streaming, structured, semi-structured, and unstructured. Throughout, concise topic overviews quickly get you up to speed, and extensive hands-on exercises prepare you to solve real problems.

Table of contents

  • Introduction
  • Chapter 1 Introducing Big Data, Hadoop, and Spark
  • Chapter 2 Deploying Spark
  • Chapter 3 Understanding the Spark Cluster Architecture
  • Chapter 4 Learning Spark Programming Basics
  • Chapter 5 Advanced Programming Using the Spark Core API
  • Chapter 6 SQL and NoSQL Programming with Spark
  • Chapter 7 Stream Processing and Messaging Using Spark
  • Chapter 8 Introduction to Data Science and Machine Learning Using Spark
