Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale, 1st edition

Published by Pearson (December 8, 2016) © 2017

  • Ofer Mendelevitch
  • Casey Stella
  • Douglas Eadline
Products list
Products list

This product is expected to ship within 5-7 business days for Australian customers.

This book provides a unique perspective on applying data science with Hadoop by explaining what data science with Hadoop is all about, its practical business applications, and then diving deep into the details and providing a hands-on tutorial and showcase of various use-cases from the real world. The authors bring together all the practical knowledge students will need to do real, useful data science with Hadoop.

The full text downloaded to your computer

With eBooks you can:

  • search for key concepts, words and phrases
  • make highlights and notes as you study
  • share your notes with friends

eBooks are downloaded to your computer and accessible either offline through the Bookshelf (available as a free download), available online and also via the iPad and Android apps.

Upon purchase, you'll gain instant access to this eBook.

Time limit

The eBooks products do not have an expiry date. You will continue to access your digital ebook products whilst you have your Bookshelf installed.

  • Part I: Data Science with Hadoop—An Overview
  • Chapter 1: Introduction to Data Science
  • Chapter 2: Use Cases for Data Science
  • Chapter 3: Hadoop and Data Science
  • Part II: Preparing and Visualizing Data with Hadoop
  • Chapter 4: Getting Data into Hadoop
  • Chapter 5: Data Munging with Hadoop
  • Chapter 6: Exploring and Visualizing Data
  • Part III: Applying Data Modeling with Hadoop
  • Chapter 7: Machine Learning with Hadoop
  • Chapter 8: Predictive Modeling
  • Chapter 9: Clustering
  • Chapter 10: Anomaly Detection with Hadoop
  • Chapter 11: Natural Language Processing
  • Chapter 12: Data Science with Hadoop—The Next Frontier
  • Appendix A: Book Web Page and Code Download
  • Appendix B: HDFS Quick Start
  • Appendix C: Additional Background on Data Science and Apache Hadoop and Spark

Need help? Get in touch