Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS, 1st edition

Published by Addison-Wesley Professional (November 29, 2016) © 2017

  • Sam R. Alapati
Products list
  • Available for purchase from all major ebook resellers, including InformIT.com
Products list

Details

  • A print text
  • Free shipping
  • Also available for purchase as an ebook from all major ebook resellers, including InformIT.com

This product is expected to ship within 3-6 business days for US and 5-10 business days for Canadian customers.

In Expert Hadoop® Administration, leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimizing production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples.


Alapati demystifies complex Hadoop environments, helping readers understand exactly what happens behind the scenes when they administer their cluster. Students will gain unprecedented insight as they walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes.

  • Part I: Introduction to Hadoop—Architecture and Hadoop Clusters
  • Chapter 1: Introduction to Hadoop and Its Environment
  • Chapter 2: An Introduction to the Architecture of Hadoop
  • Chapter 3: Creating and Configuring a Simple Hadoop Cluster
  • Chapter 4: Planning for and Creating a Fully Distributed Cluster
  • Part II: Hadoop Application Frameworks
  • Chapter 5: Running Applications in a Cluster—The MapReduce Framework (and Hive and Pig)
  • Chapter 6: Running Applications in a Cluster—The Spark Framework
  • Chapter 7: Running Spark Applications
  • Part III: Managing and Protecting Hadoop Data and High Availability
  • Chapter 8: The Role of the NameNode and How HDFS Works
  • Chapter 9: HDFS Commands, HDFS Permissions and HDFS Storage
  • Chapter 10: Data Protection, File Formats and Accessing HDFS
  • Chapter 11: NameNode Operations, High Availability and Federation
  • Part IV: Moving Data, Allocating Resources, Scheduling Jobs and Security
  • Chapter 12: Moving Data Into and Out of Hadoop
  • Chapter 13: Resource Allocation in a Hadoop Cluster
  • Chapter 14: Working with Oozie to Manage Job Workflows
  • Chapter 15: Securing Hadoop
  • Part V: Monitoring, Optimization and Troubleshooting
  • Chapter 16: Managing Jobs, Using Hue and Performing Routine Tasks
  • Chapter 17: Monitoring, Metrics and Hadoop Logging
  • Chapter 18: Tuning the Cluster Resources, Optimizing MapReduce Jobs and Benchmarking
  • Chapter 19: Configuring and Tuning Apache Spark on YARN
  • Chapter 20: Optimizing Spark Applications
  • Chapter 21: Troubleshooting Hadoop—A Sampler
  • Chapter 22: Installing VirtualBox and Linux and Cloning the Virtual Machines

Need help? Get in touch