Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS, 1st edition

Sam R. Alapati

Products list

VitalSource eTextbook

Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS
ISBN-13: 9780134703381 Published 2016 | Published 2016

$62.95

$62.95 AUD

Instant access

Title overview

In Expert Hadoop® Administration, leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimising production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples.

Alapati demystifies complex Hadoop environments, helping readers understand exactly what happens behind the scenes when they administer their cluster. Students will gain unprecedented insight as they walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes.

The full text downloaded to your computer

With eBooks you can:

search for key concepts, words and phrases
make highlights and notes as you study
share your notes with friends

eBooks are downloaded to your computer and accessible either offline through the Bookshelf (available as a free download), available online and also via the iPad and Android apps.

Upon purchase, you'll gain instant access to this eBook.

Time limit

The eBooks products do not have an expiry date. You will continue to access your digital ebook products whilst you have your Bookshelf installed.

Part I: Introduction to Hadoop—Architecture and Hadoop Clusters
Chapter 1: Introduction to Hadoop and Its Environment
Chapter 2: An Introduction to the Architecture of Hadoop
Chapter 3: Creating and Configuring a Simple Hadoop Cluster
Chapter 4: Planning for and Creating a Fully Distributed Cluster
Part II: Hadoop Application Frameworks
Chapter 5: Running Applications in a Cluster—The MapReduce Framework (and Hive and Pig)
Chapter 6: Running Applications in a Cluster—The Spark Framework
Chapter 7: Running Spark Applications
Part III: Managing and Protecting Hadoop Data and High Availability
Chapter 8: The Role of the NameNode and How HDFS Works
Chapter 9: HDFS Commands, HDFS Permissions and HDFS Storage
Chapter 10: Data Protection, File Formats and Accessing HDFS
Chapter 11: NameNode Operations, High Availability and Federation
Part IV: Moving Data, Allocating Resources, Scheduling Jobs and Security
Chapter 12: Moving Data Into and Out of Hadoop
Chapter 13: Resource Allocation in a Hadoop Cluster
Chapter 14: Working with Oozie to Manage Job Workflows
Chapter 15: Securing Hadoop
Part V: Monitoring, Optimization and Troubleshooting
Chapter 16: Managing Jobs, Using Hue and Performing Routine Tasks
Chapter 17: Monitoring, Metrics and Hadoop Logging
Chapter 18: Tuning the Cluster Resources, Optimizing MapReduce Jobs and Benchmarking
Chapter 19: Configuring and Tuning Apache Spark on YARN
Chapter 20: Optimizing Spark Applications
Chapter 21: Troubleshooting Hadoop—A Sampler
Chapter 22: Installing VirtualBox and Linux and Cloning the Virtual Machines

Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS, 1st edition

Title overview

Table of contents

Need help?Get in touch

Shop

Digital learning

Subject catalogue

Get support

Support

Digital products

Resources by discipline

Get support

Support

Support

Subject catalogue

Shop for university

Digital learning

Get support

Resources by discipline

Digital learning platforms

Get support

Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS, 1st edition

Title overview

Table of contents

Need help?Get in touch

Shopping

Explore

Help & support

Pearson