ORPP logo
Image from Google Jackets

Professional Hadoop.

By: Contributor(s): Material type: TextTextPublisher: Newark : John Wiley & Sons, Incorporated, 2016Copyright date: ©2016Edition: 1st edDescription: 1 online resource (219 pages)Content type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9781119267188
Subject(s): Genre/Form: Additional physical formats: Print version:: Professional HadoopDDC classification:
  • 005.74
LOC classification:
  • QA76.9.D5 -- .P764 2016eb
Online resources:
Contents:
Cover -- Title Page -- Copyright -- Contents -- Introduction -- Chapter 1: Hadoop Introduction -- Business Analytics and Big Data -- The Components of Hadoop -- The Distributed File System (HDFS) -- What Is MapReduce? -- What Is YARN? -- What Is ZooKeeper? -- What Is Hive? -- Integration with Other Systems -- The Hadoop Ecosystem -- Data Integration and Hadoop -- Summary -- Chapter 2: Storage -- Basics of Hadoop HDFS -- Concept -- Architecture -- Interface -- Setting Up the HDFS Cluster in Distributed Mode -- Install -- Advanced Features of HDFS -- Snapshots -- Offline Viewer -- Tiered Storage -- Erasure Coding -- File Format -- Cloud Storage -- Summary -- Chapter 3: Computation -- Basics of Hadoop MapReduce -- Concept -- Architecture -- How to Launch a MapReduce Job -- Writing a Map Task -- Writing a Reduce Task -- Writing a MapReduce Job -- Configurations -- Advanced Features of MapReduce -- Distributed Cache -- Counter -- Job History Server -- The Difference from a Spark Job -- Summary -- Chapter 4: User Experience -- Apache Hive -- Hive Installation -- HiveQL -- UDF/SerDe -- Hive Tuning -- Apache Pig -- Pig Installation -- Pig Latin -- UDF -- Hue -- Features -- Apache Oozie -- Oozie Installation -- How Oozie Works -- Workflow/Coordinator -- Oozie CLI -- Summary -- Chapter 5: Integration with Other Systems -- Apache Sqoop -- How It Works -- Apache Flume -- How It works -- Apache Kafka -- How It Works -- Kafka Connect -- Stream Processing -- Apache Storm -- How It Works -- Trident -- Kafka Integration -- Summary -- Chapter 6: Hadoop Security -- Securing the Hadoop Cluster -- Perimeter Security -- Authentication Using Kerberos -- Service Level Authorization in Hadoop -- Impersonation -- Securing the HTTP Channel -- Securing Data -- Data Classification -- Bringing Data to the Cluster -- Protecting Data in the Cluster -- Securing Applications.
YARN Architecture -- Application Submission in YARN -- Summary -- Chapter 7: Ecosystem at Large: Hadoop with Apache Bigtop -- Basics Concepts -- Software Stacks -- Test Stacks -- Works on My Laptop -- Developing a Custom-Tailored Stack -- Apache Bigtop: The History -- Apache Bigtop: The Concept and Philosophy -- The Structure of the Project -- Meet the Build System -- Toolchain and Development Environment -- BOM Definition -- Deployment -- Bigtop Provisioner -- Master-less Puppet Deployment of a Cluster -- Configuration Management with Puppet -- Integration Validation -- iTests and Validation Applications -- Stack Integration Test Development -- Validating the Stack -- Cluster Failure Tests -- Smoke the Stack -- Putting It All Together -- Summary -- Chapter 8: In-Memory Computing in Hadoop Stack -- Introduction to In-Memory Computing -- Apache Ignite: Memory First -- System Architecture of Apache Ignite -- Data Grid -- A Discourse on High Availability -- Compute Grid -- Service Grid -- Memory Management -- Persistence Store -- Legacy Hadoop Acceleration with Ignite -- Benefits of In-Memory Storage -- Memory Filesystem: HDFS Caching -- In-Memory MapReduce -- Advanced Use of Apache Ignite -- Spark and Ignite -- Sharing the State -- In-Memory SQL on Hadoop -- SQL with Ignite -- Streaming with Apache Ignite -- Summary -- Glossary -- Index -- EULA.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
No physical items for this record

Cover -- Title Page -- Copyright -- Contents -- Introduction -- Chapter 1: Hadoop Introduction -- Business Analytics and Big Data -- The Components of Hadoop -- The Distributed File System (HDFS) -- What Is MapReduce? -- What Is YARN? -- What Is ZooKeeper? -- What Is Hive? -- Integration with Other Systems -- The Hadoop Ecosystem -- Data Integration and Hadoop -- Summary -- Chapter 2: Storage -- Basics of Hadoop HDFS -- Concept -- Architecture -- Interface -- Setting Up the HDFS Cluster in Distributed Mode -- Install -- Advanced Features of HDFS -- Snapshots -- Offline Viewer -- Tiered Storage -- Erasure Coding -- File Format -- Cloud Storage -- Summary -- Chapter 3: Computation -- Basics of Hadoop MapReduce -- Concept -- Architecture -- How to Launch a MapReduce Job -- Writing a Map Task -- Writing a Reduce Task -- Writing a MapReduce Job -- Configurations -- Advanced Features of MapReduce -- Distributed Cache -- Counter -- Job History Server -- The Difference from a Spark Job -- Summary -- Chapter 4: User Experience -- Apache Hive -- Hive Installation -- HiveQL -- UDF/SerDe -- Hive Tuning -- Apache Pig -- Pig Installation -- Pig Latin -- UDF -- Hue -- Features -- Apache Oozie -- Oozie Installation -- How Oozie Works -- Workflow/Coordinator -- Oozie CLI -- Summary -- Chapter 5: Integration with Other Systems -- Apache Sqoop -- How It Works -- Apache Flume -- How It works -- Apache Kafka -- How It Works -- Kafka Connect -- Stream Processing -- Apache Storm -- How It Works -- Trident -- Kafka Integration -- Summary -- Chapter 6: Hadoop Security -- Securing the Hadoop Cluster -- Perimeter Security -- Authentication Using Kerberos -- Service Level Authorization in Hadoop -- Impersonation -- Securing the HTTP Channel -- Securing Data -- Data Classification -- Bringing Data to the Cluster -- Protecting Data in the Cluster -- Securing Applications.

YARN Architecture -- Application Submission in YARN -- Summary -- Chapter 7: Ecosystem at Large: Hadoop with Apache Bigtop -- Basics Concepts -- Software Stacks -- Test Stacks -- Works on My Laptop -- Developing a Custom-Tailored Stack -- Apache Bigtop: The History -- Apache Bigtop: The Concept and Philosophy -- The Structure of the Project -- Meet the Build System -- Toolchain and Development Environment -- BOM Definition -- Deployment -- Bigtop Provisioner -- Master-less Puppet Deployment of a Cluster -- Configuration Management with Puppet -- Integration Validation -- iTests and Validation Applications -- Stack Integration Test Development -- Validating the Stack -- Cluster Failure Tests -- Smoke the Stack -- Putting It All Together -- Summary -- Chapter 8: In-Memory Computing in Hadoop Stack -- Introduction to In-Memory Computing -- Apache Ignite: Memory First -- System Architecture of Apache Ignite -- Data Grid -- A Discourse on High Availability -- Compute Grid -- Service Grid -- Memory Management -- Persistence Store -- Legacy Hadoop Acceleration with Ignite -- Benefits of In-Memory Storage -- Memory Filesystem: HDFS Caching -- In-Memory MapReduce -- Advanced Use of Apache Ignite -- Spark and Ignite -- Sharing the State -- In-Memory SQL on Hadoop -- SQL with Ignite -- Streaming with Apache Ignite -- Summary -- Glossary -- Index -- EULA.

Description based on publisher supplied metadata and other sources.

Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2024. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.

There are no comments on this title.

to post a comment.

© 2024 Resource Centre. All rights reserved.