Learning Hadoop 2 introduces the world of building data processing applications on the wide variety of tools supported by the platform. Starting from the core components of the framework – HDFS and YARN – this book will guide you in building analytics and data processing applications using a variety of approaches.
This book is aimed at system and application developers interested in learning to solve practical problems using the Hadoop framework and related components. Prerequisites are familiarity with Unix / Linux command line interface and experience with the Java programming language. Familiarity with Hadoop 1 is a plus.
Each chapter illustrates a key component of Hadoop 2 with a hands on approach complete with use cases and best practices. Each topic is illustrated in the context of data analysis or a processing application built around a dataset generated from Twitter’s message stream.