Title : | Hadoop : the definitive guide | Material Type: | printed text | Authors: | Tom White, Author | Edition statement: | 3rd edition | Publisher: | Beijing : O'Reilly | Publication Date: | 2012 | Pagination: | xxiii, 657 p. | Layout: | ill. | Size: | 24 cm | ISBN (or other code): | 978-1-449-31152-0 | General note: | Includes index | Languages : | English (eng) Original Language : English (eng) | Descriptors: | Apache Hadoop File organization (Computer science)
| Class number: | 005.74 | Abstract: | "Hadoop: The Definitive Guide"- provides a comprehensive and detailed guide to the Hadoop ecosystem. The first three chapters provide an overview and history of the Hadoop project and introduce the two primary components; HDFS and Map Reduce. The following chapters focus in great depth on the architecture of HDFS and Map Reduce. These chapters build upon the introductory chapters and dive deeper into topics such as architecture, availability, compression, file systems, building Map Reduce applications, jobs and tasks. The core components of Hadoop are HDFS and Map Reduce and and they are covered by the author in a progressive and digestible format. Each chapter provides in addition to description, technical details and recommendations, working examples that build chapter upon chapter to walk you through simple illustrative examples of each of the concepts. The examples are well presented and easy to understand and consistently use the data and use cases from previous chapters so that the reader does not need to comprehend a new use case for each example distracting focus from the example’s message. There are two chapters devoted to configuring and operating a Hadoop cluster and these are augmented by the three appendices that cover installation and prepping for the example code. Working through the example set up in combination with these chapters should prepare the reader for their own Hadoop implementation. Pig, HBase and ZooKeeper are addressed in later chapters but are not covered in the same depth as HDFS and Map Reduce. Each of these additional tools deserves its own reference and there are many available. If you have covered the previous chapters the author’s introduction and examples to these three tools will get you started. | Contents note: | Meet Hadoop; MapReduce; The Hadoop distributed filesystem; Hadoop I/O; Developing a MapReduce application; How MapReduce works; MapReduce types and formats; MapReduce features; Setting up a Hadoop cluster; Administering Hadoop; Pig; Hive; HBase; ZooKepper; Sqoop; Case studies; Installing Apache Hadoop; Cloudera's distribution including Apache Hadoop; Preparing the NCDC weather data; | Record link: | https://library.seeu.edu.mk/index.php?lvl=notice_display&id=17951 |
|  |