TY  - BOOK
AU  - WHITE, TOM
TI  - HADOOP: THE DEFINITIVE GUIDE
SN  - 978-93-5023-756-4
U1  - 005.74/WHI
PY  - 2013///
CY  - MUMBAI
PB  - SPD
KW  - HADOOP
KW  - DATA STORAGE
KW  - DATA ANALYSIS
KW  - MAP REDUCE APPLICATION
KW  - INTERNET
N1  - DATABASE/RACK
N2  - Store large datasets with the Hadoop Distributed File System (HDFS). Run distributed computations with MapReduce. Use Hadoop’s data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence. Discover common pitfalls and advanced features for writing real-world MapReduce programs. Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud. Load data from relational databases into HDFS using Sqoop. Perform large-scale data processing with the Pig query language. Analyze datasets with Hive, Hadoop’s data warehousing system. Take advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems.
ER  - 