Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
《SQL基础教程(第3版)》是一本SQL的入门书,介绍如何使用最常用的SQL语言维护和查询数据库信息。书中介绍了各种DBMS,关系模型理
《巅峰决战》内容简介:本书介绍超级计算机可以算天、算地、算人。我们使用超级计算机给大地做CT,寻找石油。分析人的基因,解读生
《iOS编程》荣获Jolt生产力大奖。第4版更新了iOS7和Xcode5的内容。全书涵盖了开发iOS应用的方方面面。从Objective-C基础知识到新
在线阅读本书TheEMAlgorithmandExtensionsremainstheonlysinglesourcetoofferacompleteandun...
《空间信息网络传输协议》内容简介:本书系统、全面地介绍了空间信息网络的特点及其对传输协议造成的影响;重点介绍和讨论了空间信
《室内分布系统规划与设计:GSM/TD-SCDMA/TD-LTE/WLAN》介绍了GSM/TD—SCDMA/WLAN/TD—LTE四网融合室内分布系统的基本原理...
《ASP.NET2.0技术内幕》围绕着ASP.NET2.0是Web开发的重要分水岭这一主题,采用自顶向下的方式介绍ASP.NET2.0的最新编程实践,从更
《软件之美》内容简介:行走在红尘里,每个人都会遇见暴风骤雨和诗情画意。“忧者见之而忧,喜者见之而喜”。一路上,我们会听见花
Objective-C是创建MacOSX应用和iPhone应用的主要语言,优雅的面向对象编程环境与快速而普及的C语言珠联璧合,造就了它的不俗表现
《移动通信(第2版影印版)》是移动通信领域的导论,主要讨论数字数据传输。适用于选修计算机网络或通信课程的电子工程或计算机专业
数据通信设备中心液体冷却指南A105 内容简介 该书共6章,分别从数据中心设施冷却装置、管路系统、数据通信设备液冷方法、冷水系统基本要求及冷液基础设施对工艺冷却...
《神好多的日本》内容简介:★说文解字,日本“八百万”神明,一目了然。神社神明一一对应,日本神社观光不再走马观花。★视角新颖
《深入实践DDD》内容简介:本书是拥有二十年商业软件开发经验及十年技术管理经验的资深技术专家呕心沥血之作,也是目前市场上少有的
《中国本草图谱》内容简介:《食物本草》可以说是明代食药养生的集大成者,是我国现存内容很丰富、很全面的食药疗法专著。全书共有
Thismust-readtextforallwebdesignersdeliversvitalinformationonhowtoemployinformat...
假如,给你一间老房子,你要用它做什么?咖啡馆、民宿、饮食空间、小酒馆……在本书中,你或许可以寻找到答案。30个台湾老屋的再
《刑法案例研习教程(第二版)》内容简介:本书由韩玉胜主编,每章都有详细分析的案例若干,然后提供了若干个没有提供分析论证的探
微信公众平台应用开发方法.技巧与案例 本书特色 本书是目前微信公众平台应用开发领域内容*全面、系统和深入的一本书,也是技术版本*新的。由著名的资深微信公众平台应...
《聚势》内容简介:本书首先从理论上分析移动互联网时代的渠道发展趋势,提出渠道运营管理“442”模型,解析通信业渠道发展历史和发
全国计算机等级考试二级教程.C语言程序设计:2010年版 内容简介 本书根据教育部考试中心制定的《全国计算机等级考试二级c语言程序设计考试大纲(2007年版)》...