Apache Hadoop is ideal for organizations with a growing need to store and process massive application datasets. Hadoop: The Definitive Guide is a comprehensive resource for using Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters. The book includes case studies that illustrate how Hadoop solves specific problems.
Organizations large and small are adopting Apache Hadoop to deal with huge application datasets. Hadoop: The Definitive Guide provides you with the key for unlocking the wealth this data holds. Hadoop is ideal for storing and processing massive amounts of data, but until now, information on this open-source project has been lacking -- especially with regard to best practices. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems. Programmers will find details for analyzing large datasets with Hadoop, and administrators will learn how to set up and run Hadoop clusters.
With case studies that illustrate how Hadoop solves specific problems, this book helps you:
* Learn the Hadoop Distributed File System (HDFS), including ways to use its many APIs to transfer data
* Write distributed computations with MapReduce, Hadoop's most vital component
* Become familiar with Hadoop's data and IO building blocks for compression, data integrity, serialization, and persistence
* Learn the common pitfalls and advanced features for writing real-world MapReduce programs
* Design, build, and administer a dedicated Hadoop cluster
* Use HBase, Hadoop's database for structured and semi-structured data
And more. Hadoop: The Definitive Guide is still in progress, but you can get started on this technology with the Rough Cuts edition, which lets you read the book online or download it in PDF format as the manuscript evolves.
Describesbasicprogrammingprinciplesandtheirstep-by-stepapplications.Numerousexam...
《王维诗集》内容简介:王维是盛唐时期的著名诗人,苏轼赞他“味摩诘之诗,诗中有画;观摩诘之画,画中有诗”,尤以山水诗成就为最
《简笔画5000例,一本就够(男生卷)》内容简介:简笔画几乎是每个人孩提时代绘画生涯的开始。简单的线条,可爱的形状,总能释放你
网络思想政治教育心理研究 内容简介 网络思想政治教育心理研究是思想政治教育心理学研究的重要内容,旨在通过网络时代思想政治教育心理方面有关问题的深入探讨,拓展思想...
本书提供了在C编程语言中进行安全编码的指导方针,描述了C语言程序中导致软件潜在风险根源的编码错误,并根据严重性、被利用的可
知名餐桌造型师、《爱就是在一起,吃好多好多顿饭》作者曾焱冰翻译推荐,餐桌美学经典之作。内容简介:◆餐桌布置一直都是社交中
五笔字型五笔数码编码大全 内容简介 这是一本集五笔字型——86版、98版;五笔数码——数字王码6键、数字王码9键为一体的五笔字型实用工具书。读者可以快速方便地查...
大数据猩球-海量数据处理实践指南 本书特色 本书以实用的、可操作的视角解释了大数据——采用黑猩猩和大象的隐喻,基于棒球统计数据集,使用a...
《聂卫平围棋习题精解·手筋专项训练(从3段到5段)》内容简介:本书是我国围棋职业运动员聂卫平同聂卫平围棋道场的明星教师团队联
《儿童歌曲器乐演奏启蒙——架子鼓》内容简介:《儿童歌曲器乐演奏启蒙》是一套适于乐器初学者使用的简易曲集,包含二胡、古筝、琵
空间碎片的危害正受到科学、商业等领域空间用户越来越广泛的关注。《空间碎片--模型与风险分析(精)》(作者克林克瑞德)是一部空间
《中文版3ds Max 2013实例教程(全彩超值版)》内容简介:这是一本全面介绍中文版3ds Max 2013各项功能的书。《中文版3ds Max 2013
《RocketMQ技术内幕》内容简介:这是一本指导读者如何在实践中让RocketMQ实现低延迟、高并发、高可用、高可靠的著作。作者是Rocket
"EthnographyandVirtualWorlds"istheonlybookofitskind-aconcise,comprehensive,andpr...
《跟动物交换身体2》内容简介:★畅销书《跟动物交换身体》第二弹重磅上市!魔性画风+专业知识+奇趣解读,以独特的视角直观解读动物
计算机:一部历史 本书特色 《计算机——一部历史》(彼得·本特利著), 给大众读者写的计算机科普读物,零门槛入门计算机 科学。讲述计算机背后鲜为人知的故事,普及...
《JVM G1源码分析和调优》内容简介:G1是目前最成熟的垃圾回收器,已经广泛应用在众多公司的生产环境中。我们知道,CMS作为使用最为
《小创客学光环板》内容简介:本书主要介绍利用小巧的光环板及功能强大的慧编程平台实现智能可穿戴设备作品的设计与创作。在内容上
Inover40yearsatBraun,DieterRamsestablishedhimselfasoneofthemostinfluentialdesign...
《河北上市公司财务发展报告(2016)》内容简介:本书以河北上市公司作为具体研究对象,从公司筹资、投资、资金运营、业绩及履行社