"Mining the Web: Discovering Knowledge from Hypertext Data" is the first book devoted entirely to techniques for producing knowledge from the vast body of unstructured Web data. Building on an initial survey of infrastructural issues - including Web crawling and indexing - Chakrabarti examines low-level machine learning techniques as they relate specifically to the challenges of Web mining. He then devotes the final part of the book to applications that unite infrastructure and analysis to bring machine learning to bear on systematically acquired and stored data. Here the focus is on results: the strengths and weaknesses of these applications, along with their potential as foundations for further progress. From Chakrabarti's work-painstaking, critical, and forward-looking-readers will gain the theoretical and practical understanding they need to contribute to the Web mining effort. Features include: a comprehensive, critical exploration of statistics-based attempts to make sense of Web Mining; details the special challenges associated with analyzing unstructured and semi-structured data; looks at how classical Information Retrieval techniques have been modified for use with Web data; focuses on today's dominant learning methods: clustering and classification, hyperlink analysis, and supervised and semi-supervised learning; analyzes current applications for resource discovery and social network analysis; and, an excellent way to introduce students to especially vital applications of data mining and machine learning technology.
《中国佛教信仰与生活史》内容简介:本书从制度史、社会史、文化史的视角,以忏法、素食、慈善、讲经、放生、菩萨信仰、名山信仰等
我的Photoshop学习手记 内容简介 本书以photoshopcs5为技术平台,精心设计了80多个案例,内容涵盖鼠绘、色彩调整、图像合成、特效制作、创意、抠...
《ExtJS源码分析与开发实例宝典》从ExtJS实现的基本功能开始讲解,从两个方面对ExtJS进行整体上的概述,让读者从宏观上去把握Ext
机器视觉技术及应用实例详解 本书特色 陈兵旗撰写的《机器视觉技术及应用实例详解》一书具有如下特点:1、通过大量的典型案例对机器视觉技术的关键点和应用方法进行了详...
TheResourceDescriptionFramework(RDF)isastructurefordescribingandinterchangingmet...
《幼儿心理学》内容简介:本书分为12章,包括绪论、幼儿心理发展概述、幼儿注意的发展、幼儿感觉和知觉的发展、幼儿记忆的发展、幼
《把孩子交给爸爸》内容简介:在当下家庭教育中,普遍存在父亲教育缺失或不足的现象,本书作者作为一个相当称职的爸爸,给千万个家
Despitethehugenumberofmobiledevicesandappsinusetoday,yourbusinessstillneedsawebs...
本書は、オープンソースのツールキット「Arduino」を使った新しいものづくりの実践を目的とした書籍です。その中心は「距離を測る
Fanswillgetbentoutofshapeiftheymissthefirstbooktocovercircuit-bending-bending,fo...
虚拟现实交互设计 本书特色 本书从虚拟现实的基础理论出发,内容涵盖设计艺术学领域多个专业的知识,通过具体的原创设计案例分析,在3ds max和vrp平台,虚拟现...
RubyonRailsisthesuper-productivenewwaytodevelopfull-featuredwebapplications.With...
SoyouknowHTML,evenJavaScript,buttheideaoflearninganactualprogramminglanguagelike...
LATEX 2e完全学习手册-(第二版)-附光盘1张 本书特色 latex2e,简称latex,是一种专业的高品质文稿排版系统,目前已成为国际学术出版界广泛使用...
“不作恶”的Google何以身陷“三重门”?聪明的Google为何在中国变得不够聪明?强大的Google在与百度的交锋中为何会完败?作者简
《南京传》内容简介:春归秣陵树,人老建康城。作为一位公认的文章大家,叶兆言对他写了四十年的南京有着独特理解。南京为他提供了
《GAE编程指南》是一种云计算服务,跟其他的同类产品不同,它提供了一种简单的应用程序构建模型,通过这种模型,你可以轻松地构建
作为一位平面设计师,为什么一定要依赖那些已有的字体、用别人的图形——如果你能够创造自己的标志、字体和书写的话。莱斯利·凯
Wouldyoulikeanoverviewofthestateoftheartinwebdesigninaspecificfield?WEBDESIGNIND...
CCNA学习指南 本书特色 本学习指南帮你准备*新的ccna考试:cisco网络权威todd lammle编写的这本*畅销的学习指南能帮助你仔细的准备,信心十足...