Text Processing in Python describes techniques for manipulation of text using the Python programming language. At the broadest level, text processing is simply taking textual information and doing something with it. This might be restructuring or reformatting it, extracting smaller bits of information from it, or performing calculations that depend on the text. Text processing is arguably what most programmers spend most of their time doing. Because Python is clear, expressive, and object-oriented it is a perfect language for doing text processing, even better than Perl. As the amount of data everywhere continues to increase, this is more and more of a challenge for programmers. This book is not a tutorial on Python. It has two other goals: helping the programmer get the job done pragmatically and efficiently; and giving the reader an understanding - both theoretically and conceptually - of why what works works and what doesn't work doesn't work. Mertz provides practical pointers and tips that emphasize efficent, flexible, and maintainable approaches to the textprocessing tasks that working programmers face daily.
From the Back Cover:
Text Processing in Python is an example-driven, hands-on tutorial that carefully teaches programmers how to accomplish numerous text processing tasks using the Python language. Filled with concrete examples, this book provides efficient and effective solutions to specific text processing problems and practical strategies for dealing with all types of text processing challenges.
Text Processing in Python begins with an introduction to text processing and contains a quick Python tutorial to get you up to speed. It then delves into essential text processing subject areas, including string operations, regular expressions, parsers and state machines, and Internet tools and techniques. Appendixes cover such important topics as data compression and Unicode. A comprehensive index and plentiful cross-referencing offer easy access to available information. In addition, exercises throughout the book provide readers with further opportunity to hone their skills either on their own or in the classroom. A companion Web site (http://gnosis.cx/TPiP) contains source code and examples from the book.
Here is some of what you will find in thie book:
* When do I use formal parsers to process structured and semi-structured data? Page 257
* How do I work with full text indexing? Page 199
* What patterns in text can be expressed using regular expressions? Page 204
* How do I find a URL or an email address in text? Page 228
* How do I process a report with a concrete state machine? Page 274
* How do I parse, create, and manipulate internet formats? Page 345
* How do I handle lossless and lossy compression? Page 454
* How do I find codepoints in Unicode? Page 465
Photoshop CS4平面广告设计经典108例-含2DVD 内容简介 本书是一本Photoshop CS4的平面设计案例教程,也是一本Photoshop商业...
《软件产品质量要求和测试细则》内容简介:计算机软件是计算机应用的核心,其质量的好坏关系到计算机应用系统的成败,软件测评是提
《中国当代经典电影赏析》内容简介:本书是南京大学国际化合作项目“一带一路国家中国文化教学合作研究”的成果,主要针对中高级汉
《移动互联网时代文化产业商业模式》内容简介:这是一本面向大众,将国家社科基金重大项目"我国文化产业发展战略研究”之子课题"中
《中信国学大典:礼记·孝经》内容简介:中信国学大典(50册)是中信出版社引进自香港中华书局的一套深具国际视野、贴近当代社会的
《全国会计从业资格考试标准化教材(大纲解析版):财经法规与会计职业道德》内容简介:全国会计从业资格考试标准化教材(大纲解析
繁琐的CSS规则和各浏览器支持度的不统一素来都为网页设计师所诟病。通过多年潜心研究与深入测试,作者归纳组合出了多达350条的设
《哈佛凌晨四点半大全集》内容简介:哈佛大学的老师经常给学生这样的告诫:如果你想在进入社会后,在任何时候、任何场合下都能得心
《全彩中国史(下册)》内容简介:《中国史(全彩)(图文全解)(套装上下册)》脉络清晰,以历史批判精神,描绘和评述了从盘古开天辟地
《2015年职称英语考试历年真题详解及全真模拟试卷A级(理工类)》内容简介:人事部组织的全国统一标准的职称英语考试划分为综合、理
《背包为家》内容简介:本书是儿童文学著名作家张国龙新近创作的一本随笔美文集,共分为背包回家、遥想他乡、天涯情旅三卷,其中所
《随波逐源》内容简介:本书是作者在追寻生命和物质本源的基础上,探寻人生意义过程中的一些认识及体会,书中汇集了作者对于人生观
Everyoneknowsthatapicturetellsathousandwords.Butwhatabouttheelementsthatmakeupap...
Webusersaregettingtiredofthetraditionalwebexperience.Theygetfrustratedlosingthei...
智能手机故障检测与维修实战技巧全图解 本书特色本书以手机维修现场检测实操及图解的方式,系统地讲解了维修工具的使用方法、手机电路图读图技能、手机元器件故障检修的检...
本书是作者在美国、瑞士的ArtCebter研读设计,以及长期在美国和中国大陆从事产品设计工作的经验总结。书籍内容丰富,图文并茂,信
《刑法案例研习教程(第二版)》内容简介:本书由韩玉胜主编,每章都有详细分析的案例若干,然后提供了若干个没有提供分析论证的探
《读文心雕龙手记》内容简介:本书收入十三篇札记和两篇附录,写于不同时期,是作者二十多年精研《文心雕龙》的会心之作。《文心雕
BookDescriptionReusablecomponentsandpatternsforAjax-drivenapplicationsAjaxisoneo...
《Excel数据分析可视化必备技能案例视频精讲》内容简介:数据分析的目的是快速发现问题,进而分析问题并解决问题。因此,通过制作可