Practical Text Mining with Perl (Wiley Series on Methods and Applications in Data Mining)

Author:
Year:
Pages: 296
ISBN: 0470176431
Publisher: Focal Press
Category: Miscellaneous

Contents:

The Adventures of Tom Sawyer, Adventures of Huckleberry Finn, The Deerslayer, The Last of the Mohicans, The Headless Horseman, Life Among the Indians, Osceola the Seminole

Description:

Provides readers with the methods, algorithms, and means to perform text mining tasks.

This book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet (www.perl.org). It covers mining ideas from several perspectives (statistics, data mining, linguistics, and information retrieval) and provides readers with the means to successfully complete text mining tasks on their own.

The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools for analyzing text. It then builds upon this foundation to explore: probability and texts, including the bag-of-words model; information retrieval techniques such as the TF-IDF similarity measure; concordance lines and corpus linguistics; multivariate techniques such as correlation, principal components analysis, and clustering; Perl modules,...

Similar books

Perl. Изучаем глубже (Russian edition of Intermediate Perl)
Author: Randal L. Schwartz, brian d foy, and Tom Phoenix
Year: 2008
Higher-Order Perl: Transforming Programs with Programs
Author: Mark Jason Dominus
Year: 2005
Perl Programming for Biologists
Author: D. Curtis Jamison
Year: 2003
Special Edition Using PERL 5 for Web Programming
Author: Harlan D., Doyle P., Powers Sh.
Year: 1996
Pro Perl
Author: Wainwright P.
Year: 2005
Minimal Perl: For UNIX and Linux People
Author: Tim Maher
Year: 2006