Skip to content

Categories:

Pattern: A Bundle of Data Mining Modules for Python

Pattern is a collection of open source (BSD license) web mining modules for Python from the Computational Linguistics and Psycholinguistics Research Center. It contains tools for data retrieval, text analysis and data visualization and comes with over 30 sample scripts.

Sponsor

Pattern schema illustration

Pattern consists of six main modules:

pattern.web: A toolkit that includes APIs for various Web services, including Google, Gmail, Bing, Twitter Wikipedia and Flickr. It has its own HTML parser and Web spider.
pattern.table: A module for working with tabular data, used for storing data from the pattern.web module.
pattern.en: A natural language processing toolkit for English.pattern.search: A module containing a search algorithm.
pattern.vector: A module containing various tools for analyzing the text of a document.
pattern.graph: A module for data visualization using Canvas.

Discuss


Posted in Uncategorized.


0 Responses

Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.



Some HTML is OK

or, reply to this post via trackback.