Python NLP Tools
2016-05-23 10:57
399 查看
A collection of important python tools regarding natural language processing
This is a part of Python Knowledge and Resources
List
editrevisions
NLTK
editrevisions
NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to lexical
resources such as WordNet.It also has text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning
Website:
http://www.nltk.org/
Installation:
Install NLTK: run sudo pip install -U nltk
Install Numpy (optional): run sudo pip install -U numpy
Test installation: run python then type import nltk
editrevisions
Pattern
editrevisions
Pattern has tools for natural language processing like part-of-speech taggers, n-gram search, sentiment analysis, WordNet.It supports machine learning vector space model, clustering, SVM.
Website:
https://github.com/clips/pattern
Installation:
pip install pattern
editrevisions
TextBlob
editrevisions
TextBlob is a Python library for processing textual data. It provides a simple API for diving into common natural language processing tasks such as part-of-speech tagging, noun phrase extraction, sentiment
analysis, classification, translation, and more.
Website:
http://textblob.readthedocs.org/en/dev/
Installation:
pip install -U textblob
editrevisions
Gensim
editrevisions
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. It can process input larger
than RAM. According to the author it is "the most robust, efficient and hassle-free piece of software to realize unsupervised semantic modelling from plain text
Website:
https://github.com/piskvorky/gensim
Installation
pip install -U gensim
editrevisions
PyNLPl
editrevisions
Python Natural Language Processing Library(pronounced as: pineapple) is a Python library for Natural Language Processing. It is a collection
of various independent or loosely interdependent modules useful for common, and less common, NLP tasks. PyNLPl can be used for example the computation of n-grams, frequency lists and distributions, language models. There are also more complex data types, such
as Priority Queues, and search algorithms, such as Beam Search
Installation:
#Linux
sudo apt-get install pymol
# Fedora
yum install pymol
editrevisions
spaCy
editrevisions
It’s commercial open source software. Industrial strength NLP with Python and Cython. Its a pipeline for fast, state-of-the-art natural language processing.
Website:
https://spacy.io
Installation
pip install spacy
editrevisions
Polyglot
editrevisions
Polyglot is a natural language pipeline that supports massive multilingual applications. It supports tokenization for 165 languages, Language detection for 196 languages, Named Entity Recognition for 40
languages, Part of Speech Tagging for 16 languages, Sentiment Analysis for 136 languages, Word Embeddings for 137 languages, Morphological analysis for 135 languages, Transliteration for 69 languages
Website:
https://pypi.python.org/pypi/polyglot
Installation
pip install polyglot
editrevisions
MontyLingua
editrevisions
MontyLingua is a free, commonsense-enriched, end-to-end natural language understander for English. Feed raw English text into MontyLingua, and the output will be a semantic interpretation of that
text. Perfect for information retrieval and extraction, request processing, and question answering. From English sentences, it extracts subject/verb/object tuples, extracts adjectives, noun phrases and verb phrases, and extracts people's names, places, events,
dates and times, and other semantic information
Website: http://web.media.mit.edu/~hugo/montylingua/
editrevisions
BLLIP Parser
editrevisions
BLLIP Parser (also known as the Charniak-Johnson parser) is a statistical natural language parser including a generative constituent parser and discriminative maximum entropy reranker. It includes command-line andPython interfaces.
editrevisions
Quepy
editrevisions
Quepy is a python framework to transform natural language questions to queries in a database query language. It can be easily customized to different kinds of questions in natural language and database queries. So, with little coding you can build your own
system for natural language access to your database.
Website
https://github.com/machinalis/quepy
http://quepy.machinalis.com/
相关文章推荐
- PYTHON功能学习之邮件发送
- 【python】Python框架、库和软件资源大全
- 安装python package的经验和常用命令
- PYTHON文件操作和列表的使用
- python实现从FTP下载文件通过多线程同时分发到多台机器
- python下编译py成pyc和pyo
- Python学习笔记只列表和元组
- 『Python学习』xlrd库的使用
- python 生成html文件
- python 类 属性get set
- python多线程监控指定目录
- Python 中的异常种类
- python正则表达式
- DAY3 python群发短信
- Python NTLK资料
- Python闭包需要注意的问题
- ESL python调用C模块时传递unicode字符串报错问题解决
- ESL python调用C模块时传递unicode字符串报错问题解决
- 用python开发股票自动技术分析的软件
- vs2013下的python远程调试支持(2) :工程调试