Open source Python modules, linguistic data and documentation for research and development in natural language processing, supporting dozens of NLP tasks, with distributions for Windows, Mac OSX and Linux. As of version 3.5, python 2.7 is no longer supported and python3 is now required. NLTK comes with many corpora, toy grammars, trained models, etc. A complete list is posted at: http://nltk.org/nltk_data/. To retrieve all the data, use "python3 -m nltk.downloader all". To ensure system wideinstallation, you can run the command "python3 -m nltk.downloader -d /usr/share/nltk_data all" as root. Note that the 'regex' package, also available on SBo, is required to run this command.