Computational Linguistics/NLP links
Paper Archives and Logs
Overview sites
- NLP
- Information Retrieval
- Latent Semantic Analysis
- Text Summarization
- Perl for Linguistics
Terminology
Tools
- List of tools for dictionary research and development
- Simple Concordance Program: creates word lists and searches natural language text files for words, phrases, and patterns. Includes built-in alphabets for English, French, German, Greek, Russian, and other languages.
- TouchGraph: browse CiteSeer by exploring links between related articles
- VisualText: an Integrated Development Environment for deep text analysis applications. Think of it as Visual C++ for Natural Language Processing applications.
- NLTK: Natural Language Toolkit
- CLUTO: a clustering package for high-dimensional datasets.
- Minipar: a dependency parser for English
- FreeLing: library providing language analysis services (such as morphological analysis, date recognition, PoS tagging, etc.)
- Agrep: a tool for quickly searching one or more files for a string or regular expression, with approximate matching capabilities and user-definable records
- SVMTool: a simple and effective part-of-speech tagger based on Support Vector Machines
- WordSmith: lexical analysis software for the PC
- Simple text tools: text statistics, lemmatization, and concordances
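
Several of the tools above build word lists and concordances. As a rough illustration of what those two operations do, here is a minimal Python sketch. It is not taken from any of the listed programs: the function names (tokenize, word_list, concordance), the ASCII-only tokenizer, and the sample sentence are all assumptions made for this example.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (ASCII letters and apostrophes only)."""
    return re.findall(r"[a-z']+", text.lower())


def word_list(text):
    """Return (word, count) pairs, most frequent first."""
    return Counter(tokenize(text)).most_common()


def concordance(text, keyword, width=2):
    """Return each occurrence of keyword with up to `width` tokens of context on each side."""
    tokens = tokenize(text)
    target = keyword.lower()
    lines = []
    for i, tok in enumerate(tokens):
        if tok == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


# Hypothetical sample text, used only to exercise the functions.
sample = "The cat sat on the mat. The dog saw the cat."

if __name__ == "__main__":
    for word, count in word_list(sample):
        print(f"{count:3d} {word}")
    for line in concordance(sample, "cat"):
        print(line)
```

Real concordancers handle alphabets beyond ASCII, phrase and pattern queries, and large files; this sketch only shows the basic keyword-in-context idea.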
Resources
Bioinformatics
Information Retrieval
Document Clustering
Finite State Machines
Corpora
Other Languages
Slavic Linguistics
People
Mail Lists
Plagiarism detection
Random stuff
Finally... Some FUN!!!