Stupid Text Tricks
- Coarse IR, Clustering
- Don’t need dimension reduction (except stopwords)
- Don’t need morphological analysis
- Don’t need word sense disambiguation
- Partial parsing:
- Simple, greedy transformation rules
- Cascading finite state machines
- Categorization