Resources
This page contains some resources potentially useful to text-learning group members.
- Einat Amitay's Web IR and IE resource page
- Rainbow
Fast C
building bag-of-words representations, learning models, and
classifying documents - Andrew McCallum
Rainbow Documentation
Local executables:
- Source: /afs/cs/project/theo-9/webkb/mccallum/src/bow
- Linux binaries: /afs/cs/project/theo-9/webkb/mccallum/src/bow-linux
- SUNOS binaries: /afs/cs/project/theo-9/webkb/mccallum/src/bow-sunos
- Wordnet
Wordnet
home page. This is installed on CS machines - callable as
wn - there are manpages installed too.
- Link Grammar
Link
Grammar is a simple grammar and parser developed at CMU, with a
large coverage
vocabulary, and robustness to repetition (but not omission). Dayne is
working with the link grammar parser.
- Language Modeling Resources
- Data Archive Archive of text data sets
- LibParse Dayne's library of text-parsing functions, including html parsing, as well as implementation of perl-like regular expressions in LISP
-
The old resources page (contains useful things!)
Rosie Jones (rosie@cs.cmu.edu)
Last modified: Sun Jan 23 14:29:30 EST 2000