This application builds an index for a collection of documents.
To use it, follow the general steps of running a lemur application.
The parameters are:
index
: name of the index table-of-content file without the .ifp extension. indexType
: the type of index, key (KeyfileIncIndex), inv (Inv(FP)Index), indri (IndriIndex) memory
: memory (in bytes) of Inv(FP)PushIndex (def = 96000000). position
: store position information (def = 1), applicable only for inv indexes. stopwords
: name of file containing the stopword list. acronyms
: name of file containing the acronym list. countStopWords
: If true, count stopwords in document length. docFormat
: stemmer
: KstemmerDir
: Path to directory of data files used by Krovetz's stemmer. arabicStemDir
: Path to directory of data files used by the Arabic stemmers. arabicStemFunc
: Which stemming algorithm to apply, one of: dataFiles
: name of file containing list of datafiles to index.