The indri applications, BuildIndriIndex, IndriDaemon, and IndriRunQuery accept parameters from either the command line or from a file. The parameter file uses an XML format. The command line uses dotted path notation. The top level element in the parameters file is named parameters.
-memory=100M
on the command line. -index=/path/to/repository
on the command line.
-corpus.path=/path/to/file_or_directory
on the command line. -corpus.class=trecweb
on the command line. The known classes are: field
entry specifying the metadata fields to index, eg DOCNO. Specified as <metadata><field>fieldname</field></metadata> in the parameter file and as metadata.field=fieldname
on the command line. -field.name=fieldname
on the command line. true
if the field contains numeric data, otherwise the symbol false
, specified as <field><numeric>true</numeric></field> in the parameter file and as -field.numeric=true
on the command line. This is an optional parameter, defaulting to false. Note that 0
can be used for false and 1
can be used for true.
-stemmer.name=stemmername
on the command line. This is an optional parameter with the default of no stemming. -stopper.word=stopword
on the command line. This is an optional parameter with the default of no stopping.
-memory=100M
on the command line. -index=/path/to/repository
on the command line. This element can be specified multiple times to combine Repositories. -server=hostname
on the command line. The hostname can include an optional port number to connect to, using the form hostname:portnum
. This element can be specified multiple times to combine servers. -count=number
on the command line.
( key ":" value ) [ "," key ":" value ]*
Here's an example rule in command line format:
-rule=method:linear,collectionLambda:0.2,field:title
and in parameter file format:
<rule>method:linear,collectionLambda:0.2,field:title</rule>
This corresponds to Jelinek-Mercer smoothing with background lambda equal to 0.2, only for items in a title field.
If nothing is listed for a key, all values are assumed. So, a rule that does not specify a field matches all fields. This makes -rule=method:linear,collectionLambda:0.2
a valid rule.
Valid keys:
Valid methods:
-stopper.word=stopword
on the command line. This is an optional parameter with the default of no stopping.
-queryOffset=number
on the command line. -runID=someID
on the command line. true
to produce TREC scorable output, otherwise the symbol false
. Specified as <trecFormat>true</trecFormat> in the parameter file and as -trecFormat=true
on the command line. Note that 0
can be used for false, and 1
can be used for true.
-fbDocs=number
on the command line. -fbTerms=number
on the command line. -fbMu=number
on the command line. -fbOrigWeight=number
on the command line.
-memory=100M
on the command line. -index=/path/to/repository
on the command line. -port=number
on the command line.