#include <ngram.h>
Definition at line 90 of file ngram.h.
|
The constant required for absolute discounting Definition at line 160 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
Definition at line 130 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
The filaname of the arpa format LM Definition at line 187 of file ngram.h. Referenced by main(). |
|
The file of the arpa format LM |
|
The filaname of the bin format LM Definition at line 191 of file ngram.h. Referenced by main(), and write_bin_lm(). |
|
The file of the bin format LM Definition at line 192 of file ngram.h. Referenced by load_lm(), main(), and write_bin_lm(). |
|
Pointer to array of back-off weights Definition at line 117 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
Pointer to array of 4 byte back_off weights. Only one of these arrays will be allocated Definition at line 118 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
True if word with this id is a context cue Definition at line 167 of file ngram.h. Referenced by compute_perplexity(), compute_unigram(), display_stats(), load_lm(), main(), validate(), and write_bin_lm(). |
|
The filename of the context cues file Definition at line 178 of file ngram.h. Referenced by main(). |
|
The file pointer of the context cues file Definition at line 180 of file ngram.h. Referenced by main(). |
|
Pointer to array of count lists (actually indices in a count table) Definition at line 108 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Alternative method of storing the counts, using 4 bytes. Not normally allocated Definition at line 114 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Pointer to array of count tables Definition at line 136 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Have same size for each count table Definition at line 135 of file ngram.h. Referenced by increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Array of cutoffs Definition at line 149 of file ngram.h. Referenced by calc_mem_req(), load_lm(), main(), and write_bin_lm(). |
|
Pointer to array of discounting ranges - typically will be fof_size - 1, but can be reduced further if stats are anomolous Definition at line 153 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), display_stats(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
See define stuff at the top of this file Definition at line 147 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
0 if we have open vocab, 1 if we have a closed vocab. Definition at line 202 of file ngram.h. Referenced by compute_back_off(), compute_unigram(), display_stats(), increment_context(), load_lm(), main(), validate(), and write_bin_lm(). |
|
The sizes of the above arrays Definition at line 152 of file ngram.h. Referenced by display_stats(), load_lm(), main(), and write_bin_lm(). |
|
Definition at line 213 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), display_stats(), for(), load_lm(), main(), and write_bin_lm(). |
|
Definition at line 214 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Array of frequency of frequency information Definition at line 150 of file ngram.h. Referenced by compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
The discounted values of the counts Definition at line 157 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
The filename of the id-gram file Definition at line 175 of file ngram.h. Referenced by calc_mem_req(), and main(). |
|
The file pointer of the id-gram file Definition at line 176 of file ngram.h. Referenced by calc_mem_req(), and main(). |
|
Pointer to array of index lists Definition at line 121 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), increment_context(), load_lm(), main(), num_of_types(), and write_bin_lm(). |
|
The linear discounting ratio Definition at line 159 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
Array of marginal counts for the unigrams. The normal unigram counts differ in that context cues have zero counts there, but not here Definition at line 110 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
Ditto Definition at line 116 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), increment_context(), load_lm(), main(), and write_bin_lm(). |
|
The maximum alpha in the table Definition at line 127 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
The minimum alpha in the table Definition at line 126 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
Count to which infrequent unigrams will be bumped up Definition at line 171 of file ngram.h. Referenced by main(). |
|
n=3 for trigram, n=4 for 4-gram etc. Definition at line 94 of file ngram.h. Referenced by calc_mem_req(), compute_perplexity(), display_stats(), for(), load_lm(), main(), validate(), and write_bin_lm(). |
|
Total number of unigrams in the training data Definition at line 169 of file ngram.h. Referenced by compute_unigram(), load_lm(), main(), and write_bin_lm(). |
|
Number of context cues Definition at line 102 of file ngram.h. Referenced by compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
Array indicating how many 2-grams, ... ,n-grams, have been processed so far Definition at line 196 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), display_stats(), increment_context(), load_lm(), main(), num_of_types(), and write_bin_lm(). |
|
Definition at line 212 of file ngram.h. Referenced by compute_unigram(), display_stats(), load_lm(), main(), and write_bin_lm(). |
|
The maximum number of out of range alphas that we are going to allow. Definition at line 128 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
Pointer to the tables used for compact representation of the indices Definition at line 140 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), increment_context(), load_lm(), main(), num_of_types(), and write_bin_lm(). |
|
Pointer to array of pointer tables Definition at line 142 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), increment_context(), load_lm(), main(), num_of_types(), and write_bin_lm(). |
|
Definition at line 131 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), for(), load_lm(), main(), and write_bin_lm(). |
|
Pointer to table size array Definition at line 106 of file ngram.h. Referenced by calc_mem_req(), and main(). |
|
Log probs for each unigram Definition at line 166 of file ngram.h. Referenced by compute_unigram(), for(), load_lm(), main(), and write_bin_lm(). |
|
Probs for each unigram Definition at line 165 of file ngram.h. Referenced by bo_ng_prob(), compute_unigram(), for(), load_lm(), main(), and write_bin_lm(). |
|
Definition at line 95 of file ngram.h. Referenced by load_lm(), and write_bin_lm(). |
|
Array of vocabulary words Definition at line 101 of file ngram.h. Referenced by compute_back_off(), compute_perplexity(), display_stats(), for(), load_lm(), main(), and validate(). |
|
The filename of the vocabulary file Definition at line 177 of file ngram.h. Referenced by main(). |
|
Vocabulary hash table Definition at line 99 of file ngram.h. Referenced by compute_perplexity(), load_lm(), main(), validate(), and write_bin_lm(). |
|
Vocabulary size Definition at line 100 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), compute_perplexity(), compute_unigram(), display_stats(), increment_context(), load_lm(), main(), num_of_types(), validate(), and write_bin_lm(). |
|
see define stuff at the top Definition at line 200 of file ngram.h. Referenced by calc_prob_of(), compute_perplexity(), compute_unigram(), display_stats(), load_lm(), main(), validate(), and write_bin_lm(). |
|
Pointer to array of id lists Definition at line 107 of file ngram.h. Referenced by bo_ng_prob(), compute_back_off(), load_lm(), main(), and write_bin_lm(). |
|
True if the language model is to be written out in arpa format Definition at line 185 of file ngram.h. Referenced by main(). |
|
True if the language model is to be written out in binary format Definition at line 189 of file ngram.h. Referenced by main(). |
|
cap on prob(zeroton) as fraction of P(singleton) Definition at line 210 of file ngram.h. Referenced by compute_unigram(), load_lm(), main(), and write_bin_lm(). |