HomeFeedback
GVSM
Sinopsis
Generalized Vector Space Model (GVSM).
Description
Usage
The best way to start is of course copying the Makefile in your working directory.
Wrappers are in moscow:/usr9/xliu/gvsm.
If you want to see the source code for LSI and GVSM,
they are at moscow:/usr3/xliu/work/lsi.
Example
Repeating part of Xin's work on the UNICEF corpus is a good example.
His documentation is complete and I'll review just a small part of it.
Goal: estimate the 11-avgp on the monolingual and translingual data sets of the UNICEF corpus.
Copy the Makefile in your gvsm directory
.
Try "make init". It should create a source directory with a bunch of data files.
Modify the SV value and "make all" will generate the corresponding evaluation files (look in the eval directory).
Example
For the UNICEF corpus, using ntc weighting.
| 500 |
MIR 11-avgp | 0.3791
|
TIR 11-avgp | 0.3674
|
DON'T FORGET TO CLEAN AND TIDY ('gmake clean tidy') AS SOON AS YOU GET THE DESIRED RESULTS. DATA AND TEMPORARY FILES USE LOTS OF RESOURCES.