Keep in mind that Janus is still under development, and that we cannot guarantee that every script listed here will work correctly forever. Sometimes a program changes faster than its supporting documentation.
All of these scripts were written for the tutorial; they are not meant to work for arbitrary tasks in arbitrary environments. Each script can be run unmodified in the do-it-yourself environment of this tutorial. None of the scripts uses any external script or library function; they are all complete. There is also a big Janus script library which contains many scripts for many jobs and environments, but these are not part of the tutorial. Once you know how to work with Janus, you will find it much easier to use the big Janus library or to develop your own scripts.
Okay, here come the scripts:
Given a list of utterances together with their transcriptions, create a simple Janus database object.
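The Janus script itself is written in Tcl and builds a Janus database object; as a conceptual sketch only, the core idea is to map each utterance ID to a record holding its transcription. The file layout below (one line per utterance, ID first, then the words) is an illustrative assumption, not the exact Janus database format.

```python
# Conceptual sketch (NOT the Janus DBase API): build a simple
# utterance database from transcription lines that look like
#   utt001 HELLO WORLD
def build_database(lines):
    """Map each utterance ID to a record with its transcription."""
    db = {}
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip empty lines
        utt_id, words = fields[0], fields[1:]
        db[utt_id] = {"utt": utt_id, "text": words}
    return db

transcripts = [
    "utt001 HELLO WORLD",
    "utt002 GOOD MORNING JANUS",
]
db = build_database(transcripts)
```

A real Janus database additionally stores things like the path to the recorded audio for each utterance.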
Initialize a context-independent system, i.e. create an initial environment and the architecture description files.
Fire up a Janus process for a newly created environment, using acoustic parameters from a generic recognizer.
Use the generic recognizer's acoustic models to run an alignment on the training database and write label files.
Compute a first LDA transformation matrix (and some side dishes, like class-counts) for a context independent system.
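Janus computes the LDA transformation internally from the accumulated class scatter matrices. As a conceptual sketch of what that step does (using numpy, not the Janus API), LDA finds the directions that best separate the classes by solving an eigenvalue problem on the within-class and between-class scatter:

```python
# Conceptual sketch of an LDA transformation matrix computed from
# labelled feature vectors X (rows) with class labels y.
import numpy as np

def lda_matrix(X, y):
    """Return eigenvectors of pinv(Sw) @ Sb, sorted by eigenvalue."""
    mean_total = X.mean(axis=0)
    dim = X.shape[1]
    Sw = np.zeros((dim, dim))  # within-class scatter
    Sb = np.zeros((dim, dim))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mean_c = Xc.mean(axis=0)
        Sw += (Xc - mean_c).T @ (Xc - mean_c)
        d = (mean_c - mean_total)[:, None]
        Sb += len(Xc) * (d @ d.T)
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(Sw) @ Sb)
    order = np.argsort(eigvals.real)[::-1]  # best directions first
    return eigvecs.real[:, order]

# Tiny illustrative example: two well-separated classes.
X = np.array([[1.0, 0.0], [1.1, 0.1], [5.0, 4.0], [5.1, 4.1]])
y = np.array([0, 0, 1, 1])
W = lda_matrix(X, y)
```

The "side dishes" mentioned above, such as class counts, fall out of the same accumulation pass over the training data.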
Run a forced-alignment training iteration on all training data along the previously saved labels, and store the optimized weight files.
Extract the recognition vocabulary from the dictionary.
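In essence this script just collects the word entries of the dictionary. A conceptual sketch in Python (the line format and the parenthesized pronunciation-variant tags are illustrative assumptions, not the exact Janus dictionary syntax):

```python
# Conceptual sketch: extract the recognition vocabulary from a
# pronunciation dictionary whose lines look like
#   WORD  ph1 ph2 ph3
def extract_vocabulary(dict_lines):
    vocab = set()
    for line in dict_lines:
        fields = line.split()
        if not fields:
            continue
        # drop a pronunciation-variant tag like "(2)" if present
        word = fields[0].split("(")[0]
        vocab.add(word)
    return sorted(vocab)

vocab = extract_vocabulary([
    "HELLO  HH AH L OW",
    "WORLD  W ER L D",
    "HELLO(2)  HH EH L OW",
])
```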
Compute a simple unigram/bigram language model based on the training data.
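The underlying estimation is simple counting. As a conceptual sketch (maximum-likelihood estimates only; a language model used for real decoding would also need smoothing and back-off):

```python
# Conceptual sketch of unigram/bigram counts and the resulting
# maximum-likelihood bigram probability P(w2 | w1).
from collections import Counter

def count_ngrams(sentences):
    unigrams, bigrams = Counter(), Counter()
    for words in sentences:
        padded = ["<s>"] + words + ["</s>"]  # sentence boundary tokens
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams

def bigram_prob(w1, w2, unigrams, bigrams):
    if unigrams[w1] == 0:
        return 0.0  # unseen history
    return bigrams[(w1, w2)] / unigrams[w1]

sents = [["HELLO", "WORLD"], ["HELLO", "JANUS"]]
uni, bi = count_ngrams(sents)
```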
Run the decoder on the test set using the previously trained weights. Here we only run a very simple one-pass decoding.
Compute new codebooks with k-means after extracting sample vectors. Write new weights and description files.
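The codebook initialization is standard k-means over the extracted sample vectors. A conceptual sketch of that step (numpy only; iteration count and initialization strategy are illustrative choices, not what Janus hard-codes):

```python
# Conceptual sketch of k-means codebook initialization: place k
# codebook vectors, then alternate assignment and mean-update steps.
import numpy as np

def kmeans(samples, k, iters=10, seed=0):
    rng = np.random.default_rng(seed)
    # initialize codebook vectors with k distinct random samples
    centers = samples[rng.choice(len(samples), size=k, replace=False)]
    for _ in range(iters):
        # assign every sample to its nearest codebook vector
        dists = np.linalg.norm(samples[:, None, :] - centers[None], axis=2)
        assign = dists.argmin(axis=1)
        # move each codebook vector to the mean of its samples
        for j in range(k):
            if np.any(assign == j):
                centers[j] = samples[assign == j].mean(axis=0)
    return centers, assign

data = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
codebook, assignment = kmeans(data, k=2)
```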
Train for a couple of iterations, not doing forced alignment but using the previously written label files.
Create a first unclustered context-dependent environment. Compute polyphone lists and write the needed description files.
Train the context-dependent system; this is similar to the "trainAlongLabels" script, only the startup differs.
Create a list of phoneme classes that can be used for the decision-tree clustering of polyphone contexts.
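Conceptually, such a list groups phonemes into named classes, and each class becomes a yes/no question the decision tree can ask about a polyphone context. A minimal sketch in Python (the class names and members below are illustrative examples, not Janus's actual class set):

```python
# Conceptual sketch: phoneme classes as named sets, usable as the
# "questions" of a decision tree over polyphone contexts.
PHONE_CLASSES = {
    "VOWEL": {"AA", "AE", "IY", "UW"},
    "NASAL": {"M", "N", "NG"},
    "FRICATIVE": {"F", "V", "S", "Z"},
}

def answers(phone):
    """Return the class questions this phone answers 'yes' to."""
    return [name for name, members in sorted(PHONE_CLASSES.items())
            if phone in members]
```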
Cluster the polyphone contexts into fewer contexts, and create a separate codebook for each new cluster. Write the corresponding description files.
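The driving idea behind such divisive clustering is to repeatedly pick the question whose yes/no split of the contexts gains the most, e.g. the largest count-weighted entropy reduction. A conceptual sketch of that criterion only (the actual distance measure Janus uses when splitting may differ):

```python
# Conceptual sketch of an entropy-based splitting criterion:
# gain = H(parent) - weighted H(left) - weighted H(right).
import math

def entropy(counts):
    """Entropy in bits of a discrete count distribution."""
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c)

def split_gain(parent_counts, left_counts, right_counts):
    n = sum(parent_counts)
    nl, nr = sum(left_counts), sum(right_counts)
    return (entropy(parent_counts)
            - (nl / n) * entropy(left_counts)
            - (nr / n) * entropy(right_counts))
```

A perfect split of a 50/50 distribution, for instance, gains one full bit; the clustering stops splitting when no question gains enough, and each resulting leaf receives its own codebook.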
Compute an LDA transformation matrix a second time, this time using the context-dependent classes.
Compute new context-dependent codebooks with k-means after extracting sample vectors. Write new weights and description files.
Run a more sophisticated test on the test data, using multiple pass search and lattice rescoring.