JANUS-2: #QK-m(1) #QK-m(1)-0 #QK-m(1)-1
JANUS-3: #QK-m(1) {#QK-m(1)-0 #QK-m(1)-1}
JANUS-2:
SIL 2 0 0.01 1 0.0
3 3 0 0.01 1 0.0 2 0.015

JANUS-3:
SIL { { 0 0.01 } { 1 0.0 } }
3 { { 0 0.01 } { 1 0.0 } { 2 0.015 } }
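The rewrite of the transition entries above is mechanical, so it can be scripted. The following is a minimal Python sketch, not part of JANUS. It assumes (judging only from the example above) that each JANUS-2 entry is a name, a transition count, and then that many offset/penalty pairs. The function name convert_transition_line is hypothetical.

```python
def convert_transition_line(line):
    """Convert one JANUS-2 style transition entry to JANUS-3 brace syntax.

    Assumed input layout (inferred from the example, not from a spec):
        <name> <count> <offset> <penalty> [<offset> <penalty> ...]
    """
    tokens = line.split()
    name, count = tokens[0], int(tokens[1])
    values = tokens[2:]
    if len(values) != 2 * count:
        raise ValueError("expected %d pairs in: %r" % (count, line))
    # Keep the numbers as strings so that 0.0 and 0.015 are copied unchanged.
    pairs = ["{ %s %s }" % (values[i], values[i + 1])
             for i in range(0, len(values), 2)]
    return "%s { %s }" % (name, " ".join(pairs))


if __name__ == "__main__":
    for old in ("SIL 2 0 0.01 1 0.0", "3 3 0 0.01 1 0.0 2 0.015"):
        print(convert_transition_line(old))
```

Run on the two JANUS-2 entries, this prints the two JANUS-3 entries shown above. The count check is there to catch entries that were split or merged by accident.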
JANUS-2

; Feature Description File for JANUS-2 Training and Testing
;
; to            magic   what         from       parameters
;
INPUT      21   2       read         FFT
MFSC       16   3       cut          INPUT      0 15
dMFSC      16   4       delta        MFSC       2
LOGPOW     1    11      cut          INPUT      16 16
dLOGPOW1   1    5       delta        LOGPOW     1
ddLOGPOW1  1    6       delta        dLOGPOW1   1
dLOGPOW2   1    7       delta        LOGPOW     2
ddLOGPOW2  1    8       delta        dLOGPOW2   2
COMBI      4    9       cut          INPUT      17 20
ZWISCH     40   10      concat       MFSC COMBI
LDAS       40   13      meanvarnorm  ZWISCH     2
LDAT       40   12      transform    LDAS       ../env-GSST/matrix.cd
LDA1       16   0       cut          LDAT       0 15
LDA2       16   1       cut          LDAT       16 31

JANUS-3

# Feature Description File for JANUS-3
#
#    what     to         from       parameters
#
$fes read     INPUT      $arg(FFT)
$fes split    MFSC       INPUT      0 15
$fes delta    dMFSC      MFSC       -delta -2
$fes split    LOGPOW     INPUT      16 16
$fes delta    dLOGPOW1   LOGPOW     -delta -1
$fes delta    ddLOGPOW1  dLOGPOW1   -delta -1
$fes delta    dLOGPOW2   LOGPOW     -delta -2
$fes delta    ddLOGPOW2  dLOGPOW2   -delta -2
$fes split    COMBI      INPUT      17 20
$fes merge    ZWISCH     MFSC dMFSC dLOGPOW1 ddLOGPOW1 dLOGPOW2 ddLOGPOW2 COMBI
$fes meansub  LDAS       ZWISCH     -a 2
$fes matmul   LDAT       LDAS       $fes:matrixfeat.data
$fes split    LDA1-big   LDAT       0 15
$fes split    LDA2-big   LDAT       16 31
set frameN [expr [$fes:LDA1-big configure -frameN] -3]
$fes cut      LDA1       LDA1-big   2 $frameN
$fes cut      LDA2       LDA2-big   2 $frameN
WB WE
3state { ROOT-b ROOT-m ROOT-e } { 3 3 3 }
SIL { ROOT-m ROOT-m } { SIL SIL }

The SIL and the 3 in the last pair of braces refer to names in the transition file. Together with the transition file, this defines a set of full phoneme topologies.
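To make the pairing of states and transition names explicit, here is a small Python sketch (not JANUS code) that reads one topology entry. It assumes that the first brace group lists one root per state and the second brace group lists one transition-file name per state, in the same order. The function name parse_topology is hypothetical.

```python
import re


def parse_topology(line):
    """Split a topology entry into its name and (state root, transition name) pairs.

    Assumed layout: <name> { <root> ... } { <transition name> ... }
    """
    name = line.split()[0]
    groups = re.findall(r"\{([^{}]*)\}", line)
    if len(groups) != 2:
        raise ValueError("expected two brace groups in: %r" % line)
    roots, transitions = groups[0].split(), groups[1].split()
    if len(roots) != len(transitions):
        raise ValueError("state and transition counts differ in: %r" % line)
    return name, list(zip(roots, transitions))


if __name__ == "__main__":
    print(parse_topology("3state { ROOT-b ROOT-m ROOT-e } { 3 3 3 }"))
    print(parse_topology("SIL { ROOT-m ROOT-m } { SIL SIL }"))
```

For the 3state entry this yields three states, each using the transition entry named 3. For SIL it yields two states, both using the transition entry named SIL.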
ttree { 0=SIL } 3stateNode silNode - -
3stateNode { } - - - 3state
silNode { } - - - SIL

Note that the questions come in braces. This is because you can put a list of questions there that are connected by "and". The 0 in the first line means the question refers to the current phoneme, not the one to its left or right. 0=SIL asks: is the current phoneme a SIL phoneme? If no, we go to the leaf that says to use the model named 3state for this phoneme; if yes, we go to the leaf with the model named SIL.
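The lookup described above can be sketched in a few lines of Python. This is an illustration only, not how JANUS implements it. It assumes the columns after the question list are the "no" successor, the "yes" successor, a third successor (called "unknown" here, which is a guess), and the model name, with - meaning "none". It also treats the right-hand side of a question as a single phoneme name. All function names are hypothetical.

```python
TREE = """\
ttree { 0=SIL } 3stateNode silNode - -
3stateNode { } - - - 3state
silNode { } - - - SIL
"""


def parse_tree(text):
    """Read tree entries into a dict keyed by node name."""
    nodes = {}
    for line in text.strip().splitlines():
        name, rest = line.split(None, 1)
        open_i, close_i = rest.index("{"), rest.index("}")
        questions = rest[open_i + 1:close_i].split()
        # Assumed column order: no, yes, unknown, model ("-" means none).
        no, yes, unknown, model = [None if t == "-" else t
                                   for t in rest[close_i + 1:].split()]
        nodes[name] = {"questions": questions, "no": no, "yes": yes,
                       "unknown": unknown, "model": model}
    return nodes


def answer(question, context):
    """Evaluate a question such as 0=SIL against {offset: phoneme}."""
    offset, phone = question.split("=")
    return context.get(int(offset)) == phone


def find_model(nodes, root, context):
    """Descend from root until a node that names a model is reached."""
    node = nodes[root]
    while node["model"] is None:
        # All questions in the braces must hold (they are joined by "and").
        all_yes = all(answer(q, context) for q in node["questions"])
        node = nodes[node["yes"] if all_yes else node["no"]]
    return node["model"]


if __name__ == "__main__":
    nodes = parse_tree(TREE)
    print(find_model(nodes, "ttree", {0: "SIL"}))
    print(find_model(nodes, "ttree", {0: "A"}))
```

With the current phoneme set to SIL the walk ends at the model SIL. With any other current phoneme (A is just a placeholder here) it ends at the model 3state.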
Ready ? Ok.