Here is the proposed file hierarchy:

  • <dataset>/
    • segments/
      • *.seg
    • acfeats/
      • *.slx2
    • <def>/
      • mcfeats/
        • *.ling
      • models/ (*.model files)
        • 1v1/
        • <language_set>/
      • results/
        • <language_set>/
        • <language_pair>/
      • ling.batch
      • def.xml
    • wav.batch
    • train.dataset
    • devtest.dataset
    • eval.dataset

(Items in angle brackets, such as <dataset> and <def>, are variables.)
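
As a rough illustration of the layout above, here is a minimal Python sketch that creates the fixed directories for one dataset and one definition. The function name `make_skeleton` and its parameters are hypothetical and not part of the proposal. It creates only the directories whose names are fixed in the hierarchy; the variable subdirectories (<language_set>, <language_pair>) are left for the tools that write into them.

```python
from pathlib import Path


def make_skeleton(root, dataset, definition):
    """Create the fixed directories of the proposed layout and return them.

    `dataset` and `definition` fill in the <dataset> and <def> variables.
    Hypothetical helper, for illustration only.
    """
    base = Path(root) / dataset
    def_dir = base / definition
    dirs = {
        "segments": base / "segments",             # *.seg
        "acfeats": base / "acfeats",               # *.slx2
        "mcfeats": def_dir / "mcfeats",            # *.ling
        "models_1v1": def_dir / "models" / "1v1",  # *.model
        "results": def_dir / "results",
    }
    for directory in dirs.values():
        # parents=True also creates <dataset>/ and <def>/ as needed.
        directory.mkdir(parents=True, exist_ok=True)
    return dirs
```

Calling `make_skeleton(".", "mydata", "mydef")` would create `mydata/segments`, `mydata/acfeats`, `mydata/mydef/mcfeats`, and so on; the names `mydata` and `mydef` are placeholders.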

List of Batch Files:

Type                             Input       Output
Phone Recognizer                 .wav        .seg
RecOptimizer                     .wav,trans  -
PRAAT                            .seg        .slx2
Feature Extraction + Conversion  .slx2       .ling
Learner                          .ling       .model
Classifier                       .ling       prob
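
The table above can also be read as a dependency chain: each stage consumes what an earlier stage produces. The following is a small Python sketch of that reading. The names `STAGES`, `producer_of` and `consumers_of` are hypothetical, and the entries are copied from the table as-is (including "trans", "-" and "prob", whose exact meaning the table does not spell out).

```python
# (stage name, input kinds, output kind), copied from the table above.
STAGES = [
    ("Phone Recognizer", (".wav",), ".seg"),
    ("RecOptimizer", (".wav", "trans"), "-"),
    ("PRAAT", (".seg",), ".slx2"),
    ("Feature Extraction + Conversion", (".slx2",), ".ling"),
    ("Learner", (".ling",), ".model"),
    ("Classifier", (".ling",), "prob"),
]


def producer_of(kind):
    """Return the name of the stage whose output is `kind`, or None."""
    for name, _inputs, output in STAGES:
        if output == kind:
            return name
    return None


def consumers_of(kind):
    """Return the names of all stages that take `kind` as an input."""
    return [name for name, inputs, _output in STAGES if kind in inputs]
```

For example, `consumers_of(".ling")` lists both the Learner and the Classifier, while `producer_of(".wav")` returns `None` because .wav files are the raw input to the pipeline.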

The first two are “per language” batch files; the last two are “full batch”.

nlp-private/proposed-file-hierarchy-slid.txt · Last modified: 2015/04/23 13:17 by ryancha