This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision | |||
|
language_modeling [2015/03/31 16:30] mganzeboom |
language_modeling [2015/03/31 16:40] (current) mganzeboom |
||
|---|---|---|---|
| Line 30: | Line 30: | ||
| ==== Commands to create a simple unsmoothed bigram text model ==== | ==== Commands to create a simple unsmoothed bigram text model ==== | ||
| Also see this short and practical tutorial as part of a Linguistics course at UC San Diego [[http:// | Also see this short and practical tutorial as part of a Linguistics course at UC San Diego [[http:// | ||
| - | When you have your vocabulary and corpus text files ready, the following command from SRI LM Toolkit will also create a bigram language model and store it in binary | + | When you have your vocabulary and corpus text files ready, the following command from SRI LM Toolkit will create a bigram language model and store it in the ARPA backoff N-gram |
| - | '' | + | '' |
| For an explanation of this command and the options used, please refer to the above tutorial or the [[http:// | For an explanation of this command and the options used, please refer to the above tutorial or the [[http:// | ||
| + | |||
| + | To convert this text format to the binary format required by the SPRAAK ASR Toolkit, execute the following command (assuming you have the toolkit on the PATH):\\ | ||
| + | '' | ||