StreamToTextMusicFilter

The StreamToTextMusicFilter task takes an audio stream and returns a file containing a text transcript of the audio. Sections of the audio that are identified as music are filtered out and excluded from the transcript.

Parameters

| Parameter | Description | Required |
|---|---|---|
| Type | The task type. Set to StreamToTextMusicFilter. | Yes |
| Conf | Whether to generate word confidence scores. | |
| Diag | Whether to generate diagnostic information. | |
| DiagFile | The alignment diagnostics file to generate. | |
| DnnScale | The DNN output acoustic score scaling factor. | |
| FrameDupl | An integer value that trades a minimal loss of recognition accuracy for greater time efficiency. | |
| Lang | The language pack to use. | |
| LatFile | The name of the lattice file that contains word hypotheses. | |
| LatScale | The depth of the lattice. | |
| LatWinSize | The size (in seconds) of the lattice output window. | |
| LatWordFile | A list of words to find. | |
| Out | The file to write the transcript to. | |
| SilThresh | The threshold between what the task identifies as silence and non-silence. | |
| SpeechThresh | The threshold between speech and non-speech (music or noise). | |

Example

http://localhost:13000/action=AddTask&Type=StreamToTextMusicFilter&SpeechThresh=-30&Out=Transcript1.ctm       

This action uses port 13000 to instruct Speech Server, which is located on the local machine, to transcribe the audio stream, using the specified SpeechThresh value to determine which sections contain music, and to write the results to the Transcript1.ctm file, with any sections identified as music filtered out.
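
The example request above can also be assembled programmatically. The following Python sketch is illustrative only: the helper names build_add_task_url and send_action are hypothetical and not part of Speech Server, and the sketch assumes that the server accepts the action as a plain HTTP GET on the URL shown in the example. The response format is not described in this section, so the raw response body is returned unparsed.

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def build_add_task_url(host, port, **task_params):
    """Build an AddTask URL in the same form as the example above.

    Hypothetical helper: the parameter names (Type, SpeechThresh, Out, ...)
    come from the parameter table; nothing here validates them.
    """
    # The example URL places the action directly after the slash,
    # followed by '&'-separated parameters.
    query = urlencode(task_params)
    return f"http://{host}:{port}/action=AddTask&{query}"


def send_action(url, timeout=30):
    """Send the action with an HTTP GET and return the raw response bytes.

    Assumption: the server answers a plain GET request on this URL.
    """
    with urlopen(url, timeout=timeout) as response:
        return response.read()


# Reproduce the example request from this section.
url = build_add_task_url(
    "localhost",
    13000,
    Type="StreamToTextMusicFilter",
    SpeechThresh=-30,
    Out="Transcript1.ctm",
)
```

Calling send_action(url) would submit the task to a Speech Server instance listening on that host and port. Building the URL separately makes it easy to inspect or log the request before sending it.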

