The StreamToTextMusicFilterPunct
task takes a stream and returns a file containing a text version of the wav file. Sections of the audio that are determined to be music are filtered out and not included in the transcript. Simple sentence-forming punctuation (such as full stops and initial capital letters) is included in the .ctm
output.
Parameter | Description | Required |
---|---|---|
Type | The task type. Set to StreamToTextMusicFilterPunct . |
Yes |
Conf | Whether to generate word confidence scores. | |
Diag | Whether to generate diagnostic information. | |
DiagFile | The alignment diagnostics file to generate. | |
DnnScale | The DNN output acoustic score scaling factor. | |
FrameDupl | An integer value which allows for greater time efficiency with only a minimal loss of recognition accuracy. | |
Lang | The language pack to use. | |
LatFile | The name of the lattice file that contains word hypotheses. | |
LatScale | The depth of the lattice. | |
LatWinSize | The size (in seconds) of the lattice output window. | |
LatWordFile | A list of words to find. | |
NonSentFinalWords | A list of words that are unlikely to end a sentence. | |
Out | The file to write language identification results to. | |
SilThresh | The threshold between what the task identifies as silence and non-silence. | |
SpeechThresh | The threshold between speech and non-speech (music or noise). |
http://localhost:13000/action=AddTask&Type=StreamToTextMusicFilterPunct&SpeechThresh=-30&Out=Transcript1.ctm
This action uses port 13000
to instruct Speech Server, which is located on the local machine, to transcribe the audio stream (using the specified threshold values to determine sections that contain music), and to write the results to the Transcript1.ctm
file, with any sections determined to be music filtered out, and punctuation included.
|