IDOL Speech Server allows you to add extra information into a transcript, which it then marks with a metadata tag in the processed transcript, enabling you to easily find it. The extra information does not affect transcription alignment. For example, you might want to mark the point in a lecture transcript where a video was played. You could add the following to the transcript:
... Let’s look at this example <metadata video = GlobalIssues /> Environmental issues ...
The aligned transcript that is generated contains the following:
...
1
|
A
|
10.1
|
0.3
|
Let’s
|
1.0
|
|
1
|
A
|
10.4
|
0.2
|
look
|
1.0
|
|
1
|
A
|
10.6
|
0.3
|
at
|
1.0
|
|
1
|
A
|
10.9
|
0.3
|
this
|
1.0
|
|
1
|
A
|
11.2
|
0.3
|
example
|
1.0
|
|
1
|
A
|
11.5
|
0.0
|
video = GlobalIssues
|
1.0
|
<-- metadata tag
|
1
|
A
|
11.5
|
0.4
|
Environmental
|
1.0
|
|
1
|
A
|
11.9
|
0.3
|
issues
|
1.0
|
Metadata tags must conform to the following syntax to pass through the text normalizer unmodified. The syntax is loosely based on the format of tags used in XML.
Metadata ::= '<metadata' (S Attribute)* S? '/'? '>'
S ::= (#x20 | #x9 | #xD | #xA)+
Attribute ::= Name Eq AttValue
Name ::= NameStartChar (NameChar)*
NameStartChar ::= ":"
| [A-Z]
| "_"
| [a-z]
| [#xC0-#xD6]
| [#xD8-#xF6]
| [#xF8-#x2FF]
| [#x370-#x37D]
| [#x37F-#x1FFF]
| [#x200C-#x200D]
| [#x2070-#x218F]
| [#x2C00-#x2FEF]
| [#x3001-#xD7FF]
| [#xF900-#xFDCF]
| [#xFDF0-#xFFFD]
| [#x10000-#xEFFFF]
NameChar ::= NameStartChar
| "-"
| "."
| [0-9]
| #xB7
| [#x0300-#x036F]
| [#x203F-#x2040]
Eq ::= S? '=' S?
AttValue ::= '"' [^"]* '"'
| "'" [^']* "'"
|