Initial < > and Interpreter Errors

Summed up: The first < > contains information on the speaker. If the speaker violates any rules, then annotate it behind the violating word in < >.
- Whisper can not use < >, so we can use them in two contexts:
- At the beginning of the text window to mark (within the same < >)
- <speaker/interpreter> : this tells me if an interpreter is speaking or the original speaker. Please always do this.
- <non_native> : this means the speaker does not sound like a native speaker. Only put this down if you are sure.
- <wrong_language> : this means the recording is not in your working language. You do not have to do anything else here.
- <voting_session> : this means the recording is mainly the president counting votes. You do not have to do anything else here.
- You may also use the < > at the start to mark something you find really exceptional, but please use this very rarely. For example <speaker very thick dialect>
- Directly after an obvious grammatical error of the speaker/interpreter (give them the benefit of the doubt):
In the example "questa" does not fit the gender of the following noun, <questo> is correct. Two lines down "asprendere" does not exist at all
in standard Italian, while "prendere" does not seem a viable choice. If it is unclear from the recording what the right choice is <?> can be used.
- Please don't mark unusual idioms this way, just single words that are obviously wrong.
- You should mark words that should not be there with <->. You should mark every single word of a "false start" this way.
- If there is a word missing and you are 100% certain where it should be, then you can write e.g. <+the>, otherwise mark sentences with missing elements with a <+> right after a punctuation mark