Split words that contain an apostrophe.
Language models often represent a word containing an apostrophe as two tokens, for example:
he’s -> he ’s
isn’t -> is n’t
exceptions (Sequence[str]) – Words containing an apostrophe that are preserved as-is instead of being split.
Returns a new TextGrid in which realizations containing an apostrophe are split accordingly.
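The splitting rule can be sketched on plain strings. This is only an illustration, not the library's implementation: the helper name split_apostrophe_word is hypothetical, it works on a single word rather than on a TextGrid, and the handling of "n’t" contractions is assumed from the two examples above.

```python
from typing import List, Sequence

# Both the ASCII and the typographic apostrophe are treated alike (assumption).
APOSTROPHES = ("'", "\u2019")


def split_apostrophe_word(word: str, exceptions: Sequence[str] = ()) -> List[str]:
    """Hypothetical helper: split one word into tokens at its first apostrophe."""
    # Words listed as exceptions are preserved untouched.
    if word in exceptions:
        return [word]

    # Position of the first apostrophe, or -1 if there is none.
    idx = next((i for i, ch in enumerate(word) if ch in APOSTROPHES), -1)
    if idx <= 0:
        # No apostrophe, or a leading one: nothing to split off.
        return [word]

    # Negative contractions keep the "n" with the clitic: isn't -> is + n't.
    if idx >= 2 and word[idx - 1] == "n" and word[idx + 1:] == "t":
        return [word[:idx - 1], word[idx - 1:]]

    # Default: the apostrophe starts the second token: he's -> he + 's.
    return [word[:idx], word[idx:]]


print(split_apostrophe_word("he's"))      # ['he', "'s"]
print(split_apostrophe_word("isn't"))     # ['is', "n't"]
print(split_apostrophe_word("o'clock", exceptions=["o'clock"]))  # ["o'clock"]
```

In this sketch, exceptions are matched on the exact word string, so case variants would need to be listed separately.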