I'd like to know:

How language model-based tokenizers fare on domain-specific documents, given that language models have no context for unknown tokens.

Are language model-based tokenizers any better at identifying abbreviations than rule-based ones?
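To illustrate why abbreviations trip up rule-based approaches: a minimal sketch of a naive rule-based sentence splitter (a hypothetical example, not any particular library's implementation; real rule-based tokenizers layer abbreviation lists and heuristics on top of rules like this):

```python
import re

def naive_rule_based_sentences(text):
    # Naive rule: split on a period followed by whitespace and a capital letter.
    # This wrongly treats "Dr." as a sentence boundary.
    return re.split(r'(?<=\.)\s+(?=[A-Z])', text)

text = "Dr. Smith measured 3.5 mg. The assay used E. coli cultures."
print(naive_rule_based_sentences(text))
# Splits after "Dr." and after "mg.", but not inside "E. coli"
```

Here the abbreviation "Dr." is incorrectly split off as its own sentence, which is exactly the kind of case where one might hope a model-based tokenizer generalizes better.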


