- [x] Changes in the C tokenizer - [x] Categorize failing tests - [x] Fix failing tests or modify/remove them as needed - [x] Changes in Python tokenizer <!-- gh-linked-prs --> ### Linked PRs * gh-102855 * gh-103633 * gh-103634 * gh-104006 * gh-104323 * gh-104731 * gh-104824 * gh-104847 * gh-104852 * gh-104854 * gh-104861 * gh-104865 <!-- /gh-linked-prs -->