The Strawberry Problem: Emergence of Character-level Understanding in Tokenized Language Models. Adrian Cosma, Stefan Ruseti, Emilian Radoi, Mihai Dascalu. In: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), November 2025, Suzhou, China, DOI: 10.18653/v1/2025.emnlp-main.1434.


