I accidentally answered a question where the original problem involved splitting sentence to separate words.
And the author suggested to use BreakIterator to tokenize input strings and some people ...
I have a text like that:
The C language is%y% widely used today in application, operating
system, and embedded system development, and its influence is seen in