utf « string « Java Data Type Q&A





1. UTF -8 conversion for String in other Language    coderanch.com

Not really. You haven't identified the problem in any clear way. You just have some idea that there's some kind of a problem, but you haven't described that problem in any way at all. You might consider producing a description of the problem; then other people might be able to suggest ways to deal with it.

2. How to truncate a utf string to desired boundaries    forums.oracle.com

Okay. There aren't any surrogate pairs in that string. (And it's not a "utf-8" string either, there's no such thing in Java, only Unicode.) And all the characters are in the BMP, so each of them is a code point. You wrote code to extract the first two chars, then, so why did you expect it to extract three? Edit: As ...