Encoding and bytesize : encoding « String « Ruby






Encoding and bytesize


euro1 = "\u20AC"                     # Start with the Unicode Euro character
puts euro1                           # Prints "€"
euro1.encoding                       # => <Encoding:UTF-8>
euro1.bytesize                       # => 3

euro2 = euro1.encode("iso-8859-15")  # Transcode to Latin-15
puts euro2.inspect                   # Prints "\xA4"
euro2.encoding                       # => <Encoding:iso-8859-15>
euro2.bytesize                       # => 1

euro3 = euro2.encode("utf-8")        # Transcode back to UTF-8
euro1 == euro3                       # => true

 








Related examples in the same category

1.Check the string encoding
2.Interpret a byte as an iso-8859-15 codepoint, and transcode to UTF-8
3.The iso-8859-1 encoding doesn't have a Euro sign, so this raises an exception
4.Encoding constant
5.Get utf-8 encoding
6.Specify Unicode UTF-8, or start Ruby with -Ku option
7.Get the position of the first multibyte char
8.Get byte from a unicode string
9.Get each char in a unicode