Get byte from a unicode string : encoding « String « Ruby






Get byte from a unicode string


$KCODE = "u"        # Specify Unicode UTF-8, or start Ruby with -Ku option
require "jcode"     # Load multibyte character support

mb = "2\303\2272=4" # This is "2×2=4" with a Unicode multiplication sign

mb.each_byte do |c| # Iterate through the bytes of the string.
  print c, " "      # c is Fixnum
end                 # Outputs "50 195 151 50 61 52 "

 








Related examples in the same category

1.Check the string encoding
2.Encoding and bytesize
3.Interpret a byte as an iso-8859-15 codepoint, and transcode to UTF-8
4.The iso-8859-1 encoding doesn't have a Euro sign, so this raises an exception
5.Encoding constant
6.Get utf-8 encoding
7.Specify Unicode UTF-8, or start Ruby with -Ku option
8.Get the position of the first multibyte char
9.Get each char in a unicode