Convert into Hexadecimal notation of Unicode : Unicode « I18N « Java






Convert into Hexadecimal notation of Unicode

   

/* infoScoop OpenSource
 * Copyright (C) 2010 Beacon IT Inc.
 * 
 * This program is free software: you can redistribute it and/or modify
 * it under the terms of the GNU Lesser General Public License version 3
 * as published by the Free Software Foundation.
 * 
 * This program is distributed in the hope that it will be useful,
 * but WITHOUT ANY WARRANTY; without even the implied warranty of
 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
 * GNU Lesser General Public License for more details.
 * 
 * You should have received a copy of the GNU Lesser General Public
 * License along with this program.  If not, see
 * <http://www.gnu.org/licenses/lgpl-3.0-standalone.html>.
 */


/**
 * A utility class related to character string.
 * 
 * @author Eiichi Sakurai
 */
 
 
public class Util{
  
  /**
   * convert into Hexadecimal notation of Unicode.<br>
   * example)a?\u0061
   * @param str
   * @return
   */
  public static String toHexString(String str) {
    StringBuffer sb = new StringBuffer();
    for (int i = 0; i < str.length(); i++) {
      sb.append(toHexString(str.charAt(i)));
    }
    return sb.toString();
  }

  /**
   * convert into Hexadecimal notation of Unicode.<br>
   * example)a?\u0061
   * @param ch
   * @return
   */
  public static String toHexString(char ch) {
    String hex = Integer.toHexString((int) ch);
    while (hex.length() < 4) {
      hex = "0" + hex;
    }
    hex = "\\u" + hex;
    return hex;
  }
}

   
    
    
  








Related examples in the same category

1.Unicode DisplayUnicode Display
2.Character Sets and Unicode: Code Set Conversion
3.Display "special character" using Unicode
4.International friendly string comparison with case-order
5.Generic unicode textreader, which will use BOM mark to identify the encoding to be used. If BOM is not found then use a given default or system encoding.
6.Generic Unicode text reader, which uses a BOM (Byte Order Mark) to identify the encoding to be used.
7.Generic unicode text reader.
8.processing SGML into unicode characters.
9.Write a 16 bit short as LITTLE_ENDIAN
10.Write a 32 bit int as LITTLE_ENDIAN.
11.Arabic Reshaper