Java String Diacritics removeDiacritics(String st)

Here you can find the source of removeDiacritics(String st)

Description

Remove diacritics (i.e., accents) from String

License

Open Source License

Parameter

Parameter Description
st a parameter

Declaration

public static String removeDiacritics(String st) 

Method Source Code

//package com.java2s;
/**/*from ww  w.j  a  v  a 2s  .c  o  m*/
 * PTStemmer - A Stemming toolkit for the Portuguese language (C) 2008-2010 Pedro Oliveira
 * 
 * This file is part of PTStemmer.
 * PTStemmer is free software: you can redistribute it and/or modify
 * it under the terms of the GNU Lesser General Public License as published by
 * the Free Software Foundation, either version 3 of the License, or
 * (at your option) any later version.
 * 
 * PTStemmer is distributed in the hope that it will be useful,
 * but WITHOUT ANY WARRANTY; without even the implied warranty of
 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
 * GNU Lesser General Public License for more details.
 * 
 * You should have received a copy of the GNU Lesser General Public License
 * along with PTStemmer. If not, see <http://www.gnu.org/licenses/>.
 * 
 */

import java.text.Normalizer;

public class Main {
    /**
     * Remove diacritics (i.e., accents) from String
     * @param st
     * @return
     */
    public static String removeDiacritics(String st) {
        st = Normalizer.normalize(st, Normalizer.Form.NFD);
        return st.replaceAll("[^\\p{ASCII}]", "");
    }
}

Related

  1. removeDiacriticalMarks(String string)
  2. removeDiacriticals(final String s)
  3. removeDiacritics(String input)
  4. removeDiacritics(String input)
  5. removeDiacritics(String text)
  6. removeDiacritics(String word)