Java HTML to Text text(Element e)

Here you can find the source of text(Element e)

Description

Fetches the text of an element but preserves newlines.

License

Apache License

Parameter

Parameter Description
e The element which has the text you need.

Return

The text of the element with newlines inside where BR and P

Declaration

public static String text(Element e) 

Method Source Code


//package com.java2s;
/*//from   www. j  av a  2  s  .co m
   Copyright 2014-2016 PetaByte Research Ltd.
    
   Licensed under the Apache License, Version 2.0 (the "License");
   you may not use this file except in compliance with the License.
   You may obtain a copy of the License at
    
   http://www.apache.org/licenses/LICENSE-2.0
    
   Unless required by applicable law or agreed to in writing, software
   distributed under the License is distributed on an "AS IS" BASIS,
   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
   See the License for the specific language governing permissions and
   limitations under the License.
 */

import static com.google.common.base.Preconditions.checkNotNull;
import org.jsoup.nodes.Element;

public class Main {
    /**
     * Fetches the text of an element but preserves newlines. Appends one
     * newline before every BR tag, prepends two newline before every P tag,
     * before calling Jsoup's text() on the element.
     *
     * @param e
     *            The element which has the text you need.
     * @return The text of the element with newlines inside where BR and P
     */
    public static String text(Element e) {
        checkNotNull(e, "e should not be null.");
        e.select("br").append("\\n");
        e.select("p").prepend("\\n\\n");
        return e.text().replaceAll("\\\\n", "\n").trim();
    }
}

Related

  1. html2text(final String html)
  2. html2text(String html)
  3. html2text(String htmlStr)
  4. text(Element element)
  5. textOf(final Element el)
  6. toElement(String html)
  7. toHtmlByHtml(String html)