HtmlCleaner « xpath « Java XML Q&A

Home
Java XML Q&A
1.convert
2.Development
3.document
4.dom
5.dom4j
6.dtd
7.element
8.jaxb
9.jaxp
10.jdom
11.jsoup
12.namespace
13.Node
14.parse
15.parser
16.pdf
17.sax
18.schema
19.stax
20.tag
21.transform
22.Validation
23.xalan
24.xmlbeans
25.xpath
26.xsd
27.xslt
28.xstream
Java XML Q&A » xpath » HtmlCleaner 

1. HtmlCleaner failing on some xpaths generated by XPather    stackoverflow.com

I am using HtmlCleaner2.1 library for evaluating xpaths generated by XPather plugin against html to scrape content from it. But sometimes, HtmlCleaner fails to evaluate xpath. For e.x. http://www.megaoutdoors.co.uk/norwegen-army-shirt-zipped-roll-top-collar-278-p.asp For product title, xpath given ...

2. Scrape data using XPath and Firefox    stackoverflow.com

I have extracted the xpath using firefox's FirePath addon..

xpath = html/body/div[2]/div/div[5]/div[2]/div/div[2]/div[1]/span/span
Are there any Java API available where i can directly use the above xpath. I am planning to use htmlcleaner does it ...

java2s.com  | Contact Us | Privacy Policy
Copyright 2009 - 12 Demo Source and Support. All rights reserved.
All other trademarks are property of their respective owners.