Jsoup fails to fetch data from some pages: org.jsoup.UnsupportedMimeTypeException: Unhandled content type. Must be text/*, application/xml, or application/
2017-03-29 10:50
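I was using Jsoup to fetch data from a number of websites. At first everything went smoothly, but when I requested data from a 某浪 site, Jsoup threw an exception. The most likely cause is that the Content-Type in the response headers is not one that Jsoup accepts by default.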
The request code was as follows:
private static void testOuGuanMatch() throws IOException {
    Document doc = Jsoup.connect("my URL")
            // User-Agent header sent with the request
            .userAgent("Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.15)")
            // timeout in milliseconds
            .timeout(5000)
            .get();
    System.out.println(doc);
}
As you can see, I set the user agent and the timeout here.
The error output was as follows:
org.jsoup.UnsupportedMimeTypeException: Unhandled content type. Must be text/*, application/xml, or application/xhtml+xml. Mimetype=application/javascript, URL=....
    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:472)
    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:424)
    at org.jsoup.helper.HttpConnection.execute(HttpConnection.java:178)
    at org.jsoup.helper.HttpConnection.get(HttpConnection.java:167)
    at calendarSpider.SpiderTest.testOuGuanMatch(SpiderTest.java:174)
    at calendarSpider.SpiderTest.main(SpiderTest.java:39)
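To make the message concrete, here is a minimal sketch of the rule it describes. The class `ContentTypeCheck` and the method `acceptedByDefault` are hypothetical names used for illustration. The logic only mirrors the wording of the exception message (text/*, application/xml, application/xhtml+xml) and is not Jsoup's internal implementation, which may differ between versions.

```java
import java.util.Locale;

public class ContentTypeCheck {

    // Hypothetical helper: models the rule quoted in the exception message,
    // not Jsoup's actual source code.
    public static boolean acceptedByDefault(String contentType) {
        // Drop parameters such as "; charset=utf-8" and normalize the case.
        String mime = contentType.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return mime.startsWith("text/")
                || mime.equals("application/xml")
                || mime.equals("application/xhtml+xml");
    }

    public static void main(String[] args) {
        String[] samples = {
            "text/html; charset=utf-8",
            "application/xhtml+xml",
            "application/javascript",
            "application/json"
        };
        for (String sample : samples) {
            System.out.println(sample + " -> " + acceptedByDefault(sample));
        }
    }
}
```

Under this rule, the `application/javascript` reported in the stack trace falls outside the accepted types, which matches the failure seen above.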
A Google search turned up the fix: add ignoreContentType(true) to the connection.
The modified code:
private static void testOuGuanMatch() throws IOException {
    Document doc = Jsoup.connect("my URL")
            // do not reject the response because of its Content-Type
            .ignoreContentType(true)
            .userAgent("Mozilla/5.0 (Windows; U; Windows NT 5.1; zh-CN; rv:1.9.2.15)")
            // timeout in milliseconds
            .timeout(5000)
            .get();
    System.out.println(doc);
}
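As the name suggests, ignoreContentType(true) tells Jsoup to skip its check of the response Content-Type, so a response such as application/javascript no longer triggers the exception.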
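One thing to keep in mind: get() still parses the response as an HTML document, which is rarely what you want for JavaScript or JSON. A possible variant, sketched below, uses execute() and reads the raw body instead. The class name `RawBodyFetch` and the shortened user agent string are assumptions for illustration, the URL is taken from the command line rather than hard-coded, and the Jsoup library is assumed to be on the classpath.

```java
import java.io.IOException;

import org.jsoup.Connection;
import org.jsoup.Jsoup;

public class RawBodyFetch {

    public static void main(String[] args) throws IOException {
        if (args.length == 0) {
            System.err.println("Usage: java RawBodyFetch <url>");
            return;
        }

        Connection.Response response = Jsoup.connect(args[0])
                .ignoreContentType(true) // skip the Content-Type check
                .userAgent("Mozilla/5.0") // shortened user agent, for illustration
                .timeout(5000)            // timeout in milliseconds
                .execute();               // send the request without building a Document

        // Print what the server actually returned, then the unparsed body.
        System.out.println("Content-Type: " + response.contentType());
        System.out.println(response.body());
    }
}
```

The raw string from body() can then be passed to whatever parser suits the content, for example a JSON library.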