设为首页收藏本站

LUPA开源社区

 找回密码
 注册
文章 帖子 博客
LUPA开源社区 首页 业界资讯 软件追踪 查看内容

jsoup 1.10.3发布,Java的HTML解析器

2017-6-12 21:42| 发布者: joejoe0332| 查看: 534| 评论: 0|原作者: oschina|来自: oschina

摘要: jsoup 1.10.3 发布了,该版本带来了更好的 CSS 选择器性能,Jsoup.Connection 改进和其他 bug 修复。详情包括:ImprovementsAddedElements.eachText()andElements.eachAttr(), which return a list of anElement'ste ...

jsoup 1.10.3 发布了,该版本带来了更好的 CSS 选择器性能,Jsoup.Connection 改进和其他 bug 修复。

详情包括:

Improvements

  • Added Elements.eachText() and Elements.eachAttr(), which return a list of an Element's text or attribute values, respectively. This makes it simpler to for example get a list of each URL on a page: List<String> urls = doc.select("a").eachAttr("abs:href"");

  • Improved selector validation for :contains(...) with unbalanced quotes.

  • Improved the speed of index based CSS selectors and other methods that use elementSiblingIndex, by a factor of 34x.

  • Added Node.clearAttributes(), to simplify removing of all attributes of a Node / Element.

Fixes

  • Bugfix: if an attribute name started or ended with a control character, the parse would fail with a validation exception.

  • Bugfix: Element.hasClass() and the .classname selector would not find the class attribute case-insensitively.

  • Bugfix: In Jsoup.Connection, if a redirect contained a query string with %xxescapes, they would be double escaped before the redirect was followed, leading to fetching an incorrect location.

  • Bugfix: In Jsoup.Connection, if a request body was set and the connection was redirected, the body would incorrectly still be sent.

  • Bugfix: In DataUtil when detecting the character set from meta data, and there are two Content-Types defined, use the one that defines a character set.

  • Bugfix: when parsing unknown tags in case-sensitive HTML mode, end tags would not close scope correctly.

  • In Jsoup.Connection, ensure there is no Content-Type set when being redirected to a GET.

  • Bugfix: in certain locales (Turkish specifically), lowercasing and case insensitivity could fail for specific items.

下载地址:https://jsoup.org/download


酷毙

雷人

鲜花

鸡蛋

漂亮
  • 快毕业了,没工作经验,
    找份工作好难啊?
    赶紧去人才芯片公司磨练吧!!

最新评论

关于LUPA|人才芯片工程|人才招聘|LUPA认证|LUPA教育|LUPA开源社区 ( 浙B2-20090187 浙公网安备 33010602006705号   

返回顶部