Ticket #43 (closed defect: fixed)

Opened 3 years ago

Last modified 2 years ago

Won't parse Yahoo!! RSS

Reported by: kyle.c.maxwell@…
Owned by: ser
Priority: normal
Milestone: 3.1.5
Component: DOM
Version: 3.1.2
Severity: normal
Keywords:
Cc:
Ruby version: 1.8.2
Operating system: Linux

Description

# Sample RSS feed:

url = http://news.search.yahoo.com/news/rss?p=market&ei=UTF-8&fl=0&x=wrt

# Whenever an item title has an apostrophe in it, the following returns nothing:

Document.new(Kernel.open(url)).elements.each("//item")

# In addition, any text nodes with apostrophes below will be corrupted:

Document.new(Kernel.open(url)).elements.each("//title").collect{|e| e.text}

Change History

Changed 3 years ago by anonymous

Kernel.open(url) should be Kernel.open(url).read
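For anyone trying to reproduce this without depending on the live feed, here is a minimal sketch of the same two XPath queries run against an inline document. The SAMPLE_RSS string is a hypothetical stand-in, not the actual Yahoo feed content; it only assumes that item titles contain apostrophes, both literal and written as &apos;.

```ruby
require "rexml/document"

# Hypothetical stand-in for the Yahoo feed (NOT the real feed content):
# a tiny RSS document whose item titles contain apostrophes.
SAMPLE_RSS = <<~XML
  <?xml version="1.0" encoding="UTF-8"?>
  <rss version="2.0">
    <channel>
      <title>Search results for market</title>
      <item><title>Market's rally continues</title></item>
      <item><title>Investors&apos; outlook improves</title></item>
    </channel>
  </rss>
XML

# Parse from a String, which is what Kernel.open(url).read would hand over.
doc = REXML::Document.new(SAMPLE_RSS)

# First complaint: //item should match every <item> element.
items = []
doc.elements.each("//item") { |e| items << e }

# Second complaint: text of every <title>, including those with apostrophes.
titles = []
doc.elements.each("//title") { |e| titles << e.text }

puts "items matched: #{items.size}"
titles.each { |t| puts t }
```

If //item matches nothing here, or the printed titles come out mangled, that would point at REXML rather than at the feed itself.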

Changed 3 years ago by ser

  • status changed from new to assigned
  • Confirmed that something odd is going on with XPath and the Yahoo feed. In particular, only the //item XPath fails; other nodes are matched correctly.
  • Haven't (yet) confirmed that it is related to apostrophes in the text
  • The second complaint about text corruption can't be replicated with REXML 3.1.3

Changed 3 years ago by anonymous

  • milestone changed from 3.1.4 to 3.1.5

Changed 2 years ago by ser

  • status changed from assigned to closed
  • resolution set to fixed

Can't replicate. Either Yahoo fixed their feeds, or a previous bug fix in REXML resolved this; in any case, the Yahoo feed is working fine in REXML 3.1.4.

Unit tests were added in changeset:1207
