libo26 / feedparser

Automatically exported from code.google.com/p/feedparser
Other
0 stars 0 forks source link

inconsistent results using debian 6 #278

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

I get different entry keys from single feed source on two different servers.

The feed:
http://deals.adpages.com/austin/deals/rss

First machine:
>>> feed = feedparser.parse('http://deals.adpages.com/austin/deals/rss')
>>> feed.entries[0].keys()
['updated_parsed', 'links', u'smrss_largeimage', 'updated', 
u'smrss_merchantname', u'smrss_price', u'smrss_mediumimage', 
u'smrss_addressline2', u'smrss_city', u'smrss_value', 'title', 
u'smrss_emailposition', 'title_detail', u'smrss_smallimage', u'smrss_zipcode', 
u'smrss_sellendtime', 'summary_detail', u'smrss_emailsubject', 'tags', 
u'smrss_savings', 'link', u'smrss_rawdescription', 'summary', 
u'smrss_discount', u'smrss_state', u'smrss_address', u'smrss_sellbegintime']

Second machine:
>>> feed = feedparser.parse('http://deals.adpages.com/austin/deals/rss')
>>> feed.entries[0].keys()
['summary_detail', 'updated_parsed', 'links', 'title', 'tags', 'updated', 
'summary', 'title_detail', 'link']

What is the expected output? What do you see instead?

Well, i guess, on second server feedparser can't provide all the fields for 
entry.

What version of the product are you using? On what operating system?

First machine:
uname -a
Linux fuxter-laptop 2.6.38-ARCH #1 SMP PREEMPT Fri May 13 07:54:18 UTC 2011 
i686 Intel(R) Core(TM)2 Duo CPU T7100 @ 1.80GHz GenuineIntel GNU/Linux

Python 2.7.1 (r271:86832, Apr 15 2011, 12:11:58) 
[GCC 4.5.2 20110127 (prerelease)] on linux2

feedparser.__version__
'5.0.1'

Second machine:
uname -a
Linux debian 2.6.38.3-linode32 #1 SMP Thu Apr 21 20:08:53 UTC 2011 i686 
GNU/Linux

Python 2.6.6 (r266:84292, Dec 27 2010, 00:02:40) 
[GCC 4.4.5] on linux2

feedparser.__version__
'5.0.1'

Please provide any additional information below.

Just to check if the problem is in python2.6, I installed python2.7 from debian 
testing. But the result was the same. Moreover, I wgeted this feed on those two 
servers and checked their md5sums. Hashes were identical. Sorry if my 
explanation is clumsy. I'm not native English. But i hope logs are 
self-explanatory.

Let me know if I can provide more information.

Original issue reported on code.google.com by fuxt...@gmail.com on 21 May 2011 at 9:41

GoogleCodeExporter commented 9 years ago
Issue 279 has been merged into this issue.

Original comment by kurtmckee on 22 May 2011 at 5:43

GoogleCodeExporter commented 9 years ago
Although I'm not in front of my computer right now and thus can't test what 
feedparser is doing, I did view the feed source code using Firefox, and it 
appears none of the elements that you're seeing on the first machine are 
actually present in the feed. I've attached a copy of the feed that the server 
sent to Firefox.

My guess is that this isn't a feedparser issue, but when I'm back home and have 
time I'll look into this further. In the meantime, please download the feed 
from both servers and attach both copies to this report. Thanks!

Original comment by kurtmckee on 22 May 2011 at 5:58

Attachments:

GoogleCodeExporter commented 9 years ago
Oki-doki, herein attached the same feed downloaded from two different machines.

Original comment by fuxt...@gmail.com on 22 May 2011 at 8:58

Attachments:

GoogleCodeExporter commented 9 years ago
Gosh, I'm so sorry for confusing everyone and myself. I mentioned the wrong 
feed in the first post.

Here's the right one:
http://www.savemore.com/rss/affiliate/179208

Again, sorry for the mess. I work with lots of feeds lately. Attached *feed.xml 
file in previous post are correct.

Original comment by fuxt...@gmail.com on 22 May 2011 at 9:27

GoogleCodeExporter commented 9 years ago
You're surely running a modified version of the feedparser code, despite the 
version number it's claiming to be. I've downloaded what appear to be current 
packages from both the Debian and Arch package sites (links below) and they're 
identical.

How did you install feedparser on the Debian system? Do you get the same 
results if you simply download and extract feedparser from this site or PyPI?

http://www.archlinux.org/packages/extra/any/python2-feedparser/
http://packages.debian.org/sid/python-feedparser

Original comment by kurtmckee on 29 May 2011 at 6:50

GoogleCodeExporter commented 9 years ago

Original comment by kurtmckee on 29 May 2011 at 6:18

GoogleCodeExporter commented 9 years ago
Most probably you are right. I remember tinkering with feedparser's code some 
time ago, but I don't actually remember leaving some changes. If I recall it 
right, the problem was something about tags and categories in one particular 
rss feed.

Anyway, after reinstalling feedparser with pip everything works fine. So, 
stupid me. Thank you, kurtmckee. Issue closed, I guess.

Original comment by fuxt...@gmail.com on 31 May 2011 at 4:39

GoogleCodeExporter commented 9 years ago
Super!

Original comment by kurtmckee on 31 May 2011 at 5:20