kurtmckee / feedparser

Parse feeds in Python
https://feedparser.readthedocs.io/en/latest/
Other
1.88k stars 335 forks source link

Incomplete parsing with Python 3 #164

Closed cristianocca closed 5 years ago

cristianocca commented 5 years ago

Using Python 3 (3.6) and feedparser 5.2.1 under ubuntu.

I'm trying to parse a feed that for some reason started to fail recently. After digging a bit, it turns out the failures are because some entries are not being parsed properly and everything after that ends incomplete (i.e., the resulting dict has link and other atributes).

The feed I'm parsing is the following: data = feedparser.parse('https://thn.li/rss.php')

Since the feed might change from the time you read this, below are the raw text responses, and the parsed dicts.

-- Parsed dict (from data['entries']) --


{'title': 'Telegram Gained 3 Million New Users During WhatsApp, Facebook Outage', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'Telegram Gained 3 Million New Users During WhatsApp, Facebook Outage'}, 'summary': 'WhatsApp, Facebook, and Instagram faced a widespread outage yesterday with users from around the world reporting issues with sending messages on WhatsApp and Messenger, posting feeds on Facebook and accessing other features on the three Facebook-owned platforms. While the outage was quite troubling...', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'WhatsApp, Facebook, and Instagram faced a widespread outage yesterday with users from around the world reporting issues with sending messages on WhatsApp and Messenger, posting feeds on Facebook and accessing other features on the three Facebook-owned platforms. While the outage was quite troubling...'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/SByATQr38uI/encrypted-telegram-messenger.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-J1QIdWsKcUo/XIowhdP1JZI/AAAAAAAAzhE/-vYg6sDHsMc5YiJ0e4-K8MrwbjWuRuX2gCLcBGAs/s1600-e100/encrypted-telegram-messenger.jpg', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/SByATQr38uI/encrypted-telegram-messenger.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-3258920075678632403', 'guidislink': False, 'published': 'Thu, 14 Mar 2019 06:50:21 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=14, tm_hour=10, tm_min=50, tm_sec=21, tm_wday=3, tm_yday=73, tm_isdst=0), 'authors': [{'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Mohit Kumar)', 'author_detail': {'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}}

{'title': 'New WordPress Flaw Lets Unauthenticated Remote Attackers Hack Sites', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'New WordPress Flaw Lets Unauthenticated Remote Attackers Hack Sites'}, 'summary': "If for some reason your WordPress-based website has not yet been automatically updated to the latest version 5.1.1, it's highly recommended to immediately upgrade it before hackers could take advantage of a newly disclosed vulnerability to hack your website. Simon Scannell, a researcher at RIPS Tec...", 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': "If for some reason your WordPress-based website has not yet been automatically updated to the latest version 5.1.1, it's highly recommended to immediately upgrade it before hackers could take advantage of a newly disclosed vulnerability to hack your website. Simon Scannell, a researcher at RIPS Tec..."}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/Ixkczn3sNdo/hack-wordpress-websites.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-YV90ZKWuy0I/XIohdMBQ3kI/AAAAAAAAzg4/6FHZxlaG4IYeoxPxlGexPYYsdMFARtyrgCLcBGAs/s1600-e100/wordpress-hacking-exploit-min.jpg', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/Ixkczn3sNdo/hack-wordpress-websites.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-3149589378230119499', 'guidislink': False, 'published': 'Thu, 14 Mar 2019 05:41:21 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=14, tm_hour=9, tm_min=41, tm_sec=21, tm_wday=3, tm_yday=73, tm_isdst=0), 'authors': [{'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Swati Khandelwal)', 'author_detail': {'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}}

{'title': "Zero-Day Flaws in Counter-Strike 1.6 Let Malicious Servers Hack Gamers' PCs", 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': "Zero-Day Flaws in Counter-Strike 1.6 Let Malicious Servers Hack Gamers' PCs"}, 'summary': "If you are a Counter-Strike gamer, then beware, because 39% of all existing Counter-Strike 1.6 game servers available online are malicious that have been set-up to remotely hack gamers' computers. A team of cybersecurity researchers at Dr. Web has disclosed that an attacker has been using malicious...", 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': "If you are a Counter-Strike gamer, then beware, because 39% of all existing Counter-Strike 1.6 game servers available online are malicious that have been set-up to remotely hack gamers' computers. A team of cybersecurity researchers at Dr. Web has disclosed that an attacker has been using malicious..."}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/hPNtrFCCmBw/counter-strike-game-servers.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-gkSNFY9mN3I/XIn9oNkbn9I/AAAAAAAAzgk/9YWetIjzoOogtuNr6bU6XGBUhqSHHPWpQCLcBGAs/s1600-e100/Counter-Strike-hacks.jpg', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/hPNtrFCCmBw/counter-strike-game-servers.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-8648906939331791193', 'guidislink': False, 'published': 'Thu, 14 Mar 2019 03:10:00 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=14, tm_hour=7, tm_min=10, tm_sec=0, tm_wday=3, tm_yday=73, tm_isdst=0), 'authors': [{'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Mohit Kumar)', 'author_detail': {'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}}

{'title': 'AWS Certification Training Courses – Get 2019 Bundle @ 96% OFF', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'AWS Certification Training Courses – Get 2019 Bundle @ 96% OFF'}, 'summary': 'With countless web apps and online services launching every day, there is an increasing demand for cloud developers. This exciting niche is due to grow rapidly over the next few years, and the paycheck should follow suit. If you want to build a career in this lucrative niche, it pays to know AWS ...', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'With countless web apps and online services launching every day, there is an increasing demand for cloud developers. This exciting niche is due to grow rapidly over the next few years, and the paycheck should follow suit. If you want to build a career in this lucrative niche, it pays to know AWS ...'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/n9w13jGLcUw/aws-certification-training.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-FUIccE9oYYY/XIZCr3h3N3I/AAAAAAAAzes/iH2uYLKsQtoD2WSEeuF7NonYkqLwPVrFgCLcBGAs/s1600-e100/AWS-certification-training-course.jpg', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/n9w13jGLcUw/aws-certification-training.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-8308074854618794428', 'guidislink': False, 'published': 'Wed, 13 Mar 2019 10:11:24 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=13, tm_hour=14, tm_min=11, tm_sec=24, tm_wday=2, tm_yday=72, tm_isdst=0), 'authors': [{'name': 'Exclusive Deals', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Exclusive Deals)', 'author_detail': {'name': 'Exclusive Deals', 'email': 'info@thehackernews.com'}}

{'title': 'Firefox Send — Free Encrypted File Transfer Service Now Available For All', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'Firefox Send — Free Encrypted File Transfer Service Now Available For All'}, 'summary': 'Mozilla has made it easy for you to share large files securely and privately with whomever you want, eliminating the need to depend upon less secure free third-party services or file upload tools that burn a hole in your pocket. Mozilla has finally launched its free, end-to-end encrypted file-trans...', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'Mozilla has made it easy for you to share large files securely and privately with whomever you want, eliminating the need to depend upon less secure free third-party services or file upload tools that burn a hole in your pocket. Mozilla has finally launched its free, end-to-end encrypted file-trans...'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/dN6VHLOqcM4/firefox-send-encrypted-file-share.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-M4I8IQpcJq0/XIi9rUgXyUI/AAAAAAAAzgM/Ddh6s4LxU24zYDVlPvYLIbOI6iSsqchHwCLcBGAs/s1600-e100/firefox-send-encrypted-file-share.png', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/dN6VHLOqcM4/firefox-send-encrypted-file-share.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-5689667980183526488', 'guidislink': False, 'published': 'Wed, 13 Mar 2019 04:40:40 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=13, tm_hour=8, tm_min=40, tm_sec=40, tm_wday=2, tm_yday=72, tm_isdst=0), 'authors': [{'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Swati Khandelwal)', 'author_detail': {'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}}

{'title': 'Microsoft Releases Patches for 64 Flaws — Two Under Active Attack', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'Microsoft Releases Patches for 64 Flaws — Two Under Active Attack'}, 'summary': 'It\'s time for another batch of "Patch Tuesday" updates from Microsoft. Microsoft today released its March 2019 software updates to address a total of 64 CVE-listed security vulnerabilities in its Windows operating systems and other products, 17 of which are rated critical, 45 important, one moderat...', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'It\'s time for another batch of "Patch Tuesday" updates from Microsoft. Microsoft today released its March 2019 software updates to address a total of 64 CVE-listed security vulnerabilities in its Windows operating systems and other products, 17 of which are rated critical, 45 important, one moderat...'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/JzdTsywTggA/microsoft-windows-security-updates.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-PKdiogHzFeI/XIf0vqexEZI/AAAAAAAAzfs/p4e6mA-R0002aWC4T5QjStHpVJq7nTecACLcBGAs/s1600-e100/microsoft-windows-security-updates.jpg', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/JzdTsywTggA/microsoft-windows-security-updates.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-8917473220512557652', 'guidislink': False, 'published': 'Wed, 13 Mar 2019 07:01:52 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=13, tm_hour=11, tm_min=1, tm_sec=52, tm_wday=2, tm_yday=72, tm_isdst=0), 'authors': [{'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Mohit Kumar)', 'author_detail': {'name': 'Mohit Kumar', 'email': 'info@thehackernews.com'}}

{'title': 'Adobe Releases Patches for Critical Flaws in Photoshop CC and Digital Edition', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'Adobe Releases Patches for Critical Flaws in Photoshop CC and Digital Edition'}, 'summary': 'Adobe users would feel lighter this month, as Adobe has released patches for just two security vulnerability in its March Security Update. The company today released its monthly security updates to address two critical arbitrary code execution vulnerabilities—one in Adobe Photoshop CC and another i...', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'Adobe users would feel lighter this month, as Adobe has released patches for just two security vulnerability in its March Security Update. The company today released its monthly security updates to address two critical arbitrary code execution vulnerabilities—one in Adobe Photoshop CC and another i...'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'http://feedproxy.google.com/~r/TheHackersNews/~3/xR-7QrrUy70/adobe-software-updates.html'}, {'length': '12216320', 'type': 'image/jpeg', 'href': 'https://1.bp.blogspot.com/-Jx-dqugynac/XIfch0W0ItI/AAAAAAAAzfk/M8hg-rV8D9QgDXk2XqigX8Efh8sCzjpJgCLcBGAs/s1600-e100/adobe-software-updates.png', 'rel': 'enclosure'}], 'link': 'http://feedproxy.google.com/~r/TheHackersNews/~3/xR-7QrrUy70/adobe-software-updates.html', 'id': 'tag:blogger.com,1999:blog-4802841478634147276.post-771537093114302379', 'guidislink': False, 'published': 'Tue, 12 Mar 2019 12:22:15 -0400', 'published_parsed': time.struct_time(tm_year=2019, tm_mon=3, tm_mday=12, tm_hour=16, tm_min=22, tm_sec=15, tm_wday=1, tm_yday=71, tm_isdst=0), 'authors': [{'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}], 'author': 'info@thehackernews.com (Swati Khandelwal)', 'author_detail': {'name': 'Swati Khandelwal', 'email': 'info@thehackernews.com'}}

**{'title': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts", 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts"}}**

**This one is incomplete**

--- Raw response ---

<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>The Hacker News</title><link>https://thehackernews.com</link><description>Most trusted, widely acknowledged news source for cybersecurity researchers, hackers, technologists, enthusiasts and nerds.</description><language>en-us</language><item><title>Telegram Gained 3 Million New Users During WhatsApp, Facebook Outage</title><description>WhatsApp, Facebook, and Instagram faced a widespread outage yesterday with users from around the world reporting issues with sending messages on WhatsApp and Messenger, posting feeds on Facebook and accessing other features on the three Facebook-owned platforms. While the outage was quite troubling...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/SByATQr38uI/encrypted-telegram-messenger.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3258920075678632403</guid><pubDate>Thu, 14 Mar 2019 06:50:21 -0400</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-J1QIdWsKcUo/XIowhdP1JZI/AAAAAAAAzhE/-vYg6sDHsMc5YiJ0e4-K8MrwbjWuRuX2gCLcBGAs/s1600-e100/encrypted-telegram-messenger.jpg" length="12216320" type="image/jpeg"/></item><item><title>New WordPress Flaw Lets Unauthenticated Remote Attackers Hack Sites</title><description>If for some reason your WordPress-based website has not yet been automatically updated to the latest version 5.1.1, it\'s highly recommended to immediately upgrade it before hackers could take advantage of a newly disclosed vulnerability to hack your website. Simon Scannell, a researcher at RIPS Tec...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/Ixkczn3sNdo/hack-wordpress-websites.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3149589378230119499</guid><pubDate>Thu, 14 Mar 2019 05:41:21 -0400</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-YV90ZKWuy0I/XIohdMBQ3kI/AAAAAAAAzg4/6FHZxlaG4IYeoxPxlGexPYYsdMFARtyrgCLcBGAs/s1600-e100/wordpress-hacking-exploit-min.jpg" length="12216320" type="image/jpeg"/></item><item><title>Zero-Day Flaws in Counter-Strike 1.6 Let Malicious Servers Hack Gamers&#039; PCs</title><description>If you are a Counter-Strike gamer, then beware, because 39% of all existing Counter-Strike 1.6 game servers available online are malicious that have been set-up to remotely hack gamers\' computers. A team of cybersecurity researchers at Dr. Web has disclosed that an attacker has been using malicious...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/hPNtrFCCmBw/counter-strike-game-servers.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8648906939331791193</guid><pubDate>Thu, 14 Mar 2019 03:10:00 -0400</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-gkSNFY9mN3I/XIn9oNkbn9I/AAAAAAAAzgk/9YWetIjzoOogtuNr6bU6XGBUhqSHHPWpQCLcBGAs/s1600-e100/Counter-Strike-hacks.jpg" length="12216320" type="image/jpeg"/></item><item><title>AWS Certification Training Courses – Get 2019 Bundle @ 96% OFF</title><description>With countless web apps and online services launching every day, there is an increasing demand for cloud developers. This exciting niche is due to grow rapidly over the next few years, and the paycheck should follow suit. If you want to build a career in this lucrative niche, it pays to know AWS ...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/n9w13jGLcUw/aws-certification-training.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8308074854618794428</guid><pubDate>Wed, 13 Mar 2019 10:11:24 -0400</pubDate><author>info@thehackernews.com (Exclusive Deals)</author><enclosure url="https://1.bp.blogspot.com/-FUIccE9oYYY/XIZCr3h3N3I/AAAAAAAAzes/iH2uYLKsQtoD2WSEeuF7NonYkqLwPVrFgCLcBGAs/s1600-e100/AWS-certification-training-course.jpg" length="12216320" type="image/jpeg"/></item><item><title>Firefox Send — Free Encrypted File Transfer Service Now Available For All</title><description>Mozilla has made it easy for you to share large files securely and privately with whomever you want, eliminating the need to depend upon less secure free third-party services or file upload tools that burn a hole in your pocket. Mozilla has finally launched its free, end-to-end encrypted file-trans...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/dN6VHLOqcM4/firefox-send-encrypted-file-share.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-5689667980183526488</guid><pubDate>Wed, 13 Mar 2019 04:40:40 -0400</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-M4I8IQpcJq0/XIi9rUgXyUI/AAAAAAAAzgM/Ddh6s4LxU24zYDVlPvYLIbOI6iSsqchHwCLcBGAs/s1600-e100/firefox-send-encrypted-file-share.png" length="12216320" type="image/jpeg"/></item><item><title>Microsoft Releases Patches for 64 Flaws — Two Under Active Attack</title><description>It\'s time for another batch of "Patch Tuesday" updates from Microsoft. Microsoft today released its March 2019 software updates to address a total of 64 CVE-listed security vulnerabilities in its Windows operating systems and other products, 17 of which are rated critical, 45 important, one moderat...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/JzdTsywTggA/microsoft-windows-security-updates.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8917473220512557652</guid><pubDate>Wed, 13 Mar 2019 07:01:52 -0400</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-PKdiogHzFeI/XIf0vqexEZI/AAAAAAAAzfs/p4e6mA-R0002aWC4T5QjStHpVJq7nTecACLcBGAs/s1600-e100/microsoft-windows-security-updates.jpg" length="12216320" type="image/jpeg"/></item>

<item><title>Adobe Releases Patches for Critical Flaws in Photoshop CC and Digital Edition</title><description>Adobe users would feel lighter this month, as Adobe has released patches for just two security vulnerability in its March Security Update. The company today released its monthly security updates to address two critical arbitrary code execution vulnerabilities—one in Adobe Photoshop CC and another i...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/xR-7QrrUy70/adobe-software-updates.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-771537093114302379</guid><pubDate>Tue, 12 Mar 2019 12:22:15 -0400</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-Jx-dqugynac/XIfch0W0ItI/AAAAAAAAzfk/M8hg-rV8D9QgDXk2XqigX8Efh8sCzjpJgCLcBGAs/s1600-e100/adobe-software-updates.png" length="12216320" type="image/jpeg"/></item>

**<item><title>Cynet is offering unhappy competitors&#039; customers a refund for the time remaining on existing contracts</title><description>Cynet goes head-to-head with CrowdStrike, DarkTrace, Cylance, Carbon Black & Symantec, offering their unhappy customers a refund for the time remaining on their existing contracts. Cynet, the automated threat discovery and mitigation platform was built to address the advanced threats that AV and Fi...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/2kBjOTNiTks/cynet-endpoint-security.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-799735885797893608</guid><pubDate>Tue, 12 Mar 2019 09:12:54 -0400</pubDate><author>info@thehackernews.com (Exclusive Deals)</author><enclosure url="https://1.bp.blogspot.com/-_YPubP4uoCw/XIev0y7WlDI/AAAAAAAAzfc/l10J_ulQUS8Jxdr729Up4My_SZC2fiDvwCLcBGAs/s1600-e100/cynet-endpoint-security-min.png" length="12216320" type="image/jpeg"/></item>**

<item><title>Windows 10 Now Automatically Uninstalls Updates That Cause Problems</title><description>Do you always think twice before installing Windows updates worrying that it could crash your system or leave it non-working the day after Patch Tuesdays? Don\'t worry. Microsoft has addressed this issue by adding a safety measure that would from now onwards automatically uninstall buggy software u...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/GeTNufEhpBw/windows-buggy-updates.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8213877463907617129</guid><pubDate>Thu, 14 Mar 2019 01:37:56 -0400</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-m1SpQgbOlpA/XIdwhTf9jvI/AAAAAAAAzfI/ZsGaUH5S5uA8XEOH962m3e0m7BvRT99XQCLcBGAs/s1600-e100/microsoft-windows-software-updates.png" length="12216320" type="image/jpeg"/></item><item><title>F5 Networks Acquires NGINX For $670 Million</title><description>One of the most important software companies NGINX, which is also behind the very popular open-source web server of the same name, is being acquired by its rival, F5 Networks, in a deal valued at about $670 million. While NGINX is not a name that you have ever heard of, the reality is that you use ...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/vgzdSUS2alE/f5-networks-acquires-nginx.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-1072511406385194421</guid><pubDate>Tue, 12 Mar 2019 03:17:40 -0400</pubDate><author>info@thehackernews.com (Wang Wei)</author><enclosure url="https://1.bp.blogspot.com/-S88Qe0CbwZ4/XIdaEC81DMI/AAAAAAAAzfA/hrpD5tBVvuAM0EGu6wuHPzTUuMiFXwzlACLcBGAs/s1600-e100/f5-networks-acquires-nginx.jpg" length="12216320" type="image/jpeg"/></item><item><title>BEWARE – New &#039;Creative&#039; Phishing Attack You Really Should Pay Attention To</title><description>A cybersecurity researcher who last month warned of a creative phishing campaign has now shared details of a new but similar attack campaign with The Hacker News that has specifically been designed to target mobile users. Just like the previous campaign, the new phishing attack is also based on the...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/G2AaKS4izps/ios-mobile-phishing-attack.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-6604025958423180181</guid><pubDate>Mon, 11 Mar 2019 11:46:20 -0400</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-44UOno5Ez34/XIZ-sJ0pYnI/AAAAAAAAze4/AQdn1-h__KQoPuA60rqJLltcOzZunTFrACLcBGAs/s1600-e100/mobile-phishing-attack.jpg" length="12216320" type="image/jpeg"/></item><item><title>Severe Flaw Disclosed In StackStorm DevOps Automation Software</title><description>A security researcher has discovered a severe vulnerability in the popular, open source event-driven platform StackStorm that could allow remote attackers to trick developers into unknowingly execute arbitrary commands on targeted services. StackStorm, aka "IFTTT for Ops," is a powerful event-drive...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/LW6DpTurryQ/stackstorm-security-vulnerability.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-422023839504248671</guid><pubDate>Mon, 11 Mar 2019 06:16:46 -0400</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-O3e2NMsghGc/XIYoKAKtUiI/AAAAAAAAzeg/TIdUVVdyw0clhacRniEIwooqEGCZxaRgACLcBGAs/s1600-e100/StackStorm-security-vulnerability.jpg" length="12216320" type="image/jpeg"/></item><item><title>Citrix Data Breach – Iranian Hackers Stole 6TB of Sensitive Data</title><description>Popular enterprise software company Citrix that provides services to the U.S. military, the FBI, many U.S. corporations, and various U.S. government agencies disclosed last weekend a massive data breach of its internal network by "international cyber criminals." Citrix said it was warned by the FBI...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/R9rkb3hklGY/citrix-data-breach.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-78591481585274562</guid><pubDate>Mon, 11 Mar 2019 03:52:16 -0400</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-hcCARqP_JwE/XIYS8mdVIXI/AAAAAAAAzeM/8sZE5E8lKEgOy4yzLDbNqxvsczQEh0OSwCLcBGAs/s1600-e100/citrix-software-data-breach-hacking-min.jpg" length="12216320" type="image/jpeg"/></item><item><title>New Google Chrome Zero-Day Vulnerability Found Actively Exploited in the Wild</title><description>You must update your Google Chrome immediately to the latest version of the web browsing application. Security researcher Clement Lecigne of Google\'s Threat Analysis Group discovered and reported a high severity vulnerability in Chrome late last month that could allow remote attackers to execute ar...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/BSH1fZNXszE/update-google-chrome-hack.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-6232820394768541122</guid><pubDate>Wed, 06 Mar 2019 04:52:57 -0500</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-hYeL9HSrkr8/XH-YRFnQiNI/AAAAAAAAzdU/qJcBEPSf6D0nh6oxWhcwvfROeWNxLorjgCLcBGAs/s1600-e100/chrome.png" length="12216320" type="image/jpeg"/></item><item><title>NSA Releases GHIDRA 9.0 — Free, Powerful Reverse Engineering Tool</title><description>The United States\' National Security Agency (NSA) today finally released GHIDRA version 9.0 for free, the agency’s home-grown classified software reverse engineering tool that agency experts have been using internally for over a decade to hunt down security bugs in software and applications. GHIDRA...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/IRmgYVqr5gI/ghidra-reverse-engineering-tool.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-6284780049135116607</guid><pubDate>Wed, 06 Mar 2019 10:53:42 -0500</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/--wW8sijUcRg/XH-El--DZpI/AAAAAAAAzdA/vEvwgFAlJr8WZ6Te-csqrEGl109z7udogCLcBGAs/s1600-e100/ghidra-reverse-engineering-tool.png" length="12216320" type="image/jpeg"/></item><item><title>Google Launches Backstory — A New Cyber Security Tool for Businesses</title><description>Google\'s one-year-old cybersecurity venture Chronicle today announced its first commercial product, called Backstory, a cloud-based enterprise-level threat analytics platform that has been designed to help companies quickly investigate incidents, pinpoint vulnerabilities and hunt for potential threa...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/EpyP7lsRjDo/backstory-cybersecurity-software.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-285756448745347778</guid><pubDate>Tue, 05 Mar 2019 03:13:52 -0500</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-8aPHt9GiaEk/XH4uQMan2WI/AAAAAAAAzcw/Cvuf2nplGWAqG_M7dTRW67C-NF_zL_hcgCLcBGAs/s1600-e100/google-backstory.png" length="12216320" type="image/jpeg"/></item><item><title>Researchers Link &#039;Sharpshooter&#039; Cyber Attacks to North Korean Hackers</title><description>Security researchers have finally, with "high confidence," linked a previously discovered global cyber espionage campaign targeting critical infrastructure around the world to a North Korean APT hacking group. Thanks to the new evidence collected by researchers after analyzing a command-and-control...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/OPSt5YdR6Xg/north-korea-hacking.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3999215995161811202</guid><pubDate>Mon, 04 Mar 2019 10:55:19 -0500</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-RozpUplJ0no/XH1KM2VhnpI/AAAAAAAAzcY/t2Ggze7dJBgkd4KM3iQZdVu46I-KN0b5wCLcBGAs/s1600-e100/north-korea-hacking-malware.png" length="12216320" type="image/jpeg"/></item><item><title>Google Discloses Unpatched &#039;High-Severity&#039; Flaw in Apple macOS Kernel</title><description>Cybersecurity researcher at Google\'s Project Zero division has publicly disclosed details and proof-of-concept exploit of a high-severity security vulnerability in macOS operating system after Apple failed to release a patch within 90 days of being notified. Discovered by Project Zero researcher Ja...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/Beyvk1Zcd2E/cybersecurity-macos-hacking.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-4511605765883260659</guid><pubDate>Mon, 04 Mar 2019 08:52:29 -0500</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-sE3ZtHehHYM/XH0o5TNvHuI/AAAAAAAAzcA/CjLlVWz-vkk5Cov0W_xM7u3hZW5ZmdiuQCLcBGAs/s1600-e100/macos-hacking.jpg" length="12216320" type="image/jpeg"/></item><item><title>Hackers Favorite CoinHive Cryptocurrency Mining Service Shutting Down</title><description>Coinhive, a notorious in-browser cryptocurrency mining service popular among cybercriminals, has announced that it will discontinue its services on March 8, 2019. Regular readers of The Hacker News already know how Coinhive\'s service helped cyber criminals earn hundreds of thousands of dollars by u...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/GJ7Je_QCLTA/cryptocurrency-mining-coinhive.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3966583891849594843</guid><pubDate>Thu, 28 Feb 2019 05:59:35 -0500</pubDate><author>info@thehackernews.com (Wang Wei)</author><enclosure url="https://1.bp.blogspot.com/-RiEx-AnUFlk/XHe6KTU13YI/AAAAAAAAzbY/l8F0kYJgBN4kGG5gJZTUnJ27V0-VKKAugCLcBGAs/s1600-e100/coinhive-cryptocurrency-miner.png" length="12216320" type="image/jpeg"/></item><item><title>Learn Ethical Hacking with 180 Hours of Training — 2019 Course Bundle</title><description>The world of cybersecurity is fast-paced and ever-changing. New attacks are unleashed every day, and companies around the world lose millions of dollars as a result. The only thing standing in the way of cybercrime is a small army of ethical hackers. These cybersecurity experts are employed to fin...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/T92C0mCzVoY/ethical-hacker-training.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8476452621173474101</guid><pubDate>Wed, 27 Feb 2019 08:51:58 -0500</pubDate><author>info@thehackernews.com (Exclusive Deals)</author><enclosure url="https://1.bp.blogspot.com/-6bFLF28Wvxc/XHaUg588fBI/AAAAAAAAAGU/USPKfrcXaLgzaOBfKGb92v-0T12CIaK9wCLcBGAs/s1600-e100/learn-hacking-training.jpg" length="12216320" type="image/jpeg"/></item><item><title>Severe Flaws in SHAREit Android App Let Hackers Steal Your Files</title><description>Security researchers have discovered two high-severity vulnerabilities in the SHAREit Android app that could allow attackers to bypass device authentication mechanism and steal files containing sensitive from a victim\'s device. With over 1.5 billion users worldwide, SHAREit is a popular file sharin...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/YZRTzEWj8_s/shareit-android-hacking.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3072478572440509613</guid><pubDate>Wed, 27 Feb 2019 08:22:45 -0500</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-a_oJNkYTMVw/XHaMTAYUYNI/AAAAAAAAzbM/piKwoXLPwrY_lFwHq6mdKMcjURCJ5EfkACLcBGAs/s1600-e100/Shareit-android-hack.png" length="12216320" type="image/jpeg"/></item><item><title>New Flaws Re-Enable DMA Attacks On Wide Range of Modern Computers</title><description>Security researchers have discovered a new class of security vulnerabilities that impacts all major operating systems, including Microsoft Windows, Apple macOS, Linux, and FreeBSD, allowing attackers to bypass protection mechanisms introduced to defend against DMA attacks. Known for years, Direct m...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/qsvp92rxj5I/thunderbolt-peripheral-dma-attacks.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-4480715417450142847</guid><pubDate>Wed, 27 Feb 2019 07:25:03 -0500</pubDate><author>info@thehackernews.com (Mohit Kumar)</author><enclosure url="https://1.bp.blogspot.com/-KQ1rUIKmJHM/XHZ-TFOu07I/AAAAAAAAzbA/BZL9AoZLTZMBf8qV-JwCA1LXUKuzg1dxwCLcBGAs/s1600-e100/thunderbolt-dma-attack.png" length="12216320" type="image/jpeg"/></item><item><title>Latest WinRAR Flaw Being Exploited in the Wild to Hack Windows Computers</title><description>It\'s not just the critical Drupal vulnerability that is being exploited by in the wild\xa0cybercriminals to attack vulnerable websites that have not yet applied patches already available by its developers, but hackers are also exploiting a critical WinRAR vulnerability that was also revealed last week....</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/Wa2j4xkSDOQ/winrar-hacking-exploit.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-3716722805286540855</guid><pubDate>Tue, 26 Feb 2019 08:45:43 -0500</pubDate><author>info@thehackernews.com (Wang Wei)</author><enclosure url="https://1.bp.blogspot.com/-QCBmfEVV684/XHU8rDneL_I/AAAAAAAAzZo/srIwzRFz2Ew1bKw4zSPfJsh2JRZMcEuLACLcBGAs/s1600-e100/winrar-malware.png" length="12216320" type="image/jpeg"/></item><item><title>Hackers Actively Exploiting Latest Drupal RCE Flaw Published Last Week</title><description>Cybercriminals have actively started exploiting an already patched security vulnerability in the wild to install cryptocurrency miners on vulnerable Drupal websites that have not yet applied patches and are still vulnerable. Last week, developers of the popular open-source content management system...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/K7g8FhHI_WM/drupal-hacking-exploit.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-8481051333278882418</guid><pubDate>Tue, 26 Feb 2019 08:56:58 -0500</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-xKnslw3FrcU/XHUzQvDxlZI/AAAAAAAAzZc/YMly7RLa0GceeyTeAmk9iQdfSBvtIi-jwCLcBGAs/s1600-e100/drupal-hacking-exploit.jpg" length="12216320" type="image/jpeg"/></item><item><title>Android Gets FIDO2 Certification—Now Supports Secure Passwordless Logins</title><description>Great news. If you have already installed the latest update of Google Play Services released earlier today, and your Android device is running Android version 7.0 Nougat or above—Congratulations! Your device is now FIDO2 Certified. Are you thinking… what the heck that actually means? It means, in...</description><link>http://feedproxy.google.com/~r/TheHackersNews/~3/R-dWB6qBJKg/android-fido2-password-security.html</link><guid isPermaLink="false">tag:blogger.com,1999:blog-4802841478634147276.post-4078854350195982743</guid><pubDate>Mon, 25 Feb 2019 12:49:14 -0500</pubDate><author>info@thehackernews.com (Swati Khandelwal)</author><enclosure url="https://1.bp.blogspot.com/-z2zdyl7dEtA/XHQqAgwJicI/AAAAAAAAzZU/DNQmIUUq43E7t-jDcAQudbh-9_Pf26abgCLcBGAs/s1600-e100/android-fido2-password-security.png" length="12216320" type="image/jpeg"/></item></channel></rss>
buhtz commented 5 years ago

For me it is unclear what the problem is here. start to fail is not enough information.

What do you exactly mean with some entries are not being parsed properly and everything after that ends incomplete. What are the parsing errors? What is incomplete/missing?

cristianocca commented 5 years ago

Sorry if the report was not clear, I was hoping the samples would be enough to highlight what's going on.

Basically, some characters or input in the source string (which I couldn't find) is causing the whole parsing to stop early, yielding incomplete results. There are no parsing errors, the parsing just "stops".

As you can see in the examples above, the last line of the parsed dict is incomplete (there's no description/summary). In fact, the actual parsing stops there, and the following feed entries are not included neither.

In order to reproduce the issue, you probably need to take the text from above (remove the ) since the original feed has changed and the issue no longer happens. This looks like it was a very specific case that was making the parser go nuts and is no longer happening.

buhtz commented 5 years ago

First of all please reformate your json and xml code blocks. wrap the lines. It is unreadable. Because of that it is still unclear what your problem is.

cristianocca commented 5 years ago

Can't seem to wrap and use code blocks at the same time on the editor. But really, the problem is simple, the two lines with ** ** are the conflicting ones.

First, the parsed result from:

Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contractsCynet goes head-to-head with CrowdStrike, DarkTrace, Cylance, Carbon Black & Symantec, offering their unhappy customers a refund for the time remaining on their existing contracts. Cynet, the automated threat discovery and mitigation platform was built to address the advanced threats that AV and Fi...http://feedproxy.google.com/~r/TheHackersNews/~3/2kBjOTNiTks/cynet-endpoint-security.htmltag:blogger.com,1999:blog-4802841478634147276.post-799735885797893608Tue, 12 Mar 2019 09:12:54 -0400info@thehackernews.com (Exclusive Deals)

Is parsed into: {'title': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts", 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts"}}

Which is clearly incomplete. Furthermore, any following item after the broken one is not parsed (doesn't make it to the parsed dict).

buhtz commented 5 years ago

Keep in mind that we try to help you in our free- and family-time. Just formated. But wrapped would be easier. Or upload the xml-file as attachment whatever saves our ressources.

First, the parsed result from: <title>Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts</title>Cynet goes head-to-head with CrowdStrike, DarkTrace, Cylance, Carbon Black & Symantec, offering their unhappy customers a refund for the time remaining on their existing contracts. Cynet, the automated threat discovery and mitigation platform was built to address the advanced threats that AV and Fi...http://feedproxy.google.com/~r/TheHackersNews/~3/2kBjOTNiTks/cynet-endpoint-security.htmltag:blogger.com,1999:blog-4802841478634147276.post-799735885797893608Tue, 12 Mar 2019 09:12:54 -0400[info@thehackernews.com](mailto:info@thehackernews.com) (Exclusive Deals)

Is parsed into: {'title': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts", 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': "Cynet is offering unhappy competitors' customers a refund for the time remaining on existing contracts"}}

Which is clearly incomplete. Furthermore, any following item after the broken one is not parsed (doesn't make it to the parsed dict).

buhtz commented 5 years ago

You wrote there are no errors? You know FeedParserDict.bozo_exception ?

There is a parsing error.

>>> a.bozo_exception
SAXParseException('not well-formed (invalid token)',)
>>> type(a.bozo_exception)
<class 'xml.sax._exceptions.SAXParseException'>

So you have to check for bozo_exceptions.

The issue can be closed.

cristianocca commented 5 years ago

Interesting. I really didn't see that the parsed results would also contain an exception (instead of throwing it). I guess that's why I never saw it. I'm sorry if I wasted your time.

buhtz commented 5 years ago

it is not "wasted" - you learned something.

kurtmckee commented 5 years ago

@cristianocca Back in the old, old days of feedparser, the original author made the decision to never throw exceptions. However, nowadays this breaks people's expectations.

It may be worthwhile to revisit this decision, as it affects people (like yourself) who would expect an exception to be raised if there was a dire problem. Thanks for reporting this!