lairdshaw / fups

FUPS: Forum user-post scraper
GNU Affero General Public License v3.0
21 stars 9 forks source link

regex issue with forum_page_topicids #4

Closed TiloGit closed 5 years ago

TiloGit commented 5 years ago

issue on my target:

<a title="Posted: Mon zzz 2z, 201z 10:26 am" href="./viewtopic.php?f=240&amp;t=214" class="topictitle">zzz is shutting down November 1, 2018</a>
<a title="Posted: Wed zzz 1z, 201z 2:38 pm" href="./viewtopic.php?f=240&amp;t=2111" class="topictitle">Planned Changes zzz</a>
<a title="Posted: Thu zzz 0z, 201z 9:24 am" href="./viewtopic.php?f=240&amp;t=219" class="topictitle">List zzz</a>

this change one worked: 'forum_page_topicids' => '(\\s+href="\\./viewtopic\\.php\\?f=\\d+&amp;t=(\\d+)"\\s+class="topictitle">)',

lairdshaw commented 5 years ago

Fixed. Thank you for your report.