RSS-Bridge / rss-bridge

The RSS feed for websites missing it
https://rss-bridge.org/bridge01/
The Unlicense
6.99k stars 1.02k forks source link

XpathBridge not retrieving date or image #3994

Closed royaldunlin closed 3 months ago

royaldunlin commented 3 months ago

I'm attempting to use XpathBridge on a news site, and while it is working for most elements, the picture and date are not being populated. I suspect the date is failing because of a trailing ' -' in the filed, but I'm not sure how I would remove that.

For the picture I'm unsure as to why it's not working.

I also notice that no UID is created. Is this because there's no date?

Screenshot 2024-03-03 at 8 21 02 PM

Page URL: https://www.shepherdstownchronicle.com/news/ Item selector: //main[@id='category']//article Item Title Selector: ./a/h1/text() Item Description selector: ./p/text() Item URL selector: ./a/@href Item Date selector: ./time/p/text() Item image selector: ./div[1]/div[2]/a/img/@src Item category selector: ./time/p/a/text()

<article>
 <div class="full_category_photo_wrap">
 <div class="cat_article_image_blur"></div>
 <div class="cat_article_image"><a href="https://www.shepherdstownchronicle.com/news/2024/03/01/shepherdstown-community-club-to-raise-the-roof-with-fundraisers-for-historic-building/"><img class="lazy loaded" data-src="https://ogden_images.s3.amazonaws.com/www.shepherdstownchronicle.com/images/2024/02/28010553/20240201_135638-667x500.jpg" src="https://ogden_images.s3.amazonaws.com/www.shepherdstownchronicle.com/images/2024/02/28010553/20240201_135638-667x500.jpg" data-was-processed="true"></a></div>
</div>
<a href="https://www.shepherdstownchronicle.com/news/2024/03/01/shepherdstown-community-club-to-raise-the-roof-with-fundraisers-for-historic-building/"><h1>Shepherdstown Community Club to ‘Raise the Roof’ with fundraisers for historic building</h1></a>
<time><p>March 1, 2024 - <a style="font-family:Georgia, serif; font-weight:normal; color:#23384e; font-size:14px;" href="https://www.shepherdstownchronicle.com/news/local-news/"> Local News</a></p></time>
<p>SHEPHERDSTOWN — A year ago, the Shepherdstown Community Club (SCC) hired a building inspector to take a look at one of its historic properties, the War Memorial Building.
According to SCC President Jennifer Wabnitz, the inspection results identified two areas needing attention.
“The ...</p>
<div style="clear:both"></div>
</article>
Array
(
    [name] => News | News, Sports, Jobs - Shepherdstown Chronicle
    [uri] => https://www.shepherdstownchronicle.com/news/
    [icon] => https://github.com/rss-bridge/rss-bridge/favicon.ico
    [donationUri] => 
    [items] => Array
        (

            [0] => Array
                (
                    [uri] => https://www.shepherdstownchronicle.com/news/2024/03/01/shepherdstown-community-club-to-raise-the-roof-with-fundraisers-for-historic-building/
                    [title] => Shepherdstown Community Club to ‘Raise the Roof’ with fundraisers for historic building
                    [timestamp] => 
                    [author] => 
                    [content] => SHEPHERDSTOWN — A year ago, the Shepherdstown Community Club (SCC) hired a building inspector to take a look at one of its historic properties, the War Memorial Building.

According to SCC President Jennifer Wabnitz, the inspection results identified two areas needing attention.

“The ...
                    [enclosures] => Array
                        (
                        )

                    [categories] => Array
                        (
                            [0] => Local News
                        )

                    [uid] => 
                )
      )

)