slurdge / goeland

An alternative to rss2email written in golang with many filters
MIT License
174 stars 11 forks source link

results from Nitter feeds don't include the tweet author #35

Open kylrth opened 2 years ago

kylrth commented 2 years ago

With the following pipe definition:

[sources.lexfridman]
url = "https://nitter.nl/lexfridman/search/rss?f=tweets&e-replies=on"
type = "feed"
filters = ["unseen", "links", "includelink", "digest"]

I get an email that looks like this:

As you can see, the tweet body appears as both the title and the content. I'd ideally like the title to be the tweet author name (which may be different from @lexfridman in the case of retweets or if I'm making a digest of multiple Twitter users), and then let the body be the same.

I know you've got that replace filter for simple text manipulation in the body, but have you got any ideas about more complex field manipulation that might make it possible to insert the <dc:creator> tag into the title or something like that?

slurdge commented 2 years ago

I see, let me check how I could do something along those lines.

kylrth commented 2 years ago

I'm willing to implement if you'd like help!

On Tue, May 24, 2022, 17:08 slurdge @.***> wrote:

I see, let me check how I could do something along those lines.

— Reply to this email directly, view it on GitHub https://github.com/slurdge/goeland/issues/35#issuecomment-1136433841, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABGPUQRUJG2ZPQEHOLL6WCTVLVAMLANCNFSM5WZTFAPA . You are receiving this because you authored the thread.Message ID: @.***>

slurdge commented 2 years ago

Thanks. I've looked at the RSS feed given by nitter and it's not that trivial. We could create a special 'nitter' filter (as I don't believe Twitter give native RSS) feed, but that seems a bit contrived.

The real issue here would be to be able to put the dc:creator field inside the title field. First I have to see if it's exposed by the RSS library, maybe it's specific to some feeds and not always present. Secondly, it would be weird to have a filter that does this but does not allow to do it for description etc.

The "good" solution IMHO would be to have the long term script solution, where I would provide a way to write simple manipulations of the feed, where it should be as simple as:

item.Title = item.Author

but we are not there yet. I think I will do a "authorastitle" (or better name) filter and remove it when we have a better solution.

slurdge commented 2 years ago

Hello, I didn't forget about this feature but still haven't found a satisfying way of implementing it.

kylrth commented 2 years ago

Yeah no worries, I wouldn't want you to implement a hacky solution specific to my particular request.

panigrc commented 1 year ago

@slurdge suggestion: How about a field mapper, where we can define in the toml which fields from the rss will be mapped to which goeland fields ?

slurdge commented 1 year ago

That's a good idea, however the fact is not all feed (let's say imgur source for example) would have all fields. I guess this is also something that would be better tailored once I have a small scripting engine, as this would be trivial to do so with it.

Still I keep the suggestion, maybe a filter with a simple syntax would fit the bill