cjlee112 / spnet

selected papers network web engine
http://thinking.bioinformatics.ucla.edu/2011/07/02/open-peer-review-by-a-selected-papers-network/
GNU General Public License v2.0
40 stars 11 forks source link

Request: Add WordPress support #71

Open ThomasA opened 11 years ago

ThomasA commented 11 years ago

Once you get around to it, support for WordPress blogs would also be very welcome - both wordpress.com and self-hosted WP blogs.

dcernst commented 10 years ago

I'd like to see this, as well.

cjlee112 commented 10 years ago

@johncarlosbaez @dcernst @ThomasA Adding basic WordPress support is pretty easy, in the basic sense of retrieving the RSS feed and applying our standard paper ID + hashtag indexing to it. Of course, there are many possible extensions people might want. For example, rather than having to re-label their posts with #spnetwork and our standard paper IDs etc., people might want us to process old posts by recognizing arXiv URLs, use their existing WordPress tags etc.

So can we start with just applying our standard indexing, then later add extra indexing capabilities?

pkra commented 10 years ago

You could use the mathblogging.org APIs to index what we have in our database.

johncarlosbaez commented 10 years ago

"Basic" Wordpress support would already be nice, so if it's easy, please do it!

But...

Since I've written 400 posts on my Wordpress blog, I don't want to manually go through them all, find which ones refer to papers on the arXiv, etc. Someone smarter than me might be able to automate all this... but since the posts are stored not on my computer but on some Wordpress website that I only know how to interact with in limited ways, I don't know how to run a program on all my old blog entries and change them all.

If someone can figure out how to do this, I could try it. But being lazy, I'd be even happier if someone could search all Wordpress posts on a given blog with links to papers on the arXiv and create spnetwork links to those blog entries. Not just my blog but also many others could profit from this. In the world of math, some Wordpress blogs that stand out are Terry Tao's, Tim Gowers', and the Secret Blogging Seminar.

I also have 319 issues of This Week's Finds, many of which discuss papers on the arXiv. People in math and physics say these are a useful resource. They'd be even more useful if a program could go through them and create spnetwork links to all the arXiv paper URLs. They're all here:

http://math.ucr.edu/home/baez/twf_html/

cjlee112 commented 10 years ago

@johncarlosbaez Obviously if there's a straightforward rule by which an algorithm can index your archived posts, we should do it that way rather than asking people to retag old posts by hand. What would accomplish this for you? E.g. will automatically recognizing arXiv URLs accomplish most of what you need? We need to get specific now.

johncarlosbaez commented 10 years ago

For both blog posts and This Week's Finds, I think automatically recognizing arXiv URLs and getting spnetwork to link to the post in question is all we need or anyone can easily do. (It seems very hard to find and excerpt the portion of the post that actually discusses the arXiv paper.)

To do this right, we need to be aware of a number of old-style variant arXiv URLs, like these:

http://arxiv.org/abs/hep-th/9409044

http://arxiv.org/abs/gr-qc/9404059

and, truly old-school:

http://xxx.lanl.gov/abs/hep-th/9312037

http://xxx.lanl.gov/abs/gr-qc/9401010

cjlee112 commented 10 years ago

@pkra the mathblogging.org / scienceseeker API looks very cool, e.g. the has-citation search clearly is very useful for us. We can plug this in pretty easily. Perhaps we could talk some time so I can learn quickly about how we should index author identities, tags, etc.?

cjlee112 commented 10 years ago

@johncarlosbaez OK, that's all reasonably straightforward. How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?

pkra commented 10 years ago

@cjlee112 sure thing.

johncarlosbaez commented 10 years ago

Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"

I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.

ThomasA commented 10 years ago

Could spnet be made to let blog authors sort these things out manually after import from a blog, if they wish to?

johncarlosbaez notifications@github.com skrev:

Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"

I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.


Reply to this email directly or view it on GitHub: https://github.com/cjlee112/spnet/issues/71#issuecomment-25189245

semorrison commented 10 years ago

@Thomas, I'm sure that could be done, but it doesn't seem particularly high priority for now.

On Fri, Sep 27, 2013 at 4:12 AM, Thomas Arildsen notifications@github.comwrote:

Could spnet be made to let blog authors sort these things out manually after import from a blog, if they wish to?

johncarlosbaez notifications@github.com skrev:

Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"

I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.


Reply to this email directly or view it on GitHub: https://github.com/cjlee112/spnet/issues/71#issuecomment-25189245

— Reply to this email directly or view it on GitHubhttps://github.com/cjlee112/spnet/issues/71#issuecomment-25190173 .

ketch commented 10 years ago

Regarding the question of how to handle multiple references in a single post, see #80 .