Open ThomasA opened 11 years ago
I'd like to see this, as well.
@johncarlosbaez @dcernst @ThomasA Adding basic WordPress support is pretty easy, in the basic sense of retrieving the RSS feed and applying our standard paper ID + hashtag indexing to it. Of course, there are many possible extensions people might want. For example, rather than having to re-label their posts with #spnetwork and our standard paper IDs etc., people might want us to process old posts by recognizing arXiv URLs, use their existing WordPress tags etc.
So can we start with just applying our standard indexing, then later add extra indexing capabilities?
You could use the mathblogging.org APIs to index what we have in our database.
"Basic" Wordpress support would already be nice, so if it's easy, please do it!
But...
Since I've written 400 posts on my Wordpress blog, I don't want to manually go through them all, find which ones refer to papers on the arXiv, etc. Someone smarter than me might be able to automate all this... but since the posts are stored not on my computer but on some Wordpress website that I only know how to interact with in limited ways, I don't know how to run a program on all my old blog entries and change them all.
If someone can figure out how to do this, I could try it. But being lazy, I'd be even happier if someone could search all Wordpress posts on a given blog with links to papers on the arXiv and create spnetwork links to those blog entries. Not just my blog but also many others could profit from this. In the world of math, some Wordpress blogs that stand out are Terry Tao's, Tim Gowers', and the Secret Blogging Seminar.
I also have 319 issues of This Week's Finds, many of which discuss papers on the arXiv. People in math and physics say these are a useful resource. They'd be even more useful if a program could go through them and create spnetwork links to all the arXiv paper URLs. They're all here:
@johncarlosbaez Obviously if there's a straightforward rule by which an algorithm can index your archived posts, we should do it that way rather than asking people to retag old posts by hand. What would accomplish this for you? E.g. will automatically recognizing arXiv URLs accomplish most of what you need? We need to get specific now.
For both blog posts and This Week's Finds, I think automatically recognizing arXiv URLs and getting spnetwork to link to the post in question is all we need or anyone can easily do. (It seems very hard to find and excerpt the portion of the post that actually discusses the arXiv paper.)
To do this right, we need to be aware of a number of old-style variant arXiv URLs, like these:
http://arxiv.org/abs/hep-th/9409044
http://arxiv.org/abs/gr-qc/9404059
and, truly old-school:
@pkra the mathblogging.org / scienceseeker API looks very cool, e.g. the has-citation search clearly is very useful for us. We can plug this in pretty easily. Perhaps we could talk some time so I can learn quickly about how we should index author identities, tags, etc.?
@johncarlosbaez OK, that's all reasonably straightforward. How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?
@cjlee112 sure thing.
Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"
I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.
Could spnet be made to let blog authors sort these things out manually after import from a blog, if they wish to?
johncarlosbaez notifications@github.com skrev:
Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"
I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.
Reply to this email directly or view it on GitHub: https://github.com/cjlee112/spnet/issues/71#issuecomment-25189245
@Thomas, I'm sure that could be done, but it doesn't seem particularly high priority for now.
On Fri, Sep 27, 2013 at 4:12 AM, Thomas Arildsen notifications@github.comwrote:
Could spnet be made to let blog authors sort these things out manually after import from a blog, if they wish to?
johncarlosbaez notifications@github.com skrev:
Chris wrote: "How do you want multiple paper citations per post to be handled? Should all the citations be treated as "equal" or are there situations where the post is mainly about one paper, and the other citations are secondary? If so, is there any prospect for automatically recognizing the latter case?"
I think it's hopeless to recognize these different cases, because there's no sharp borderline between them and no standard signal that some article citation is "primary". So I'd just treat them all as equal. Sometimes people will be disappointed that a blog articles cites an arXiv paper in a very minor way, e.g. as part of a list of papers that are worth reading on some subject.
Reply to this email directly or view it on GitHub: https://github.com/cjlee112/spnet/issues/71#issuecomment-25189245
— Reply to this email directly or view it on GitHubhttps://github.com/cjlee112/spnet/issues/71#issuecomment-25190173 .
Regarding the question of how to handle multiple references in a single post, see #80 .
Once you get around to it, support for WordPress blogs would also be very welcome - both wordpress.com and self-hosted WP blogs.