CrossRef / pdfextract

MOVED TO https://gitlab.com/crossref/pdfextract
https://gitlab.com/crossref/pdfextract
MIT License
508 stars 89 forks source link

Citation for a review of Article X is recovered rather than the citation for Article X itself. #30

Open rschwiebert opened 9 years ago

rschwiebert commented 9 years ago

Perhaps this is already known, or else is a problem in the database this tool consults, but I figured I would record some testcases here for use in bugfixing (if the problem is on this end.)

This:
[19] R. L. Graham, D. E. Knuth and T. S. Motzin, Complements and transitive closures, Discrete Math., 21 (1972), 17–29. [20] P. R. Halmos, Lectures on Boolean Algebras, Van Nostrand, Princeton, 1963. [21] P. C. Hammer, Kuratowski’s Closure theorem, Nieuw Arch. Wisk., 8 (1960), 74–80.

Generated this bibtex: @article{Wallace_1964, doi = {10.1126/science.144.3618.531-b}, url = {http://dx.doi.org/10.1126/science.144.3618.531-b}, year = 1964, month = {may}, publisher = {American Association for the Advancement of Science ({AAAS})}, volume = {144}, number = {3618}, pages = {531--532}, author = {A. D. Wallace}, title = {Lectures on Boolean Algebras. Paul R. Halmos. Van Nostrand, Princeton, N.J., 1963. vi $\mathplus$ 147 pp. Illus. Paper, {\textdollar}2.95}, journal = {Science} }

(Wallace appears nowhere in the pdf I'm extracting.) Similarly this: [23] E. Hewitt, A problem in set-theoretic topology, Duke Math. J., 10 (1943), 309–333. [24] G. E. Hughes and M. J. Cresswell, A New Introduction to Modal Logic, Routledge, London 1996. [25] M. Jackson, Closure semilattices, Algebra Universalis, 52 (2004), 1–37.

lead to this:

@article{Zakharyaschev_1997, doi = {10.2307/2275655}, url = {http://dx.doi.org/10.2307/2275655}, year = 1997, month = {dec}, publisher = {Cambridge University Press ({CUP})}, volume = {62}, number = {04}, pages = {1483--1484}, author = {Michael Zakharyaschev}, title = {Hughes G. E. and Cresswell M. J.. A new introduction to modal logic. Routledge, London and New York 1996, x $\mathplus$ 421 pp.}, journal = {The Journal of Symbolic Logic} }

and this

[26] M. Jackson and T. Stokes, Semilattice pseudocomplemented semigroups, Comm. Algebra, 32 (2004), 2895–2918. [27] J. L. Kelley, General Topology, Van Nostrand Reinhold Co. Inc. Princeton, NJ, 1955. [28] W. Koenen, The Kuratowski closure problem in the topology of convexity, Amer. Math. Monthly, 73 (1966), 704–708.

lead to this

@article{Larkin_1962, doi = {10.2307/2964144}, url = {http://dx.doi.org/10.2307/2964144}, year = 1962, month = {jun}, publisher = {Cambridge University Press ({CUP})}, volume = {27}, number = {02}, pages = {235}, author = {Francis P. Larkin}, title = {Kelley John L.. General topology. D. van Nostrand Company, Inc., New York, Toronto, and London, 1955, xiv $\mathplus$ 298 pp.}, journal = {The Journal of Symbolic Logic} }

and this

[31] N. Levine, On the commutativity of the closure and interior operators in topological spaces, Amer. Math. Monthly, 68 (1961), 474–477. [32] J. C. C. McKinsey and A. Tarski, The algebra of topology, Ann. Math., 45 (1944), 141–191. [33] L. E. Moser, Closure, interior and union in finite topological spaces, Colloq. Math., 38 (1977), 41–51.

lead to this: @article{Vaughan_1944, doi = {10.2307/2267577}, url = {http://dx.doi.org/10.2307/2267577}, year = 1944, month = {dec}, publisher = {Cambridge University Press ({CUP})}, volume = {9}, number = {04}, pages = {96--97}, author = {H. E. Vaughan}, title = {{McKinsey} J. C. C. and Tarski Alfred. The algebra of topology. Annals of mathematics, ser. 2 vol. 45 (1944), pp. 141{\textendash}191.}, journal = {The Journal of Symbolic Logic} }

rschwiebert commented 9 years ago

Ah heck: after clicking through to the doi's, I think I figured out what the problem is. In the last three cases, the citations returned are correct citations for reviews of the material in question. The title of the actual work is actually a substring of the title of the review's title. I can't get around the paywall in the first example, but I guess it is the same issue.

It would be nice if this problem could be fixed, but I'm not holding my breath.