SoftVarE-Group / MibTeX

Minimalistic tool to manage your references with BibTeX
GNU Lesser General Public License v3.0
2 stars 1 forks source link

Improved handling for titles including subtitles in citation service #32

Closed tthuem closed 6 years ago

tthuem commented 6 years ago

Entries such as SGB05 and KG06 are simply not found as google scholar omits there subtitles. If a paper cannot be found, it should be rather easy to just check the title again, but removing everything starting from ":".

Kogoro commented 6 years ago

Implemented a second check without subtitle

tthuem commented 6 years ago

Please let me know if I can tryout a new version.

Kogoro commented 6 years ago

@tthuem Please try again. Tested it with over 30 entries and seems not unreasonable.

tthuem commented 6 years ago

Got the value -2 for the following entries: FKPV:CEC16, KGP:EMSE13, NCB+SEI07, BHJM:STTT07

tthuem commented 6 years ago

More examples: citations.zip

Kogoro commented 6 years ago

I integrated the logging into a separate file called problems.csv (s. https://github.com/tthuem/MibTeX/commit/822d57d3c8cc1e666a299b49007bef6190d25cce)

Note: For several hours, the citation service worked quite fine until the calls were stopped by a CAPTCHA. Therefore, please check if the dev-robot branch with commit https://github.com/tthuem/MibTeX/commit/47ce47206214e325d6e49f7cd34135c1238191df is of any help.

tthuem commented 6 years ago

My export into a single HTML file does not work anymore (on both branches). It seems that my property file is somehow ignored.

Kogoro commented 6 years ago

Works on my system and I did not change anything related to the property files. Do you have a error message? My ini file looks like:

[options]
bibtex-dir=\\BibTags\\
main-dir=\\BibTags\\
out-dir-rel=..\\HTML\\
pdf-dir=..\\Library\\
pdf-dir-rel=PDF
tags=sampling-tags
clean=true
citationService=true
citation-dir=\\BibTags\\classification\\
out-format=HTML_NEW
tthuem commented 6 years ago

Works better now. Experienced problems only with the following keys so far: DDP:EMSE17, M:ICFI07

There is no entry in the problems.csv so far.

Kogoro commented 6 years ago

The site does not list any citations for DDP:EMSE17, but for M:ICFI07 it shows a count of one. Will test this specific two entries again.

tthuem commented 6 years ago

In both cases the score was -2. If there are no citations I would have expected 0.

Kogoro commented 6 years ago

At the moment, we have the following codes:

    public final static int UNINITIALIZED = -1;

    public final static int NOT_FOUND = -2;

    public final static int PROBLEM_OCCURED = -3;

    public final static int ROBOT = -4;

We can change -2 to 0, if the title is found.

tthuem commented 6 years ago

No, that would not help. Please keep the error codes as is.

0 is supposed to say the article has been found but has no citations.

Kogoro commented 6 years ago

Sorry, the regex for the title was wrong and therefore the citations were always -2. Now it is 0 again.

tthuem commented 6 years ago

Works fine now