Open lintool opened 6 years ago
I could separately crawl the BibTeX I suppose. It was just much easier to use the data I had already pulled.
Crawling the BibTeX and deduplicating is probably a better solution, but I pushed this to fix the accents. I'll try to take a look at the other oddity later.
For the proceedings oddity, it looks like "SIGMOD Conference" is the only useful thing DBLP returns from its XML API. Unfortunately, this means significant rewriting to fix which is out of scope for me right now.
Crawler seems to be mangling accents:
Also: https://dblp.uni-trier.de/rec/bibtex/conf/sigmod/BegoliCHML18
booktitle says:
Why is it "SIGMOD Conference" above?
Also - why can't we just crawl the bibtex here? https://dblp.uni-trier.de/rec/bibtex/conf/sigmod/BegoliCHML18