Code4SA / various-scrapers

Apache License 2.0

Authors not correctly extracted #1

Closed adieyal closed 10 years ago

adieyal commented 10 years ago

From what I can see, the author field is not being extracted correctly for every article (not the most important field, but...). For example:

"_id" : ObjectId("531f760f0e82536fc0e6a003"), "author" : "\nDeur: Evan NaudeFoto: Weslander\t",

"_id" : ObjectId("5317ebaf0e82531e64560700"), "author" : "Pongrass Import",

"_id" : ObjectId("531f6d3b0e82536e75741fd2"),"author" : "\nBy: web editorPhoto: Supplied\t",
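A minimal sketch of one possible post-processing step for values like the ones above, not the repository's actual fix. It assumes the byline labels are "By:" / "Deur:" and that a photo credit, when present, starts with "Photo:" / "Foto:" and runs to the end of the string. The function name `clean_author` and both patterns are hypothetical, derived only from the three samples in this issue.

```python
import re

# Assumed byline labels (English and Afrikaans), taken from the samples above.
_BYLINE_PREFIX = re.compile(r"^\s*(?:By|Deur)\s*:\s*", re.IGNORECASE)
# Assumed photo-credit labels; everything from the label to the end is dropped.
_PHOTO_CREDIT = re.compile(r"\s*(?:Photo|Foto)\s*:.*$", re.IGNORECASE | re.DOTALL)


def clean_author(raw):
    """Return the author name with whitespace, byline label and photo credit removed."""
    text = raw.strip()                  # drop the leading "\n" and trailing "\t"
    text = _PHOTO_CREDIT.sub("", text)  # cut the fused "Foto: ..." / "Photo: ..." tail
    text = _BYLINE_PREFIX.sub("", text) # remove the "By:" / "Deur:" label
    return text.strip()


if __name__ == "__main__":
    for sample in (
        "\nDeur: Evan NaudeFoto: Weslander\t",
        "Pongrass Import",
        "\nBy: web editorPhoto: Supplied\t",
    ):
        print(repr(clean_author(sample)))
```

A value such as "Pongrass Import" passes through unchanged. If it is a placeholder rather than a real author, it would need separate handling in the scraper itself.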