tshrinivasan / tools-for-wiki

Tools required for wiki projects
6 stars 9 forks source link

I have a folder with 2000 books #9

Open jayantanth opened 8 years ago

jayantanth commented 8 years ago

I have a folder with 2000 books in PDF format downloaded from DLI website. I need csv format file for upload all file to Internet Archive website through https://github.com/kngenie/ias3upload. Please please make a python script of that?

The CSV file would like that https://github.com/kngenie/ias3upload/blob/master/metadata.csv

https://github.com/tshrinivasan/tools-for-wiki/blob/master/pdf-stats/pdf_stats.py , this is of your script may help

My all file of this folder with name like

4990010052039-Jiban-Charit., Chattopadhyay, Bankimchandra . 1911. bengali. GEOGRAPHY. BIOGRAPHY. HISTORY. 574 pgs.pdf

where

item: \ = Jiban-Charit creator: =Chattopadhyay, Bankimchandra file:** = 4990010052039-Jiban-Charit., Chattopadhyay, Bankimchandra . 1911. bengali. GEOGRAPHY. BIOGRAPHY. HISTORY. 574 pgs.pdf title:Jiban-Charit

jayantanth commented 8 years ago

untitled

jayantanth commented 8 years ago

for Example my PDF file name looks like 4990010208365 - Kabya Sanchayan Ed. 7, Dutta, Satyendra Nath, 250p, LANGUAGE. LINGUISTICS. LITERATURE, bengali (1953)

So I have a full meta data with my PDF file name downloaded from DLI. so the PDF file name contents with "DLI bar code No"-"Book name with ed", " author second name", " author first name", "no of page in book","subject","language","(Publication year)"

I need a tool to for those all pdf file to upload directly to Wikimedia commons as like PDF upload tool

OR

Create one CVS format text file from all my PDF name as like https://github.com/kngenie/ias3upload. So we can upload all those file to Internet archive

There is a tool http://tools.wmflabs.org/ia-upload/commons/init , we can uploads all files Internet archive to Commons.