tshrinivasan / OCR4wikisource

OCR for WikiSource using Google Drive OCR
GNU General Public License v2.0
33 stars 24 forks source link

Stats for usage of this tool #60

Open ravidreams opened 8 years ago

ravidreams commented 8 years ago

Is it possible to collect usage stats for this tool? This will help to demonstrate impact and push WMF / Google to come up with an official tool based on this.

Every time the tool runs, it can collect the following details:

Check https://meta.wikimedia.org/wiki/Grants:Learning_%26_Evaluation/Global_metrics#The_metrics to know how WMF assesses impact.

These details can also be presented in a leaderboard fashion based on language codes.

jayantanth commented 8 years ago

+1 but please check the feasibility of this script

bodhisattwawiki commented 8 years ago

+1 to this

tshrinivasan commented 8 years ago

Can anyone create a wiki page for this with required details with sample data?

I can extend that page with ocr operation details.

We can use the same wiki user that is used in script to update that page.

Regards, T.Shrinivasan

My Life with GNU/Linux : http://goinggnu.wordpress.com Free E-Magazine on Free Open Source Software in Tamil : http://kaniyam.com

Get Free Tamil Ebooks for Android, iOS, Kindle, Computer : http://FreeTamilEbooks.com

ravidreams commented 8 years ago

Please see

https://ta.wikisource.org/s/l70

for a sample stats table.

For template and other reference regarding similar stats, see

https://ta.wikipedia.org/s/5dzz - Weekly leaderboard in Tamil Wikipedia

https://ta.wikipedia.org/s/59rk - Daily contribution stats in Tamil Wikipedia

Global Wikisource stats - https://meta.wikimedia.org/wiki/Wikisource

A separate Grant total table is available there.

bodhisattwawiki commented 8 years ago

Sample stat

https://tools.wmflabs.org/phetools/statistics.php

tshrinivasan commented 8 years ago

Is there any lengh limit in wiki for such tabular data?

How can we add pagination? or is pagination is already there for lengthy wiki pages?

ravidreams commented 8 years ago

Storing data by month is one way of pagination. Otherwise, we can just create subpages like /1 and /2 based on table items count. There is no length limit for tabular data. Just that it becomes tedious to view in a browser when too many table items are added. We can very well add 1000s of files usig this tool.