GOKB is an environment for managing the collaborative import, cleaning and maintaining of publisher title lists. The focus is the supply of information around the administration and availability of print and electronic resources rather than specific bibliographic information. The goal of GOKb is to reduce the burden of data maintenance and editing on any one party by sharing the load over a collaborative network of subject matter experts. Because the information is global, and the same for everyone, the idea is to focus all our effort on editing one copy of the information that can then be freely reused and shared.
Some screenshots follow
To build the refine extension, ** cd into gokb-phase1/refine/extensions/gokb
run "ant" with any local overrides for refine installation dir and tomcat for server jars, eg ** ant -Drefine.dir=/home/ibbo/google-refine-2.5 -Dserver.dir=/home/ibbo/apache-tomcat
Copy (Or arrange to symlink) the gokb directory to your local equivalent of ~/google-refine-2.5/webapp/extensions/
I packaged up the gokb extension with #
zip -r gokb.zip ./gokb
from gokb-phase1/server/gokb grails run-app
A script is provdied to import the ncsu orgs list, run from gokb-phase1/scripts/import/orgs as follows:
./ncsu_orgs_imp.groovy ./ncsu-auth-orgs-roles-2013-01-11.csv
For parsing out Platform URLs http://docs.guava-libraries.googlecode.com/git/javadoc/com/google/common/net/InternetDomainName.html System.out.println(InternetDomainName.fromLenient(uriHost).topPrivateDomain().name());