::: :::: ::: ::: ::: :::::::: ::::::::
:+: :+: :+:+: :+: :+: :+: :+: :+: :+: :+:
+:+ +:+ :+:+:+ +:+ +:+ +:+ +:+ +:+
+#++:++#++: +#+ +:+ +#+ +#++: +#+ +#++: +#+ +#+ +#+ +#+#+# +#+ +#+ +#+
Anything To Triples (Any23) is a library and web service that extracts structured data in RDF format from a variety of Web documents.
any23-core The library core codebase. any23-service The library HTTP service codebase. lib Contains the Any23 the external deps (read lib/README.txt for further details). plugins Library plugins codebase (read plugins/README.txt for further details). RELEASE-NOTES.txt File reporting main release notes for every version. LICENSE.txt Applicable project license. README.txt This file.
For details on the command line tool and web interface, see: http://developers.any23.org/getting-started.html
For a guide to using any23 as a library in your Java applications, see: http://developers.any23.org/developers.html
Javadocs is available here: http://developers.any23.org/apidocs/
Be sure to have the Apache Maven v.2.2.x+ installed and included in $PATH.
For specific information about Maven see: http://maven.apache.org/
Go to the trunk folder:
$ cd trunk/
and execute the following command:
trunk$ mvn clean install
This will install the Any23 artifacts and its dependencies in your local Maven2 repository.
Any23 comes with some command line tools:
any23 allows to perform a metadata extraction on a file or URL source. any23tools provides access to some auxiliary tools.
The complete documentation about these tools can be found here: http://developers.any23.org/getting-started.html
To run such tools, go to the any23-core bin folder:
trunk$ cd any23-core/bin
and then invoke them:
bin$ ./any23 [usage instructions will be printed out]
bin$ ./any23tools [usage instructions will be printed out]
Any23 can be run as a service. To run the Any23 service go to the any23-service bin folder:
trunk$ cd any23-service/bin
and then invoke:
bin$ ./any23server
You can check the service is running by accessing http://localhost:8080/ with your browser.
The complete documentation about this service can be found here: http://developers.any23.org/getting-started.html
The Any23 Service WAR by default will be generated as self-contained, all the dependencies will be included as JAR within the WEB-INF/lib archive dir.
To generate the self contained WAR invoke:
any23-service$ mvn [-o] [-Dmaven.test.skip=true] clean package
Where -o will build the process offline, and -Dmaven.test.skip=true will force the test skipping.
The WAR will be generated in
target/any23-service-X.Y.Z-VERSION.war
To produce a instead a WAR WITHOUT the included JAR dependencies it is possible to use the war-without-deps profile:
any23-service$ mvn [-o] [-Dmaven.test.skip=true] -Pwar-without-deps clean package
Again the WAR will be generated in
target/any23-service-X.Y.Z-VERSION.war
To generate the project site locally execute the following command from the trunk dir:
trunk$ MAVEN_OPTS='-Xmx1024m' mvn clean site:site
You can speed up the site generation process specifying the offline option ( -o ), but it works only if all the involved plugin dependencies has been already downloaded in the local M2 repository:
trunk$ MAVEN_OPTS='-Xmx1024m' mvn -o clean site:site
If you're interested in generating the Javadoc enriched with navigable UML graphs, you can activate the umlgraphdoc profile. This profile relies on graphviz ( http://www.graphviz.org/) that must be installed in your system.
trunk$ MAVEN_OPTS='-Xmx1024m' mvn -P umlgraphdoc clean site:site
::Developers interest only.::
In order to correctly deploy the site to a remote FTP host you just need to properly set up
the following lines in your
<site>
<id>any23.developers</id>
<name>Any23 Developer Web Site</name>
<url>ftp://FTP-HOSTNAME</url>
</site>
Remember that you need to set up your username and password to access to that FTP in your settings.xml in this way:
<server>
<id>any23.developers</id>
<username>FTP-USERNAME</username>
<password>FTP-PASSWORD</password>
</server>
To perform the deployment simply run:
mvn clean site:site site:deploy
Optionally you may require to fix the mimetype for *.html files:
cd site svn up find . -name ".html" | xargs svn ps svn:mime-type text/html find . -name ".css" | xargs svn ps svn:mime-type text/css svn ci
::Developers interest only.::
Check the configuration in section distributionManagement within pom.xml:
<distributionManagement>
...
<repository>
<id>rdf-commons-googlecode</id>
<name>RDF Commons Google Code Snapshot Repository</name>
<url>svn:https://rdf-commons.googlecode.com/svn/repo/</url>
</repository>
...
<distributionManagement>
Then to deploy a snapshot release perform:
mvn clean deploy
::Developers interest only.::
To prepare a new release, just verify that the are no local changes and then invoke:
mvn release:prepare [-Dusername=<svn.username> -Dpassword=<svn.pass>]
if everything goes right, perform the release simply typing:
trunk$ MAVEN_OPTS='-Xmx2048m' mvn release:perform
Export the just created tag:
tmp-dir$ svn export <path/to/curr-tag>
Package all modules for direct download:
$ cd <curr-tag-export>/
<curr-tag-export>$ mvn clean package
cd any23-core
mvn assembly:assembly
cd ..
cd ..
tar cvzf any23-<curr-tag>.tar.gz tags/<curr-tag>
zip -r any23-<curr-tag>.zip tags/<curr-tag>
Upload the produced packages in download section:
http://code.google.com/p/any23/downloads/list
::Developers interest only.::
External Deps are libraries used by some Any23 modules which are not available in public Maven repositories. Such libraries are managed within the 'lib' dir.
EOF