dbpedia / databus-maven-plugin

Databus Maven Plugin: Aligning Data and Software Lifecycle with Maven
GNU Affero General Public License v3.0
6 stars 10 forks source link

Databus Maven Plugin

Build Status Maven Central Version Post dataid.ttl into your Databus space to consume your files via the Databus SPARQL API Feedback: https://forum.dbpedia.org

User Manual

Read the frolicking Manual

Data Dependencies for Maven

Our goal is to align the data and software lifecycle. We created a Maven plugin that can upload output of software, e.g. the DBpedia Information Extraction Framework to the Databus platform. The other tools can include it again, just like software dependencies via Maven Central and Archiva. See the Databus Client and Databus Derive Plugin for data dependencies and automating software with data. The plugin was developed to use the features of the Maven software build automation tool for data releases and metadata generation. Once configured properly (1-3 hours), data can be released and re-released systematically in minutes.

Development

License

License of the software is AGPL with intended copyleft. We expect that you spend your best effort to commit upstream to make this tool better or at least that your extensions are made available again. Any contribution will be merged under the copyright of the DBpedia Association.

Development notes

Super-pom

To set defaults for the plugin we developed a parent pom.xml in the super pom folder. All other pom.xml can use it. Devnote: When uploading to archiva, select pomFile for pom.xml and leave generate pom unchecked

Maven Archiva

We are running a Maven Archiva for Snapshots. New snapshots can be deployed with mvn deploy as configured in the pom.xml. Password is needed to be entered in ~/.m2/settings.xml

Troubleshooting

databus-shared-lib / Local install works, travis will show build error

databus-shared-lib is a co-developed library. If you update it, travis will fail. Travis chaches the shared lib every 6 hours, so you need to flush the cache manually or wait 6 hours

Download from http://databus.dbpedia.org:8081/repository/ fails, no dependency information available

Note: this section can be removed after completion of https://github.com/dbpedia/databus-maven-plugin/issues/12 Possible reason: we have installed a dev archiva for now. Depending on your org's network configuration, code might only be accepted from Maven Central and local/allowed maven repos.

Can potentially fixed by locally installing the shared-lib:

Then clone the repo and run mvn install which will install the databus-maven-plugin locally

databus plugin goals are not found after installing the plugin via sources (mvn install)

[ERROR] Could not find goal 'metadata' in plugin org.dbpedia.databus:databus-maven-plugin:1.1-SNAPSHOT among available goals -> [Help 1]
org.apache.maven.plugin.MojoNotFoundException: Could not find goal 'metadata' in plugin org.dbpedia.databus:databus-maven-plugin:1.1-SNAPSHOT among available goals 

Try to wipe (make a copy of it and then delete the original) your m2 (maven local repository) and then build it again.

BUILD FAILURE, no mojo-descriptors found (when using mvn install to install the databus-maven-plugin)

This is most likely caused by using an old maven version (observed in version 3.0.5) A workaround for this would be replacing:

<plugin>
    <groupId>org.apache.maven.plugins</groupId>
        <artifactId>maven-plugin-plugin</artifactId>
    <version>3.4</version>
</plugin>

with

<plugin>
    <groupId>org.apache.maven.plugins</groupId>
        <artifactId>maven-plugin-plugin</artifactId>
    <version>3.4</version>
        <configuration>
        <skipErrorNoDescriptorsFound>true</skipErrorNoDescriptorsFound>
    </configuration>
        <executions>
        <execution>
                <id>mojo-descriptor</id>
                <goals>
                            <goal>descriptor</goal>
                        </goals>
        </execution>
    </executions>
</plugin>

in databus-maven-plugin/pom.xml

UTF-8 - Encoding Errors in the produced data

On Unix: run: grep "LC_ALL" .* in your /root/ directory and make sure

.bash_profile:export LC_ALL=en_US.UTF-8
.bashrc:export LC_ALL=en_US.UTF-8

is set.

[ERROR] org.scalatra.HaltException at data deploy

Its possible you messed up at creating your webid (especially if you have multiple names in the webid) and used the wrong SAN in the cert.config.\ Check if your SAN is correct by using\ openssl x509 -in certificate.crt -text -noout\ on your .crt file.\ If you just have the pkcs12 file use openssl pkcs12 -in [yourfile.pfx] -clcerts -nokeys -out [drlive.crt]\ to generate it.\ Check if the SAN matches the one you used in the webid.ttl and if not change the cert.config and generate the pkcs12 file again.