meteo-ubonn / meanie3D

meanie3D: mean-shift clustering in NetCDF data sets
MIT License
0 stars 2 forks source link

meanie3D

[//]: # {#mainpage}

This project provides a very generic implementation of mean-shift clustering on NetCDF data sets. The data sets need to follow the "CF-Metadata":http://cfconventions.org. The mean-shift algorithm is a robust unsupervised clustering algorithm with a wide range of applications. Meanie3D provides a multivariate implementation with no limits as to the number of variables that can be used in constructing the Featurespace.

This software supports 2D as well as 3D data sets and comes with many configuration options. The code was written as part of my efforts within "OASE HErZ group":http://www.herz-tb1.uni-bonn.de and contains some modules that are specific to this application as a result. If interest is there I might remove all specific components at one point. Meanie3D is implemented largely as a C++ template library with only a few object files (mostly from the numerical recipes library) for performance reason and uses OpenMP for parallelisation.

The software package comes with a single command in the form of a python script 'meanie3D' which makes it very easy to use. The detection, tracking and post-processing steps (such as statistics, visualisation of tracks, source data and clusters as well as the track graph (a simple, graphical representation of a tracking dictionary for obtaining detailed view on the history of individual clusters and tracks) are all controlled through a single configuration file.

License

Meanie3D comes under [[MIT license]].

Version

The latest stable version is v1.6.0. Versions are tagged.

Build Instructions

Dependencies

Meanie3D comes with a number of dependencies that need to be installed prior to attempting installation:

Optionally, the following libraries may be used, if they are switched on with the appropriate flags:

Compiler Prerequisites

Meanie3D uses OpenMP by default (-DWITH_OPENMP=1) and requires an OpenMP enabled compiler, such as:

Obtain and compile Meanie3D

Meanie3D uses CMAKE to generate makefiles. You can use CMAKE's abilities to generate IDE files if you prefer. Start by cloning the master branch (for an up-to date but possibly unstable version) or one of the stable releases.

git clone https://github.com/meteo-ubonn/meanie3D.git
cd meanie3D
git submodule init
git submodule update radolan

If you want to download the OASE topology and mapdata file for visualisation, you can obtain this file by adding:

git clone http://git.meteo.uni-bonn.de/git/oase-mapdata

Create a build directory:

cd ..
mkdir meanie3D-make
cd meanie3D-make
cmake ../meanie3D

In order to switch optimizations on, tell cmake to use the release build type:

cmake -DCMAKE_BUILD_TYPE=Release ../meanie3D

Note that there have been some problems on Linux with aggressive optimization and NetCDF. Your mileage may vary. You should try the release build in any event, since it speeds up performance a lot. If you observe unexpected problems in reading/writing NetCDF files, you may have fallen victim to the problem and revert to standard build (leave the -DCMAKE_BUILD_TYPE=Release). Once all dependencies are successfully resolved, install the product by calling the following:

make install

There are a number of flags you can use to customize your installation:

VTK/ visualization code

Because of the heavyweight of the VTK package, the visualization code is disabled by default. If you have VTK6 installed, you can enable visualizations by adding the following flag to your cmake call:

-D WITH_VTK=YES

This will result in some binaries and command line options for visualization to become available to you.

Add tests

Meanie3D has a number of regression tests, that cover the core algorithms and collection classes. This will become important to you if you should decide to work on the core algorithms yourself. The tests are a good method of making sure you haven't broken anything critical. In order to enable unit tests, use the flag:

-D WITH_TESTS=YES

The unit tests can then be run by calling

make test

Source code level documentation

In order to start developing your own Meanie3D code, it might be useful to have API documentation of the various classes in the project. If you have doxygen installed, you can call the following make command to create a browsable HTML documentation in doc/html (open the file index.html).

make docs

Frequently Asked Quesions

I'm having trouble compiling the code

The code has been tested on MacOS X and Suse Linux. Feel free to report problems using the "New Issue":http://git.meteo.uni-bonn.de/projects/meanie3d/issues/new link above. Please prepend "[BUILD PROBLEM]:" to your subject line.

NetCDF/HDF5 trouble

Versions - make sure the NetCDF version is 4.2 or better. Also make certain that you have a HDF5 installation to match it. If you want to make sure, uninstall HDF5 and NetCDF and re-install them from source (HDF5 first, then NetCDF) Commonly there is a problem with the NetCDF C++ API - NetCDF has a legacy C++ API from NetCDF3, which does not work with Meanie3D. Make sure you have the correct one installed (http://www.unidata.ucar.edu/downloads/netcdf/ftp/netcdf-cxx4-4.2.tar.gz)

CMAKE does not find your installed packages

If cmake has trouble locating packages you installed, check the files in the cmake_modules directory and adjust the search paths there instead of hardwiring paths into the CMakeLists.txt. If your system layout is not considered and you have to make adjustments to the search paths, please send a copy of your Find.cmake file to juergen.simon@uni-bonn.de so I can include your changes in the next release and maybe save someone else the trouble.

More information

Check out the Main Wiki Page at http://git.meteo.uni-bonn.de/projects/meanie3d/wiki for details on how the method works, file formats and what the individual binaries do.