zcapjdb / UKAEAGroupProject

UKAEA Group Project

Repository for the UCL CDT in DIS group project with UKAEA.

Installation:

To install the package, run pip install . in the root directory (installing by running setup.py directly is deprecated). If you wish to make changes to the package, install it with pip install -e . instead, which installs the package in editable mode.

Pipeline:

The pipeline for the retraining of the classifier and regressors is included in the pipeline directory.

The pipeline consists of the following files:

Data Preparation:

The .h5 file provided is converted to pickled dataframes, which are what the models in the project consume. The data is also split into training, validation and test sets at this point. This is done in QLKNNDataPreparation.ipynb.
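
The conversion and split described above can be sketched roughly as follows. This is an illustration only, not the code in QLKNNDataPreparation.ipynb: the function names, the split fractions, the output file naming and the assumption that the .h5 file is readable with pandas.read_hdf (which needs PyTables) are all hypothetical.

```python
import random


def split_indices(n_rows, val_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle row positions 0..n_rows-1 and cut them into train/val/test lists."""
    if not 0 <= val_frac + test_frac < 1:
        raise ValueError("val_frac + test_frac must be in [0, 1)")
    positions = list(range(n_rows))
    random.Random(seed).shuffle(positions)  # seeded so the split is reproducible
    n_val = int(n_rows * val_frac)
    n_test = int(n_rows * test_frac)
    val = positions[:n_val]
    test = positions[n_val:n_val + n_test]
    train = positions[n_val + n_test:]
    return train, val, test


def prepare(h5_path, out_prefix, key=None):
    """Read an HDF5 table and write one pickle per split.

    Hypothetical helper: assumes the file can be read by pandas.read_hdf.
    """
    import pandas as pd  # imported here so split_indices works without pandas

    df = pd.read_hdf(h5_path, key=key)
    splits = split_indices(len(df))
    for name, idx in zip(("train", "val", "test"), splits):
        # Writes e.g. <out_prefix>_train.pkl (naming is an assumption)
        df.iloc[idx].to_pickle(f"{out_prefix}_{name}.pkl")
```

Fixing the seed keeps the three sets identical between runs, so the classifier and the regressors can be compared on the same test rows.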

Initial data exploration can also be found in DataExploration.ipynb.

Missing Outputs:

A major difficulty in applying a neural network surrogate model to the data is that QuaLiKiz, which is used to train the surrogate, does not always map an input to an output.

It is therefore of interest to understand why this is the case. To do so, a classifier is trained on the model inputs to determine whether a given set of inputs gives a corresponding output. The classifier and its results are shown in Classifier.ipynb, with hyperparameter tuning performed in classifier_optimisation.ipynb.
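
As an illustration of the classification target described above, the sketch below turns a column of outputs into binary labels. It assumes that a missing output is stored as None or NaN, which may not match how the project data encodes it; the function names are hypothetical and this is not the code in Classifier.ipynb.

```python
import math


def has_output(value):
    """Return 1 if an output value is present for this row, else 0."""
    # Assumption: a missing output is stored as None or NaN.
    return 0 if value is None or math.isnan(value) else 1


def make_labels(outputs):
    """Build the binary target: 1 = input mapped to an output, 0 = no output."""
    return [has_output(v) for v in outputs]


def positive_fraction(labels):
    """Share of rows that have an output; useful for spotting class imbalance."""
    return sum(labels) / len(labels) if labels else 0.0
```

Labels built this way would then be paired with the model inputs to train the classifier, and the positive fraction gives a quick check on how unbalanced the two classes are.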

QLKNN Reproduction:

Autoencoder:

Gaussian Process: