GPy - A Gaussian Process (GP) framework in Python¶

Introduction¶

GPy is a Gaussian Process (GP) framework written in Python, from the Sheffield machine learning group. It includes support for basic GP regression, multiple output GPs (using coregionalization), various noise models, sparse GPs, non-parametric regression and latent variables.

The GPy homepage contains tutorials for users and further information on the project, including installation instructions.

The documentation hosted here is mostly aimed at developers interacting closely with the code-base.

Source Code¶

The code can be found on our Github project page. It is open source and provided under the BSD license.

Installation¶

Installation instructions can currently be found on our Github project page.

Tutorials¶

Several tutorials have been developed in the form of Jupyter Notebooks.

Architecture¶

GPy is a big, powerful package, with many features. The concept of how to use GPy in general terms is roughly as follows. A model (GPy.models) is created - this is at the heart of GPy from a user perspective. A kernel (GPy.kern), data and, usually, a representation of noise are assigned to the model. Specific models require, or can make use of, additional information. The kernel and noise are controlled by hyperparameters - calling the optimize (GPy.core.gp.GP.optimize) method against the model invokes an iterative process which seeks optimal hyperparameter values. The model object can be used to make plots and predictions (GPy.core.gp.GP.predict).

$digraph GPy_Arch { rankdir=LR node[shape="rectangle" style="rounded,filled" fontname="Arial"] edge [color="#006699" len=2.5] Data->Model Hyperparameters->Kernel Hyperparameters->Noise Kernel->Model Noise->Model Model->Optimize Optimize->Hyperparameters Model->Predict Model->Plot Optimize [shape="ellipse"] Predict [shape="ellipse"] Plot [shape="ellipse"] subgraph cluster_0 { Data Kernel Noise } }$

For developers

API Documentation