# PyTorch models with Xplique
Other tutorials applying Xplique to PyTorch models: *Attributions: Object Detection* and *Attributions: Semantic Segmentation*.
Note
What we did for PyTorch should be possible for other frameworks as well. Do not hesitate to give it a try and to make a PR if you succeed!
## Is it possible to use Xplique with PyTorch models?
Yes, it is! Even though the library was mainly designed as a TensorFlow toolbox, we provide a practical wrapper that facilitates the integration of your PyTorch models into Xplique's framework!
## Quickstart
```python
import torch

from xplique.wrappers import TorchWrapper
from xplique.attributions import Saliency
from xplique.metrics import Deletion

# load images, targets and model
# ...

# mount the model on GPU if one is available
device = 'cuda' if torch.cuda.is_available() else 'cpu'
wrapped_model = TorchWrapper(torch_model, device)

# explain with Saliency, as with any TensorFlow model
explainer = Saliency(wrapped_model, operator="classification")
explanations = explainer(inputs, targets)

# evaluate the explanations with the Deletion metric
metric = Deletion(wrapped_model, inputs, targets, operator="classification")
score_saliency = metric(explanations)
```
## Does it work for every module?
It has been tested on both the attributions and the metrics modules.
## Does it work for all attribution methods?
Not yet, but it works for most of them (even for gradient-based ones!):
| Attribution Method | PyTorch compatible |
| --- | --- |
| Deconvolution | ❌ |
| Grad-CAM | ❌ |
| Grad-CAM++ | ❌ |
| Gradient Input | ✅ |
| Guided Backprop | ❌ |
| Hsic Attribution | ✅ |
| Integrated Gradients | ✅ |
| Kernel SHAP | ✅ |
| Lime | ✅ |
| Occlusion | ✅ |
| Rise | ✅ |
| Saliency | ✅ |
| SmoothGrad | ✅ |
| Sobol Attribution | ✅ |
| SquareGrad | ✅ |
| VarGrad | ✅ |
## Does it work for all tasks?
It works for all tasks covered by Xplique; see the tasks covered and how to specify them.
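For instance, the task is selected through the `operator` argument. A minimal sketch, assuming a hypothetical wrapped regression model named `wrapped_regression_model` and a `"regression"` operator string following the same convention as `"classification"` in the quickstart:

```python
from xplique.attributions import Saliency

# hypothetical wrapped regression model; the operator string selects
# the task, just like "classification" in the quickstart above
explainer = Saliency(wrapped_regression_model, operator="regression")
explanations = explainer(inputs, targets)
```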
## Steps to make Xplique work on PyTorch
### 1. Make sure the inputs follow the Xplique API (and not what the model expects)
One thing to keep in mind is that attribution methods expect a specific input format, as described in the API Description. In particular, image inputs should be \((N, H, W, C)\), following TensorFlow's convention, where:

- \(N\) is the number of inputs
- \(H\) is the height of the images
- \(W\) is the width of the images
- \(C\) is the number of channels

However, if you are using a PyTorch model, it most likely expects images of shape \((N, C, H, W)\). So what should you do?
If you are using PyTorch's preprocessing functions, what you should do is:

- preprocess as usual
- convert the data to a numpy array
- use `np.moveaxis(np_inputs, [1, 2, 3], [3, 1, 2])` to change the shape from \((N, C, H, W)\) to \((N, H, W, C)\), as sketched after the note below
Note
The third step is necessary only if your data has a channel dimension that is not in the position TensorFlow expects.
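Put together, a minimal sketch of these three steps, assuming a standard torchvision preprocessing pipeline and a list of PIL images named `pil_images` (both hypothetical, adapt them to your own data):

```python
import numpy as np
import torch
import torchvision.transforms as T

# hypothetical preprocessing, replace with your model's own pipeline
preprocess = T.Compose([
    T.Resize((224, 224)),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# 1. preprocess as usual: (N, C, H, W) torch tensor
torch_inputs = torch.stack([preprocess(img) for img in pil_images])

# 2. convert the data to a numpy array
np_inputs = torch_inputs.numpy()

# 3. move the channel axis last: (N, C, H, W) -> (N, H, W, C)
inputs = np.moveaxis(np_inputs, [1, 2, 3], [3, 1, 2])
```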
Tip
If you want to see exactly how this works, you can look at the PyTorch models: Getting started notebook and compare it to Attribution methods: Getting started.
### 2. Wrap your model
A `TorchWrapper` object can be initialized with 3 parameters:

`torch_model: torch.nn.Module`
: A torch model that inherits from `nn.Module`

`device: Union['torch.device', str]`
: The device on which the torch model and the inputs should be mounted

`is_channel_first: Optional[bool] = None`
: A boolean that is `True` if the torch model expects a channel dimension and if this dimension comes first
The last parameter is the one that needs special care. Indeed, if it is set to `True`, we assume that the torch model expects its inputs to be \((N, C, H, W)\). As the explainer requires inputs to be \((N, H, W, C)\), we change the inputs' axis order whenever a call is made to the wrapped model (transparently for the user). If it is set to `False`, we do not move the axes at all. By default, the wrapper looks for `torch.nn.Conv2d` layers in the torch model and considers it channel-first if it finds one, and not otherwise.
Info
Your model may use special treatments, or it may not follow the typical conventions. In that case, we encourage you to have a look at the Source Code and adapt it to your needs.
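A minimal sketch of both options, assuming a torchvision ResNet-18 (hypothetical, any `nn.Module` works):

```python
import torch
import torchvision.models as models
from xplique.wrappers import TorchWrapper

torch_model = models.resnet18().eval()  # load your trained weights here
device = 'cuda' if torch.cuda.is_available() else 'cpu'

# rely on the automatic detection: ResNet contains Conv2d layers,
# so the wrapper treats the model as channel-first
wrapped_model = TorchWrapper(torch_model, device)

# or state the expected input layout explicitly
wrapped_model = TorchWrapper(torch_model, device, is_channel_first=True)
```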
### 3. Use this wrapped model as if it were a TensorFlow one
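At this point, the wrapped model behaves like a TensorFlow model from Xplique's point of view. A minimal sketch, reusing `wrapped_model`, `inputs` and `targets` from the previous steps and swapping in another method and metric:

```python
from xplique.attributions import IntegratedGradients
from xplique.metrics import Insertion

# the wrapped model is passed exactly where a tf.keras model would be
explainer = IntegratedGradients(wrapped_model, operator="classification")
explanations = explainer(inputs, targets)

metric = Insertion(wrapped_model, inputs, targets, operator="classification")
score = metric(explanations)
```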
## What are the limitations?
As previously mentioned, this does not work with: Deconvolution, Grad-CAM, Grad-CAM++ and Guided Backpropagation.
Furthermore, white-box explainers offer the possibility to provide an `output_layer` parameter. This functionality will not work with PyTorch models: users will have to manipulate their model themselves!
Warning
The `output_layer` parameter does not work for PyTorch models!
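For instance, instead of `output_layer`, you can often build a truncated torch model yourself and wrap that one instead. A hedged sketch, assuming a mostly sequential model such as a torchvision ResNet, with `torch_model` and `device` as in the previous steps:

```python
import torch.nn as nn
from xplique.wrappers import TorchWrapper

# drop the last layer (e.g. the classification head) to expose
# intermediate activations, then wrap the truncated model
truncated_model = nn.Sequential(*list(torch_model.children())[:-1])
wrapped_truncated = TorchWrapper(truncated_model, device)
```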
It is possible that not all failure cases were covered in the tests. In that case, please open an issue and the team will work on it!