Package to accelerate research on generalized out-of-distribution (OOD) detection.
Under development. Please report any issues or bugs here.
This library is aimed at assisting researchers in the field of generalized OOD detection. It is inspired by HF's Transformers and features implementations of baselines, metrics, and data sets that allow researchers to perform meaningful benchmarking and development of ood detection methods. It features:
methods
: more than 20 detection methods implemented.pipelines
: evaluating OOD detectors on popular benchmarks, such as MNIST, CIFAR, and ImageNet benchmarks with random seed support for reproducibility.datasets
: OOD datasets implemented with md5 checksums and without the need to download them manually.models
: model architectures totally integrated with timm
.eval
: implementation of fast OOD evaluation metrics.Please follow the instructions here to install PyTorch. Installing PyTorch with CUDA support is strongly recommended.
pip install detectors
To install the latest version from the source:
git clone https://github.com/edadaltocg/detectors.git
cd detectors
pip install --upgrade pip setuptools wheel
pip install -e .
Also, you have easy access to the Python scripts from the examples:
cd examples
The following examples show how to use the library and how it can be integrated into your research. For more examples, please check the documentation.
The following example shows how to run a benchmark.
import detectors
import torch
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = detectors.create_model("resnet18_cifar10", pretrained=True)
model = model.to(device)
test_transform = detectors.create_transform(model)
pipeline = detectors.create_pipeline("ood_benchmark_cifar10", transform=test_transform)
method = detectors.create_detector("msp", model=model)
pipeline_results = pipeline.run(method)
print(pipeline.report(pipeline_results["results"]))
We recommend running benchmarks on machines equipped with large RAM and GPUs with 16GB of memory or larger to leverage large batch sizes and faster inference.
The following example shows how to create a detector. The only requirement is that the method takes an input x
and returns a score.
import torch
import detectors
@detectors.register_detector("awesome_detector")
def awesome_detector(x: torch.Tensor, model, **kwargs):
# Do something awesome with the model and the input
return scores
# Instantiating the detector
method = detectors.create_detector("awesome_detector", model=model)
Alternatively, you can use the Detector
class to create a detector that requires some initialization or state to be fitted before being called (e.g., Mahalanobis detector):
import torch
import detectors
@detectors.register_detector("awesome_detector")
class AwesomeDetector(detectors.Detector):
def __init__(self, model, **kwargs):
self.model = model
def __call__(self, x: torch.Tensor, **kwargs):
# Do something awesome with the model and the input
return scores
# Instantiating the detector
method = detectors.create_detector("awesome_detector", model=model)
Check the documentation for more information.
The following example shows how to list all available resources in the library.
import detectors
# list all available models (same as timm.list_models)
print(detectors.list_models())
# list all available models with a specific pattern
print(detectors.list_models("*cifar*"))
# list all available datasets
print(detectors.list_datasets())
# list all available detectors
print(detectors.list_detectors())
# list all available pipelines
print(detectors.list_pipelines())
Methods
Pipelines
Pypi
As an open-source project in a rapidly developing field, we are open to contributions, whether in the form of a new feature, improved infra, or better documentation.
See the contributing guidelines for instructions on how to make your first contribution to detectors
.
Concerning this package, its use, and bugs, use the issue page of the ruptures repository. For other inquiries, you can contact me here.
The detection of Out-of-Distribution (OOD) has created a new way of securing machine intelligence, but despite its many successes, it can be difficult to understand due to the various methods available and their intricate implementations. The fast pace of research and the wide range of OOD methods makes it challenging to navigate the field, which can be a problem for those who have recently joined the field or want to deploy OOD detection. The library we have created aims to lower these barriers by providing a resource for researchers of any background to understand the methods available, how they work, and how to be successful with OOD detection.
If you find this repository useful, please consider giving it a star 🌟 and citing it as below:
@software{detectors2023,
author = {Eduardo Dadalto},
title = {Detectors: a Python Library for Generalized Out-Of-Distribution Detection},
url = {https://github.com/edadaltocg/detectors},
doi = {https://doi.org/10.5281/zenodo.7883596},
month = {5},
year = {2023}
}