🎉 Nextra 2.0 is released. Read more →

Changelog

We release features and fixes weekly! See some of the changes we have made.

0.3.1

Date: 25 November 2022

We made large changes to our infrastructure to be more reliable and faster. This means faster deployment, less downtime and roughly half the inference time!

0.2.4

Date: 23 November 2022

Add more models to our Prebuilt library
mt0-xl
galactica
flan-xl

0.2.3

Date: 11 November 2022

Added support for post-processing functions in flows!
Add Prebuilt models to available deployments
whisper-medium

0.2.2

28 October 2022

Fix a bug where the internal flow would be malformed if the model was not ONNX
Force single model ONNX flow until post-processing is supported

0.2.1

Date: 26 October 2022

Support for ONNX models with Python 3.7-3.9 (can be used as a single model or as the initial model of a flow). We are working on supporting Python 3.10 and 3.11!

v0.2.0

Date: 23 October 2022

Model ensembles are here! You can now deploy a sequence of models to an endpoint and have them run in order.
Change the deploy call signature to accommodate for model ensembles.
Added explicit model types to be used in place of the previous plain strings.

v0.1.9

Date: 18 October 2022

Reduce minimum python version to 3.7 (and reduce dependency min versions accordingly). Now compatible with Google Colab!

v0.1.5

Date: 17 October 2022

Update license
Handle usage limits on different Cerebrium packages

v0.1.4

Date: 16 October 2022

Release on the public PyPi!
Refactor the package name to cerebrium

v0.1.3

Date: 12 October 2022

Fixed an issue with data input shapes
Updated our documentation
Released Model/Flow Versioning
Improved Error Handling and Messaging on the Neuron client
Implemented a utility function for easy API testing
Provided a dry-run option for model deployment, which returns a callable function for testing
Fixed a deserialization issue with PyTorch models (requires cloudpickle)
Support for Torchscript models
Support for XGB JSON models

v0.1.2

Date: 7th October 2022

Moved architecture to serverless CPU/GPU's
Support Pytorch, XGBoost and SKLearn models
Updated monitoring metrics per model version
Bug fixes and performance improvements

V0.1.1

Date: 26th September 2022

Released alpha version of Neuron.
Users are able to deploy models from their notebooks or .py files with just 4 lines of code.
Created dashboard interface for users to see deployed models, API calls and errors.
Created this documentation site.

Release Notes Roadmap