Releases: jrzaurin/pytorch-widedeep
mps backend support and more rec models
Added support for MPS backend
Added a series of models to the rec module: DCN, DCNv2, GDCN, AutoInt, AutoIntPlus
Added a DIN preprocessor
Reviewed the docs
Reviewed the examples
Other (minor and not so minor) fixes
The `rec` module
- After a number of issues opened and questions in slack about recommendation algorithms in the library I decided to include a rec module that initially contains a small number of recommendation algorithms. These are:
- Factorisation Machines (FM) and DeepFM
- Field Aware Factorisation Machines (FFM) and DeepFFM
- Extreme Deep Factorisation Machines (xDeepFM)
- Deep Interest Networks (DIN)
We will add more in the near future.
Multiple tabular components
- Added support to multiple tabular models for different columns (that adds to multiple text and image cols from previous versions)
- Removed support for FDS and LDS
- Carries the possibility of saving the optimiser which was added in the version 1.6.2 (short-life and never published)
Patch to limit numpy to version lower than 2.0
This is a quick patch to fix numpy>=1.21.6, <2.0.0
Otherwise, is exactly the same as 1.6.0
Huggingface integration, multi-text and image column support and multi target loss functions
What's Changed
- Huggingface integration by @jrzaurin in #209
- Multi text and image column support by @jrzaurin in #215
- Support for multi target loss functions by @jrzaurin in #215
- README has been almost completely re-written, with drawings of 7 possible architectures (where the boxes/component can be any of the models in the library) and fully runnable examples with a toy dataset that anyone can use as a starting point.
Full Changelog: v1.5.1...v1.6.0
Model Attributes named correctly
Mostly fixed issue #204
Embedding Methods for Numerical Features
Added two new embedding methods for numerical features described in On Embeddings for Numerical Features in Tabular Deep Learning and adjusted all models and functionalities accordingly
The `load_from_folder` module
This release mainly adds the functionality to be able to deal with large datasets via the load_from_folder
module.
This module is inspired by the ImageFolder
class in the torchvision
library but adapted to the needs of our library. See the docs for details.
Flash and Linear Attention mechanisms added to the TabTransformer
- Added Flash Attention
- Added Linear Attention
- Revisited and polished the docs
pytorch-widedeep in the context of recsys
- Added example scripts and notebooks on how to use the library in the context of recommendation systems using this notebook as example. This is a response to issue #133
- Used the opportunity to add the movielens 100k dataset to the library, so that now it can be imported from the datasets module
- Added a simple (not pre-trained) transformer model to to the text component
- Added citation file
- Fix a bug regarding the padding index not being 1 when using the fastai transforms