Releases: ZCode-opensource/image-artisan-xl
v0.3.0
New in this version:
Datasets
Create and manage datasets for training. You can crop the images and create captions for each of them individually or in bulk. Additionally, the option to use AI (FuseCap) for automatic captioning has been added. However, this should be used as a guide or aid, as the quality of a dataset with auto-generated captions is typically poor.
Training
A new module for training has been added (currently, Dreambooth LoRA). This module allows you to use the created datasets to train a LoRA. All of this can be done without the need to use the command line or connect to an external source.
Bucketed training wasn’t incorporated, as its value wasn’t clear. The models seem to train effectively using the default square ratio. Also, dealing with images of sizes up to 512x2048 presents no issues.
The inclusion of repeat images or data augmentation was not seen as beneficial. Repeating images equates to performing more epochs without augmentation. Augmentation, which often requires bigger size images and bucketing, often results in captions that don’t align if the images are cropped in areas being described by the caption. This can reduce the quality of the dataset. These features may be of interest to those who wish to experiment. However, for effective fine-tuning or LoRA, manual intervention is necessary, which aligns with the purpose of this software.
Exciting enhancements are on the horizon!
- IP Adapters: Add even more tools to your arsenal with this powerful adapters.
- Image to Image: Transform your visual content with our soon-to-be-launched Image to Image feature.
- Inpainting: Experience the magic of seamless photo editing with our forthcoming Inpainting tool.
Stay tuned for these game-changing additions that will take your productivity to new heights!
v0.2.0
New in version 0.2.0:
Welcome screen and initial configuration for first time use:
New downloader menu for essentials, Controlnet and T2I adapters:
Controlnet added to this release with a advanced controls for moving, zooming and painting the images, can also copy and paste between annotators.
Multiple controlnets can be used for image composition:
and T2I adapters which are lightweight and also add Line Art and Sketch that aren't available with Controlnet for SDXL yet.
Also new improved installer using NSIS from Nullsoft, no more console window, just a small weight installer that dowloads everything you need to get started with image generation.
Finally, since this application uses diffusers, SDXL Turbo is also supported.