v0.9.0-alpha - multi-dataset edition #249
Closed
bghira
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Changes
There's more info about these in
OPTIONS.mdandTUTORIAL.md.New behaviour
"scan_for_errors": trueit will be read entirely at startup and any bad images will be removed ifdelete_problematic_images: true. It will remove any outdated cache entries."disabled": truein the dataset config entry.Removed arguments
multidatabackend.json--data_backendis now--data_backend_configand is a path to a dataset config, seemultidatabackend.json.examplefor help converting your existing configurations overNew arguments
--data_backend_configDATALOADER_CONFIGinsdxl-env.sh--override_dataset_config--vae_cache_behaviour.jsoncache files from your dataset and switch the data backend config to use square images rather than aspect-crops. This will result in an inconsistent data cache, which can be corrected by settingscan_for_errorstotruein yourmultidatabackend.jsonconfiguration file. When this scan runs, it relies on the setting of--vae_cache_behaviourto determine how to resolve the inconsistency:recreate(the default) will remove the offending cache entry so that it can be recreated, andsyncwill update the bucket metadata to reflect the reality of the real training sample. Recommended value:recreate.Full Changelog: v0.8.2...v0.9.0-alpha
This discussion was created from the release v0.9.0-alpha - multi-dataset edition.
Beta Was this translation helpful? Give feedback.
All reactions