This is the implementation of our diffusion-based generative counterfactual augmentation framework.
Figure: (a) SD-v1.5 without fine-tuning, (b-c) SD-v1.5 fine-tuned on the RSNA dataset.
# If you hit import issues with 'cached_download' (removed from recent huggingface_hub releases), see:
https://github.com/easydiffusion/easydiffusion/issues/1851
# Can't import DIFFUSERS_REQUEST_TIMEOUT?
# Workaround: add 'DIFFUSERS_REQUEST_TIMEOUT = 60' to diffusers/utils/constants.py (Python 3.10 environment)
pip install diffusers==0.25.0
pip install transformers==4.39.3
pip install accelerate
pip install datasets
pip install --upgrade Pillow

# Single-concept DreamBooth fine-tuning
python train_dreambooth.py --pretrained_model_name_or_path="sd-legacy/stable-diffusion-v1-5" --instance_data_dir="../CXR/datasets/rsna/" --output_dir="saved_models/one_concept_db/" --instance_prompt="photo of a Chest X-ray" --resolution=512 --train_batch_size=1 --gradient_accumulation_steps=1 --learning_rate=5e-6 --lr_scheduler="constant" --lr_warmup_steps=0 --max_train_steps=4000

# Multi-concept DreamBooth fine-tuning (concepts defined in concepts.json)
python train_dreambooth_shivam.py --pretrained_model_name_or_path="sd-legacy/stable-diffusion-v1-5" --resolution=512 --train_batch_size=1 --gradient_accumulation_steps=1 --learning_rate=5e-6 --lr_scheduler="constant" --lr_warmup_steps=0 --max_train_steps=4000 --concepts_list="concepts.json" --output_dir="saved_models/n_concepts_db/"

# Textual Inversion
python textual_inversion.py --pretrained_model_name_or_path="sd-legacy/stable-diffusion-v1-5" --train_data_dir="../CXR/datasets/rsna/" --learnable_property="object" --placeholder_token="<x-ray>" --initializer_token="ray" --resolution=512 --train_batch_size=1 --gradient_accumulation_steps=4 --max_train_steps=4000 --learning_rate=5.0e-04 --scale_lr --lr_scheduler="constant" --lr_warmup_steps=0 --output_dir="saved_models/textual_inversion_cxr"

# Custom Diffusion
python train_custom_diffusion.py --pretrained_model_name_or_path="sd-legacy/stable-diffusion-v1-5" --instance_data_dir="../CXR/datasets/rsna/" --output_dir="saved_models/custom_diffusion_cxr" --class_data_dir="../CXR/datasets/rsna/" --real_prior --prior_loss_weight=1.0 --class_prompt="rsna x-ray" --num_class_images=30000 --instance_prompt="photo of a <rsna> x-ray" --resolution=512 --train_batch_size=2 --learning_rate=1e-5 --lr_warmup_steps=0 --max_train_steps=4000 --scale_lr --hflip --modifier_token "<rsna>" --validation_prompt="<rsna> x-ray" --no_safe_serialization

# InstructPix2Pix training
python train_instruct_pix2pix.py --pretrained_model_name_or_path="sd-legacy/stable-diffusion-v1-5" --dataset_name="../CXR/datasets/cxr_instruct_HFdataset" --resolution=256 --random_flip --train_batch_size=4 --gradient_accumulation_steps=4 --gradient_checkpointing --max_train_steps=15000 --checkpointing_steps=1000 --checkpoints_total_limit=2 --learning_rate=5e-05 --max_grad_norm=1 --lr_warmup_steps=0 --conditioning_dropout_prob=0.05 --mixed_precision=fp16 --seed=42 --output_dir="saved_models/instruct-pix2pix-model" --dataloader_num_workers 8

# To resume from the last checkpoint, add (fill in the checkpoint path):
# --resume_from_checkpoint ""

Kulkarni et al., Hidden in Plain Sight, MIDL 2024.
@article{kulkarni2024hidden,
title={Hidden in Plain Sight: Undetectable Adversarial Bias Attacks on Vulnerable Patient Populations},
author={Kulkarni, Pranav and Chan, Andrew and Navarathna, Nithya and Chan, Skylar and Yi, Paul H and Parekh, Vishwa S},
journal={arXiv preprint arXiv:2402.05713},
year={2024}
}
@misc{von-platen-etal-2022-diffusers,
author = {Patrick von Platen and Suraj Patil and Anton Lozhkov and Pedro Cuenca and Nathan Lambert and Kashif Rasul and Mishig Davaadorj and Dhruv Nair and Sayak Paul and William Berman and Yiyi Xu and Steven Liu and Thomas Wolf},
title = {Diffusers: State-of-the-art diffusion models},
year = {2022},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/huggingface/diffusers}}
}