3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation

arXiv · Hugging Face

This is the official PyTorch implementation of the paper 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation

3D MedDiffusion in Action

Paper Abstract

The generation of medical images presents significant challenges due to their high-resolution and three-dimensional nature. Existing methods often yield suboptimal performance in generating high-quality 3D medical images, and there is currently no universal generative framework for medical imaging. In this paper, we introduce the 3D Medical Diffusion (3D MedDiffusion) model for controllable, high-quality 3D medical image generation. 3D MedDiffusion incorporates a novel, highly efficient Patch-Volume Autoencoder that compresses medical images into latent space through patch-wise encoding and recovers them back into image space through volume-wise decoding. Additionally, we design a new noise estimator to capture both local details and global structure information during the diffusion denoising process. 3D MedDiffusion can generate fine-detailed, high-resolution images (up to 512x512x512) and effectively adapt to various downstream tasks, as it is trained on large-scale datasets covering CT and MRI modalities and different anatomical regions (from head to leg). Experimental results demonstrate that 3D MedDiffusion surpasses state-of-the-art methods in generative quality and exhibits strong generalizability across tasks such as sparse-view CT reconstruction, fast MRI reconstruction, and data augmentation.
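To make the patch-wise/volume-wise idea concrete, the toy sketch below (our illustration, not the repository's model code; `encoder` is a stand-in for the trained patch encoder) shows how a volume can be encoded patch by patch to keep memory bounded before the assembled latent grid is decoded volume-wise:

import torch

def patch_wise_encode(volume, encoder, patch=64):
    # volume: (1, 1, D, H, W) tensor normalized to [-1, 1]
    _, _, D, H, W = volume.shape
    latent_patches = []
    for z in range(0, D, patch):
        for y in range(0, H, patch):
            for x in range(0, W, patch):
                latent_patches.append(encoder(volume[..., z:z + patch, y:y + patch, x:x + patch]))
    return latent_patches  # stitched into one latent grid before volume-wise decoding

# Toy demo with an identity "encoder"; the real encoder maps each patch into latent space.
vol = torch.rand(1, 1, 128, 128, 128) * 2 - 1
print(len(patch_wise_encode(vol, encoder=lambda p: p)))  # 8 non-overlapping 64^3 patches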

✅ ToDo

  • 📦 Training code for single-resolution release
  • 🧠 Pre-trained weights (8x downsampling) release
  • 🌐 Inference code release
  • 📄 Pre-trained weights (4x downsampling) release
  • 📝 Training code for multi-resolution release

Installation

## Clone this repo
git clone https://github.com/ShanghaiTech-IMPACT/3D-MedDiffusion.git


## Set up the environment
conda create -n 3DMedDiffusion python=3.11.11

conda activate 3DMedDiffusion 

pip install -r requirements.txt
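Before training, it can help to verify that the environment actually sees PyTorch and a CUDA device (a generic sanity check, not part of the repository):

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"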

Training

PatchVolume Autoencoder — Stage 1

## 4x compression
python train/train_PatchVolume.py --config config/PatchVolume_4x.yaml

## 8x compression
python train/train_PatchVolume.py --config config/PatchVolume_8x.yaml

Note:

  1. All training images should be normalized to [-1, 1] (a minimal normalization sketch is shown below).
  2. Update the default_root_dir and root_dir fields in config/PatchVolume_4x.yaml / config/PatchVolume_8x.yaml to match your local paths.
  3. Provide a data.json following the format shown in the config/PatchVolume_data.json example.
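A minimal normalization sketch for note 1, assuming NIfTI inputs read with nibabel and simple min-max scaling (the filenames are placeholders, and modality-specific windowing may be preferable in practice):

import nibabel as nib
import numpy as np

img = nib.load("case_0001.nii.gz")            # placeholder input path
vol = img.get_fdata().astype(np.float32)

# Min-max scale intensities to [-1, 1]; swap in CT windowing or MRI percentile
# clipping if that better matches your preprocessing.
vol = (vol - vol.min()) / (vol.max() - vol.min() + 1e-8)
vol = vol * 2.0 - 1.0

nib.save(nib.Nifti1Image(vol, img.affine), "case_0001_norm.nii.gz")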

PatchVolume Autoencoder — Stage 2

## 4x compression
python train/train_PatchVolume_stage2.py --config config/PatchVolume_4x_s2.yaml

## 8x compression
python train/train_PatchVolume_stage2.py --config config/PatchVolume_8x_s2.yaml

Note: Set the resume_from_checkpoint field in PatchVolume_4x.yaml / PatchVolume_8x.yaml to the checkpoint path produced by Stage 1 training.

Encode the Images into Latents

python train/generate_training_latent.py --data-path config/Singleres_dataset.json --AE-ckpt checkpoints/trained_AE.ckpt --batch-size 4

BiFlowNet

torchrun --nnodes=1 --nproc_per_node=8 --master_port 29513 train/train_BiFlowNet_SingleRes.py --data-path config/Singleres_dataset.json --results-dir /input/your/results/dir --num-classes 2 --AE-ckpt input/your/AE/checkpoint/path --resolution 32 32 32 --batch-size 48 --num-workers 48
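As a rough guide (our reading of the flags, not something stated explicitly here), --resolution appears to describe the latent grid, so with the 8x autoencoder a 32x32x32 latent would correspond to 256x256x256 images:

# Hypothetical helper relating image size to the latent grid passed via --resolution.
def latent_resolution(image_shape, downsample_factor):
    return tuple(s // downsample_factor for s in image_shape)

print(latent_resolution((256, 256, 256), 8))  # -> (32, 32, 32)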

Inference

Generation using 8x downsampling

python evaluation/class_conditional_generation.py --AE-ckpt checkpoints/PatchVolume_8x_s2.ckpt --model-ckpt checkpoints/BiFlowNet_0453500.pt --output-dir input/your/save/dir

Generation using 4x downsampling

python evaluation/class_conditional_generation_4x.py --AE-ckpt checkpoints/PatchVolume_4x_s2.ckpt --model-ckpt checkpoints/BiFlowNet_4x.pt --output-dir input/your/save/dir

Note: Make sure your GPU has at least 40 GB of memory available to run inference at all supported resolutions.
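After generation, a quick way to sanity-check an output volume is to load it and inspect its shape and intensity range; the snippet below assumes NIfTI outputs (adjust the filename and format to whatever the script actually writes):

import nibabel as nib

sample = nib.load("input/your/save/dir/sample_0000.nii.gz")  # placeholder output path
vol = sample.get_fdata()
print(vol.shape, vol.min(), vol.max())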

Pretrained Models

The pretrained checkpoints are provided here:

Please download the checkpoints and put them in ./checkpoints.
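If the checkpoints are hosted on the project's Hugging Face page, they can also be fetched programmatically with huggingface_hub; the repo id below is a placeholder to replace with the actual one:

from huggingface_hub import snapshot_download

# Placeholder repo id; substitute the real Hugging Face repository name.
snapshot_download(repo_id="ShanghaiTech-IMPACT/3D-MedDiffusion", local_dir="./checkpoints")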

Acknowledgements

This repository builds upon the following excellent open-source projects: LDMs and medicaldiffusion.
