U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

Fenghe Tang^1,2, Chengqi Dong, Wenxin Ma^1,2, Zikang Xu³,
Heqin Zhu^1,2, Zihang Jiang^1,2, Rongsheng Wang^1,2, Yuhao Wang^1,2, Chenxu Wu^1,2, Shaohua Kevin Zhou^1,2

¹ School of Biomedical Engineering, University of Science and Technology of China
² Suzhou Institute for Advanced Research, University of Science and Technology of China
³ Anhui Province Key Laboratory of Biomedical Imaging and Intelligent Processing

SoTA paper & weights sharing 🤗

🔥🔥🔥 Welcome to share the paper, code and weights through the Issues and Discussions ! 🔥🔥🔥

News

25-10-16. U-Bench Model Zoo weights released [Quick Access] 🎉🎉🎉
25-10-15. U-Bench Data Zoo released [Quick Access] 🎉🎉🎉
25-10-08. U-Bench paper released 🎉🎉🎉

Catalog🚀🚀🚀

U-Bench Model Zoo weights 🤗🤗🤗
U-Bench Data Zoo 🤗🤗🤗
U-Bench code 🤗🤗🤗
U-Bench paper 🤗🤗🤗

Abstract

Over the past decade, U-Net has been the dominant architecture in medical image segmentation, leading to the development of thousands of U-shaped variants. Despite its widespread adoption, a comprehensive benchmark to systematically assess the performance and utility of these models is lacking, primarily due to insufficient statistical validation and limited attention to efficiency and generalization across diverse datasets. To address this gap, we present U-Bench, the first large-scale, statistically rigorous benchmark that evaluates 100 U-Net variants across 28 datasets and 10 imaging modalities. Our contributions are threefold: (1) Comprehensive Evaluation: U-Bench evaluates models along three key dimensions: statistical robustness, zero-shot generalization, and computational efficiency. We introduce a novel metric, U-Score, which jointly captures the performance-efficiency trade-off, offering a deployment-oriented perspective on model progress. (2) Systematic Analysis and Model Selection Guidance: We summarize key findings from the large-scale evaluation and systematically analyze the impact of dataset characteristics and architectural paradigms on model performance. Based on these insights, we propose a model advisor agent to guide researchers in selecting the most suitable models for specific datasets and tasks. (3) Public Availability: We provide all code, models, protocols, and weights, enabling the community to reproduce our results and extend the benchmark with future methods. In summary, U-Bench not only exposes gaps in previous evaluations but also establishes a foundation for fair, reproducible, and practically relevant benchmarking in the next decade of U-Net-based segmentation models.

Anaylsis🧐

(1) Trends Analysis

(2) Significance Analysis

(3) Data Characteristics Analysis

Quick Start 🤩🤩🤩

Model Zoo	Data Zoo
Code \| Weights (Hugging Face)	Data (Hugging Face)

1. Installation

git clone https://github.com/FengheTan9/U-Bench.git
cd U-Bench
conda create -n ubench python=3.9 -y  
conda activate ubench  
pip install -r requirements.txt

2. Datasets

Please put the dataset (e.g. BUSI) or your own dataset as the following architecture:

└── U-Bench
    ├── data
        ├── busi
            ├── images
            |   ├── benign (10).png
            │   ├── malignant (17).png
            │   ├── ...
            |
            └── masks
                ├── 0
                |   ├── benign (10).png
                |   ├── malignant (17).png
                |   ├── ...
        ├── your dataset
            ├── images
            |   ├── 0a7e06.png
            │   ├── ...
            |
            └── masks
                ├── 0
                |   ├── 0a7e06.png
                |   ├── ...
    ├── dataloader
    ├── models
    ├── utils
    ├── script
    ├── main.py
    └── main_multi3d.py

3. Training & Validation

# BUSI (in-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/busi --dataset_name busi
# BUSBRA (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/BUSBRA --dataset_name BUSBRA
# ISIC18 (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/isic18 --dataset_name isic18
# SkinCancer (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/uwaterlooskincancer --dataset_name uwaterlooskincancer
# Kvasir (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/Kvasir-SEG --dataset_name Kvasir-SEG
# CHASE (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/CHASEDB1 --dataset_name CHASEDB1
# DRIVE (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/DRIVE --dataset_name DRIVE
# DSB2018 (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/DSB2018 --dataset_name DSB2018
# GlaS (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/Glas --dataset_name Glas
# Monusac (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/monusac --dataset_name monusac
# Cell (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/cellnuclei --dataset_name cellnuclei
# Convidquex (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/covidquex --dataset_name covidquex
# Montgomery (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/Montgomery --dataset_name Montgomery
# DCA (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/dca1 --dataset_name dca1
# Cystoidfluid (In-domain)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/cystoidfluid --dataset_name cystoidfluid
# Synapse (3D-Slice)
python main_multi3d.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/synapse --dataset_name synapse --num_classes 9 --input_channel 3 --val_interval 100
# ACDC (3D-Slice)
python main_multi3d.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/ACDC--dataset_name ACDC --num_classes 4 --input_channel 3 --val_interval 100

4. In-domain Inference

U-Bench Model Zoo [Quick Access]

# BUSI
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/busi --dataset_name busi --just_for_test True

5. Zero-shot Inference

# BUSI -> BUS (zero-shot)
python main.py --max_epochs 300 --gpu 0 --batch_size 8 --model U_Net --base_dir ./data/busi --dataset_name busi --zero_shot_base_dir ./data/bus --zero_shot_dataset_name bus --just_for_zero_shot

5. U-Score calculator

Please refer U-Score calculator

6. Results

Citation

If using this work (dataset, weights, or benchmark results), please cite:

@article{tang2025u,
  title={U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking},
  author={Tang, Fenghe and Dong, Chengqi and Ma, Wenxin and Xu, Zikang and Zhu, Heqin and Jiang, Zihang and Wang, Rongsheng and Wang, Yuhao and Wu, Chenxu and Zhou, Shaohua Kevin},
  journal={arXiv preprint arXiv:2510.07041},
  year={2025}
}

Contact

For questions or collaborations:

Email: [email protected]
GitHub Issues: Open Issue

⭐ Star this repo if you find it useful!

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
dataloader		dataloader
imgs		imgs
models		models
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
download_datasets_from_huggingface.py		download_datasets_from_huggingface.py
download_weights_from_huggingface.py		download_weights_from_huggingface.py
main.py		main.py
main_multi3d.py		main_multi3d.py
requirements.txt		requirements.txt
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

SoTA paper & weights sharing 🤗

News

Catalog🚀🚀🚀

Abstract

Anaylsis🧐

(1) Trends Analysis

(2) Significance Analysis

(3) Data Characteristics Analysis

Quick Start 🤩🤩🤩

1. Installation

2. Datasets

3. Training & Validation

4. In-domain Inference

5. Zero-shot Inference

5. U-Score calculator

6. Results

Citation

Contact

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

FengheTan9/U-Bench

Folders and files

Latest commit

History

Repository files navigation

U-Bench: A Comprehensive Understanding of U-Net through 100-Variant Benchmarking

SoTA paper & weights sharing 🤗

News

Catalog🚀🚀🚀

Abstract

Anaylsis🧐

(1) Trends Analysis

(2) Significance Analysis

(3) Data Characteristics Analysis

Quick Start 🤩🤩🤩

1. Installation

2. Datasets

3. Training & Validation

4. In-domain Inference

5. Zero-shot Inference

5. U-Score calculator

6. Results

Citation

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages