基于深度学习的端到端视频压缩算法设计与实现

项目简介

本项目是一个基于 PyTorch 的端到端视频压缩框架，旨在探究神经网络在去除视频时空冗余方面的有效性。核心创新点在于引入了 CBAM (Convolutional Block Attention Module) 注意力机制，以提升视频重建的主观视觉质量。

目录结构

src/
  models/
    attention.py    # CBAM 注意力模块实现 (Scheme A)
    motion.py       # SPyNet 光流估计网络
    compression.py  # 运动压缩与残差压缩网络 (包含 CBAM)
    video_net.py    # 完整的视频压缩模型
  utils/
    metrics.py      # PSNR, MS-SSIM 评价指标
requirements.txt    # 依赖库
train.py            # 训练脚本 (示例)

环境配置

请确保安装了 Python 3.8+ 和 PyTorch。

pip install -r requirements.txt

核心架构

Motion Net: 使用 SPyNet 估计光流。
Motion Compensation: 基于光流进行运动补偿。
Residual Net: 引入 CBAM 的残差压缩网络。
Entropy Model: 使用 compressai 的熵瓶颈层。

运行指南

(此处为示例，需根据具体数据集调整)

python train.py --dataset /path/to/vimeo90k

评价指标

BPP (Bits Per Pixel): 压缩率
PSNR: 峰值信噪比
MS-SSIM: 多尺度结构相似性

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
sample_dataset/sequences/00001/0001		sample_dataset/sequences/00001/0001
src		src
README.md		README.md
checkpoint_attn_epoch_0.pth		checkpoint_attn_epoch_0.pth
checkpoint_attn_epoch_1.pth		checkpoint_attn_epoch_1.pth
checkpoint_attn_epoch_19.pth		checkpoint_attn_epoch_19.pth
checkpoint_attn_epoch_2.pth		checkpoint_attn_epoch_2.pth
checkpoint_attn_epoch_29.pth		checkpoint_attn_epoch_29.pth
checkpoint_attn_epoch_3.pth		checkpoint_attn_epoch_3.pth
checkpoint_attn_epoch_39.pth		checkpoint_attn_epoch_39.pth
checkpoint_attn_epoch_4.pth		checkpoint_attn_epoch_4.pth
checkpoint_attn_epoch_49.pth		checkpoint_attn_epoch_49.pth
checkpoint_attn_epoch_9.pth		checkpoint_attn_epoch_9.pth
checkpoint_epoch_0.pth		checkpoint_epoch_0.pth
checkpoint_no_attn_epoch_0.pth		checkpoint_no_attn_epoch_0.pth
checkpoint_no_attn_epoch_1.pth		checkpoint_no_attn_epoch_1.pth
checkpoint_no_attn_epoch_19.pth		checkpoint_no_attn_epoch_19.pth
checkpoint_no_attn_epoch_2.pth		checkpoint_no_attn_epoch_2.pth
checkpoint_no_attn_epoch_29.pth		checkpoint_no_attn_epoch_29.pth
checkpoint_no_attn_epoch_3.pth		checkpoint_no_attn_epoch_3.pth
checkpoint_no_attn_epoch_39.pth		checkpoint_no_attn_epoch_39.pth
checkpoint_no_attn_epoch_4.pth		checkpoint_no_attn_epoch_4.pth
checkpoint_no_attn_epoch_49.pth		checkpoint_no_attn_epoch_49.pth
checkpoint_no_attn_epoch_9.pth		checkpoint_no_attn_epoch_9.pth
comparison_attn.png		comparison_attn.png
comparison_attn_50.png		comparison_attn_50.png
comparison_attn_final.png		comparison_attn_final.png
comparison_attn_trainmode.png		comparison_attn_trainmode.png
comparison_no_attn.png		comparison_no_attn.png
comparison_no_attn_50.png		comparison_no_attn_50.png
comparison_no_attn_final.png		comparison_no_attn_final.png
download_sample.py		download_sample.py
foreman_qcif.y4m		foreman_qcif.y4m
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

基于深度学习的端到端视频压缩算法设计与实现

项目简介

目录结构

环境配置

核心架构

运行指南

评价指标

About

Uh oh!

Releases

Packages

Languages

JiangZhan-s/dmt

Folders and files

Latest commit

History

Repository files navigation

基于深度学习的端到端视频压缩算法设计与实现

项目简介

目录结构

环境配置

核心架构

运行指南

评价指标

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages