基于改进pix2pix的红外图像转换技术
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

基金项目:


Infrared image conversion technology based on improved pix2pix
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对不同波段图像获取代价不同的问题,提出一种基于pix2pix的图像转换方法并进行改进。主要针对生成器和鉴别器两方面进行改进。生成器方面,使用残差结构的生成器替换原来的U-Net生成器以缓解梯度消失问题;引入可变形卷积,提高目标边缘和小目标的生成效果;引入BAM注意力机制,提高了算法对图像中主要目标的特征提取能力以提升生成图像的效果。鉴别器方面:改变PatchGAN中卷积层的层数(原PatchGAN为3层卷积),设置对照实验找到转换效果最好的卷积层数。以可见光图像和红外图像之间的转换为例进行实验。实验结果表明,改进后的算法在生成图像上的均方根误差(MSE)下降了314%、结构相似性(SSIM)提高了112%,可以更好的实现红外图像和可见光图像之间的转换。

    Abstract:

    In order to solve the problem of different cost of image acquisition in different light segments,an image conversion method based on pix2pix was proposed.It mainly focuses on the generator and discriminator.In terms of generators,the residual structures generator was used instead of the original U Net generator to alleviate the gradient vanishing problem.Deformable convolution is introduced to improve the generation effect of target edges and small targets.The BAM attention mechanism is introduced to improve the feature extraction ability of the algorithm for the main target in the image to improve the image generation effect.In terms of discriminators:change the number of convolutional layers in PatchGAN (the original PatchGAN is 3 layer convolution),and set up a control experiment to find the convolutional layer with the best conversion effect.Some KAIST datasets are selected for training and testing.The experimental results show that the Root Mean Square Error (MSE) of the improved algorithm is reduced by 31.4% and the Structural Similarity (SSIM) is increased by 11.2%,which can better realize the conversion between infrared and visible images.

    参考文献
    相似文献
    引证文献
引用本文

叶明亮,史春景,郝永平,李大伟.基于改进pix2pix的红外图像转换技术[J].激光与红外,2024,54(7):1157~1163
YE Ming-liang, SHI Chun-jing, HAO Yong-ping, LI Da-Wei. Infrared image conversion technology based on improved pix2pix[J]. LASER & INFRARED,2024,54(7):1157~1163

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:2023-12-26
  • 录用日期:
  • 在线发布日期: 2024-07-23
  • 出版日期: