一种高效的自监督元迁移小样本学习算法

doi:10.19665/j.issn1001-2400.2021.06.007

摘要/Abstract

摘要：

当前深度学习的一个关键难点即小样本问题。尽管已出现了一些较有效的小样本算法,但现有方法的模型提取的特征有限,且模型的泛化能力较弱。另外,如果新类的数据和训练集中数据的分布差异大,分类结果就会很差。针对已有算法的上述缺陷,提出了残差注意力膨胀卷积网络作为网络模型的特征提取器,膨胀分支的设计增大了模型感受野且可以提取不同尺寸的特征,基于图片的残差注意力增强了模型对重要特征的关注度。提出基于自监督的网络模型预训练算法,预训练阶段使用自监督方式,对图像数据进行不同角度旋转且建立相应标签,设计基于图像结构信息的旋转分类器,增加了训练任务中的监督信息,以增强对数据信息进一步挖掘及算法的泛化能力。以目前一些性能最优的小样本算法作为基准性能对比算法,在标准的小样本数据集miniImageNet和Fewshot-CIFAR100上,将文中算法与基准算法进行了充分地实验。实验结果表明:该算法取得了最新最好的性能。

关键词: 小样本, 自监督, 膨胀卷积, 残差注意力

Abstract:

A key difficulty of current deep learning is the problem of few shots.Although some more effective few-shot algorithms/models have appeared,the existing deep models have limited features and the ability of the models to make generalization is low.If the distribution of the data in the new class and that of the data in the training dataset differ greatly,the classification result will be poor.In view of the above-mentioned shortcomings of the existing algorithms,the author proposes the residual attention dilation convolutional network as the feature extractor of the network model.The design of dilation branch increases the model’s receptive field and can extract features of different sizes.Image-based residual attention enhances the model’s attention to important features.A self-supervised network model pre-training algorithm is proposed.The self-supervised method is used in the pre-training stage to rotate the image data at different angles and establish corresponding labels.The rotation classifier based on the image structure information is designed to increase the supervision information in the training task so as to enhance the further mining of data information and the ability of the algorithm to make generalization.On the benchmark few-shot datasets miniImageNet and Fewshot-CIFAR100,the algorithm proposed in this paper is compared with the latest and best few-shot algorithm,with experimental results showing that the algorithm in this paper has achieved the latest and best performance.

Key words: few-shot, self-supervised, dilated convolution, residual attention

中图分类号:

TP183

史家辉,郝小慧,李雁妮. 一种高效的自监督元迁移小样本学习算法[J]. 西安电子科技大学学报, 2021, 48(6): 48-56.

SHI Jiahui,HAO Xiaohui,LI Yanni. Efficient self-supervised meta-transfer algorithm for few shot learning[J]. Journal of Xidian University, 2021, 48(6): 48-56.

图/表 8

图1

图2

图3

图4

图5

图6

表1

表2

参考文献 27

[1]	XIONG W, HE Y, ZHANG Y, et al. Fine-Grained Image-To-Image Transformation Towards Visual Recognition[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2020:5840-5849.
[2]	WANG D L, CHENJ.Supervised Speech Separation Based on Deep Learning:An Overview[J]. IEEE/ACM Transactions on Audio,Speech,and Language Processing, 2018, 26(10):1702-1726. doi: 10.1109/TASLP.6570655
[3]	ZHOU M, DUAN N, LIU S, et al. Progress In Neural NLP:Modeling,Learning,And Reasoning[J]. Engineering, 2020, 6(3):275-290. doi: 10.1016/j.eng.2019.12.014
[4]	晏媛, 孙俊, 孙晶明, 等. 雷达小样本目标识别方法及应用分析[J]. 系统工程与电子技术, 2021, 43(3):684-692.
	Yan Yuan, Sun Jun, SunJingming,et al.Radar few-shot target recognition method and application analysis[J]. System engineering and electronic technology, 2021, 43(3):684-692.
[5]	LITJENS G, KOOI T, BEJNORDI B E, et al. A survey on deep learning in medical image analysis[J]. Medical image analysis, 2017, 42:60-88. doi: 10.1016/j.media.2017.07.005
[6]	HUANG X, KWIATKOWSKA M, WANG S, et al. Safety Verification Of Deep Neural Networks[C]// International conference on computer aided verification.Heidelberg:Springer, 2017:3-29.
[7]	FEI-FEI L, FERGUS R, PERONA P. One-Shot Learning Of Object Categories[J]. IEEE transactions on pattern analysis and machine intelligence, 2006, 28(4):594-611. doi: 10.1109/TPAMI.2006.79
[8]	ELSKEN T, STAFFLER B, METZEN J H, et al. Meta-Learning Of Neural Architectures For Few-Shot Learning[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2020:12365-12375.
[9]	CUBUK E D, ZOPH B, MANE D, et al. Autoaugment:Learning augmentation policies from data[EB/OL]. [2018-8-13]https://arxiv.org/pdf/1805.09501.pdf .
[10]	SNELL J, SWERSKY K, ZEMEL R S. Prototypical networks for few-shot learning[J]. Advances in Neural Information Processing[EB/OL]. [2017-12-4]https//proceedings.newrips.cc/paper/.2010/file/cb8dab7674611f2812ae4290eac70bc42-paper.pbf .
[11]	SUNG F, YANG Y, ZHANG L, et al. Learning to Compare:Relation Network for Few-Shot Learning[C]// Proceedings of the IEEE conference on computer vision and pattern recognition.Piscataway:IEEE, 2018:1199-1208.
[12]	FINN C, ABBEEL P, LEVINE S. Model-Agnostic Meta-Learning For Fast Adaptation Of Deep Networks[C]// International Conference on Machine Learning.Sydney:PMLR, 2017:1126-1135.
[13]	ANTONIOU A, EDWARDS H, STORKEY A. How to train your MAML[EB/OL]. [2019-7-25]https://openreview.net/pdf?id=HJGven05Y7 .
[14]	ORESHKIN B N, RODRIGUEZ P, LACOSTE A. TADAM:Task dependent adaptive metric for improved few-shot learning[EB/OL].[2018-12-03]https://proceedings.neurips.cc/paper/2018/hash/66808e327dc79d135ba18e051673d906-Abstract.html.
[15]	KESHARI R, VATSA M, SINGH R, et al. Learning structure and strength of CNN filters for small sample size training[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2018:9349-9358.
[16]	HUANG J, RATHOD V, SUN C, et al. Speed/Accuracy Trade-Offs For Modern Convolutional Object Detectors[C]// Proceedings of the IEEE conference on computer vision and pattern recognition.Piscataway:IEEE, 2017:7310-7311.
[17]	ZHU R, ZHANG S, WANG X, et al. ScratchDet:Training Single-Shot Object Detectors From Scratch[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2019:2268-2277.
[18]	SUN Q, MA L, OH S J, et al. Natural and effective obfuscation by headinpainting[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2018:5050-5059.
[19]	LIU X, DENG Z, YANG Y. Recent progress in semantic image segmentation[J]. Artificial Intelligence Review, 2019, 52(2):1089-1106. doi: 10.1007/s10462-018-9641-3
[20]	CHEN L C, PAPANDREOU G, KOKKINOS I, et al. DeepLab:Semantic Image Segmentation with Deep Convolutional Nets,Atrous Convolution,and Fully Connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848. doi: 10.1109/TPAMI.2017.2699184
[21]	SUN Q, LIU Y, CHUA T S, et al. Meta-Transfer Learning For Few-Shot Learning[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2019:403-412.
[22]	YU Z, CHEN L, CHENG Z, et al. Transmatch:A Transfer-Learning Scheme For Semi-Supervised Few-Shot Learning[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway:IEEE, 2020:12856-12864.
[23]	LIAN D, ZHENG Y, XU Y, et al. Towards Fast Adaptation of Neural Architectures With Meta Learning[EB/OL]. [2020-04-26]https://openreview.net/forum?id=r1eowANFvr .
[24]	MITCHELL TM. Machine learning,International Edition[EB/OL]. [1997-07-29]https://www.worldcat.org/oclc/61321007 .
[25]	WANG Y, YAO Q, KWOK J T, et al. Generalizing From A Few Examples:A Survey On Few-Shot Learning[J]. ACM Computing Surveys, 2020, 53(3):1-34.
[26]	HE K, ZHANG X, REN S, et al. Deep Residual Learning For Image Recognition[C]// Proceedings of the IEEE conference on computer vision and pattern recognition.Piscataway:IEEE, 2016:770-778.
[27]	PATACCHIOLA M, TURNER K, CROWLEY E J, et al. Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels[EB/OL]. [2020-12-06]https://proceedings.neurips.cc//paper/2020/file/b9cfe8b6042cf759dc4c0cccb27a6737-Paper.pdf .

算法名称	miniImageNet		FC100
算法名称	5-way-1-shot/(%)	5-way-5-shot/(%)	5-way-1-shot/(%)	5-way-5-shot/(%)	5-way-10-shot/(%)
MAML^[12](2017)	48.70±1.75	63.11±0.92	38.1±1.7	50.4±1.0	56.2±0.8
MAML++^[13](2019)	52.15±0.26	68.32±0.44	38.7±0.4	52.9±0.4	58.8±0.4
TADAM^[14](2018)	57.80±0.30	75.90±0.30	40.1±0.4	56.1±0.4	61.6±0.5
MTL^[21](2019)	57.10±1.40	72.80±0.70	43.8±1.4	55.4±1.1	61.2±0.6
TransMatch^[22](2020)	58.43±0.93	76.43±0.61	44.7±1.1	55.8±0.7	63.9±0.7
T-NAS^[23](2020)	54.11±1.30	69.59±0.80	40.4±1.2	54.6±0.9	60.2±0.7
DKT^[27](2020)	49.73±0.07	64.73±0.21	39.4±0.9	52.1±0.4	57.8±0.3
SS-MTL(ours)	59.80±1.60	76.60±0.90	45.7±1.3	56.9±1.1	64.2±0.9

实验设置	Pretrain/(%)		2-SS-Pretrain/(%)		4-SS-Pretrain/(%)
实验设置	Meta-Tr	Meta-Te	Meta-Tr	Meta-Te	Meta-Tr	Meta-Te
5-way-1-shot	59.7	57.1	60.2	57.9	62.4	58.4
5-way-5-shot	73.6	72.8	74.7	73.5	75.8	74.3