
Computer Engineering ›› 2024, Vol. 50 ›› Issue (2): 91-97. doi: 10.19678/j.issn.1000-3428.0066406

• Artificial Intelligence and Pattern Recognition •


Continuous Evolutionary Neural Architecture Search Based on Biased Sampling

Yu XUE*, Changchang LU

  1. School of Software, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China
  • Received: 2022-11-30 Online: 2024-02-15 Published: 2023-03-20
  • Contact: Yu XUE
  • Supported by: General Program of the National Natural Science Foundation of China (61876089)


Abstract:

Neural Architecture Search (NAS) typically consumes a large amount of time and computing resources because every architecture it discovers must be evaluated independently. To address this challenge, a continuous evolutionary NAS method based on biased sampling (OEvNAS) is proposed. OEvNAS maintains a supernet throughout the architecture search, and every neural network architecture in the search space is a subnetwork of this supernet. In each generation of the evolutionary computation, the supernet is trained for a small number of epochs, after which the subnetworks directly inherit the supernet's weights for performance evaluation without retraining. To improve the supernet's predictive performance, a supernet training strategy based on biased sampling is introduced: well-performing networks are trained with higher probability, which reduces weight coupling while improving training efficiency. In addition, a novel crossover and mutation strategy is designed to strengthen the algorithm's global exploration capability. The performance of OEvNAS is verified on two search spaces, NATS-Bench and Differentiable Architecture Search (DARTS). Experimental results show that OEvNAS outperforms the mainstream algorithms used for comparison. In the NATS-Bench search space, the proposed supernet training strategy achieves excellent prediction performance on CIFAR-10, CIFAR-100, and ImageNet16-200; in the DARTS search space, the best searched architecture achieves classification accuracies of 97.67% and 83.79% on CIFAR-10 and CIFAR-100, respectively.
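The weight-sharing evaluation described in the abstract can be pictured concretely. Below is a minimal PyTorch sketch, not the authors' implementation: each edge of the supernet holds all candidate operations, and a subnetwork is evaluated by routing through one chosen operation per edge, reusing the shared weights without retraining. The class name `MixedEdge` and the particular operation list are illustrative assumptions.

```python
import torch
import torch.nn as nn

class MixedEdge(nn.Module):
    """One supernet edge: all candidate operations coexist and share training;
    a subnetwork activates exactly one of them at evaluation time."""

    def __init__(self, channels: int):
        super().__init__()
        # Assumed NATS-Bench-style candidates (illustrative, not exhaustive).
        self.ops = nn.ModuleList([
            nn.Identity(),                                # skip connection
            nn.Conv2d(channels, channels, 3, padding=1),  # 3x3 convolution
            nn.Conv2d(channels, channels, 1),             # 1x1 convolution
            nn.AvgPool2d(3, stride=1, padding=1),         # 3x3 average pool
        ])

    def forward(self, x: torch.Tensor, choice: int) -> torch.Tensor:
        # The subnetwork encoded by `choice` inherits the shared weights of
        # the selected operation; no per-architecture retraining occurs.
        return self.ops[choice](x)

# A subnetwork is then just one `choice` per edge, e.g.:
edge = MixedEdge(channels=16)
x = torch.randn(1, 16, 32, 32)
y = edge(x, choice=1)  # evaluate the subnet using the 3x3 convolution here
```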
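The interaction between biased supernet training and the evolutionary search can likewise be sketched. The following self-contained Python sketch assumes a NATS-Bench-style encoding (six edges, five candidate operations per edge) and replaces real supernet training and validation with toy stand-ins; `train_one_step` and `inherited_accuracy` are hypothetical placeholders, and the uniform crossover and point mutation shown are generic operators, not the paper's specific variation strategy.

```python
import math
import random

# Assumed NATS-Bench-style encoding: a cell has 6 edges, each choosing one
# of 5 candidate operations; an architecture is a list of 6 indices.
NUM_EDGES, NUM_OPS = 6, 5
POP_SIZE, GENERATIONS, STEPS_PER_GEN = 20, 10, 50

def random_arch():
    return [random.randrange(NUM_OPS) for _ in range(NUM_EDGES)]

# --- Hypothetical stand-ins for the supernet ---------------------------
def inherited_accuracy(arch):
    # Real version: score the subnet with inherited supernet weights on
    # validation data. A toy proxy here keeps the sketch runnable.
    return sum(arch) / (NUM_EDGES * (NUM_OPS - 1))

def train_one_step(arch):
    pass  # real version: one supernet mini-batch step through subnet `arch`

# --- Biased sampling: favour subnets with higher estimated fitness -----
def biased_sample(population, fitness, temperature=0.1):
    weights = [math.exp(f / temperature) for f in fitness]
    return random.choices(population, weights=weights, k=1)[0]

# --- Generic variation operators (illustrative) -------------------------
def crossover(a, b):
    return [random.choice(pair) for pair in zip(a, b)]  # uniform crossover

def mutate(arch, rate=1.0 / NUM_EDGES):
    return [random.randrange(NUM_OPS) if random.random() < rate else op
            for op in arch]

def evolve():
    population = [random_arch() for _ in range(POP_SIZE)]
    for _ in range(GENERATIONS):
        fitness = [inherited_accuracy(a) for a in population]
        # 1) Lightly train the shared supernet, biased toward strong subnets.
        for _ in range(STEPS_PER_GEN):
            train_one_step(biased_sample(population, fitness))
        # 2) Offspring via crossover + mutation; keep the best half (elitism).
        offspring = [mutate(crossover(*random.sample(population, 2)))
                     for _ in range(POP_SIZE)]
        population = sorted(population + offspring,
                            key=inherited_accuracy, reverse=True)[:POP_SIZE]
    return max(population, key=inherited_accuracy)

if __name__ == "__main__":
    print("best architecture found:", evolve())
```

The key line is `biased_sample`: drawing subnets with probability proportional to a softmax of their estimated fitness concentrates supernet updates on promising regions of the search space, which is the biased-sampling idea the abstract describes.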

Key words: Neural Architecture Search (NAS), network performance evaluation, supernet, biased sampling, weight coupling