Investigation of Parallel and Hyperparameters Strategy on Performance of Image Classification Training

Yannan Cao; Weiran Shen

doi:10.2991/978-94-6463-300-9_105

<Previous Article In Volume

Next Article In Volume>

Investigation of Parallel and Hyperparameters Strategy on Performance of Image Classification Training

Authors

Yannan Cao¹^{, *}, Weiran Shen²

¹Software Engineering, Dalian University of Technology, Shenyang, 116081, China

²Software Engineering, Beijing University of Technology, Beijing, 100124, China

^*Corresponding author. Email: natascha1203@mail.dlut.edu.cn

Corresponding Author

Yannan Cao

Available Online 27 November 2023.

DOI: 10.2991/978-94-6463-300-9_105 How to use a DOI?
Keywords: Convolutional Neural Networks; Image classification; GPU utilization; Data parallel
Abstract: Convolutional Neural Networks (CNNs) have witnessed widespread adoption in the domain of image classification, while deep neural networks have been developed to tackle more intricate tasks. In the experimental investigation, a remarkable downward trend in GPU utilization was observed as the batch size of the LeNet model was increased, regardless of the parallel or non-parallel mechanism employed. The research findings establish that this phenomenon can be ascribed to a constraint in data loading speed, which in turn diminishes the efficiency of training when dealing with larger batch sizes, ultimately resulting in reduced GPU utilization. To mitigate this issue, the data loading thread are enhanced by adjusting the "num_worker" parameter in the dataloader, thereby investigating its impact on GPU utilization. Moreover, a series of comprehensive experiments are conducted to ascertain the appropriate learning rates required for maintaining satisfactory classification accuracy when utilizing large batch sizes. This paper contributes to the field in two primary ways. Firstly, it identifies the cause of decreased GPU utilization when the batch size is increased and proposes a solution to enhance efficiency. Secondly, it verifies the adjustment of learning rates when adopting large batch sizes to achieve comparable loss curves and classification accuracies.
Copyright: © 2023 The Author(s)
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 2023 International Conference on Image, Algorithms and Artificial Intelligence (ICIAAI 2023)
Series: Advances in Computer Science Research
Publication Date: 27 November 2023
ISBN: 978-94-6463-300-9
ISSN: 2352-538X
DOI: 10.2991/978-94-6463-300-9_105 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - Yannan Cao
AU  - Weiran Shen
PY  - 2023
DA  - 2023/11/27
TI  - Investigation of Parallel and Hyperparameters Strategy on Performance of Image Classification Training
BT  - Proceedings of the 2023 International Conference on Image, Algorithms and Artificial Intelligence (ICIAAI 2023)
PB  - Atlantis Press
SP  - 1046
EP  - 1061
SN  - 2352-538X
UR  - https://doi.org/10.2991/978-94-6463-300-9_105
DO  - 10.2991/978-94-6463-300-9_105
ID  - Cao2023
ER  -

download .riscopy to clipboard