Hand Gesture Recognition for human-robot interaction Based on BiLSTM-Boosted EfficientNet Network

المؤلفون

  • Hawraa Abdul-elah Kadhum المؤلف
  • Asmaa Mutar Jaber المؤلف

DOI:

https://doi.org/10.59992/IJSR.2026.v5n2p9

الكلمات المفتاحية:

BBE-Net، BiLSTM-Boosted EfficientNet، Hand Gesture Recognition، Human-robot Interaction

الملخص

This study introduces BBE-Net, a novel BiLSTM-Boosted EfficientNet architecture designed for highly accurate and robust static hand-gesture recognition. The proposed framework integrates EfficientNet as a deep feature extractor and Bidirectional LSTM (BiLSTM) as a spatiotemporal dependency modeler, enabling the network to capture both fine-grained spatial structures and contextual relationships within gesture images. A preprocessing pipeline—consisting of standardized image resizing and histogram equalization—enhances contrast and illumination invariance, producing clearer input representations for feature extraction. EfficientNet generates multi-scale, semantically rich feature maps, which are subsequently refined by BiLSTM layers to model long-range and bidirectional correlations. The resulting discriminative features are classified through an ensemble-learning module that employs Bagging with Decision Trees and majority voting to improve stability and reduce variance. Experiments conducted on the Sebastian Marcel Static Hand Posture Database demonstrate the effectiveness of the proposed method. With extensive augmentation, 10-fold cross-validation, and repeated trials, BBE-Net achieves an accuracy of 99.70%, outperforming several recent state-of-the-art approaches. Analyses using confusion matrices, ROC curves, and class-wise metrics confirm the method’s near-perfect discriminative capability.

السير الشخصية للمؤلفين

  • Hawraa Abdul-elah Kadhum

    Mechatronic Engineering Department, Al-Muthanna University, Al-Muthanna, Iraq 

  • Asmaa Mutar Jaber

    MSc in Construction Management Engineering, Civil Engineering Department,

    Al-Muthanna University, Samawah, Iraq

المراجع

[1] Sarowar, M. S., Farjana, N. E. J., Khan, M. A. I., Mutalib, M. A., Islam, S., & Islam, M. Hand Gesture Recognition Systems: A Review of Methods, Datasets, and Emerging Trends. International Journal of Computer Applications, 975, 8887.

[2] Mohamed, A. S., Hassan, N. F., & Jamil, A. S. (2024). Real-Time Hand Gesture Recognition: A Comprehensive Review of Techniques, Applications, and Challenges. Cybernetics and Information Technologies, 24(3), 163-181.

[3] Hashi, A. O., Hashim, S. Z. M., & Asamah, A. B. (2024). A systematic review of hand gesture recognition: An update from 2018 to 2024. IEEE Access, 12, 143599-143626.

[4] Murad, B. K., & Alasadi, A. H. H. (2024). Advancements and challenges in hand gesture recognition: A comprehensive review. Iraqi Journal for Electrical and Electronic Engineering, 20(2), 154-164.

[5] Hax, D. R. T., Penava, P., Krodel, S., Razova, L., & Buettner, R. (2024). A novel hybrid deep learning architecture for dynamic hand gesture recognition. IEEE Access, 12, 28761-28774.

[6] Rahman, M. M., Uzzaman, A., Khatun, F., Aktaruzzaman, M., & Siddique, N. (2025). A comparative study of advanced technologies and methods in hand gesture analysis and recognition systems. Expert Systems with Applications, 266, 125929.

[7] Kim, B., & Seo, S. (2023). EfficientNetV2-based dynamic gesture recognition using transformed scalogram from triaxial acceleration signal. Journal of Computational Design and Engineering, 10(4), 1694-1706.

[8] Hussain, A., Ul Amin, S., & Fayaz, M. (2023). An Efficient and Robust Hand Gesture Recognition System of Sign Language Employing Finetuned Inception-V3 and Efficientnet-B0 Network. Computer Systems Science & Engineering, 46(3).

[9] Rezaee, K., Khavari, S. F., Ansari, M., Zare, F., & Roknabadi, M. H. A. (2024). Hand gestures classification of sEMG signals based on BiLSTM-metaheuristic optimization and hybrid U-Net-MobileNetV2 encoder architecture. Scientific Reports, 14(1), 31257.

[10] Tchantchane, R., Zhou, H., Zhang, S., & Alici, G. (2023). A review of hand gesture recognition systems based on noninvasive wearable sensors. Advanced intelligent systems, 5(10), 2300207.

[11] Alam, M. M., Islam, M. T., & Rahman, S. M. (2022). Unified learning approach for egocentric hand gesture recognition and fingertip detection. Pattern recognition, 121, 108200.

[12] Jency Rubia, J., Babitha Lincy, R., & Sherin Shibi, C. (2024). Hybrid Convolution-Based Efficientnet-Based Hand Gesture Recognition Framework with Optimized Algorithm. International Journal of Pattern Recognition and Artificial Intelligence, 38(12), 2456008.

[13] Singh, R. P., & Singh, L. D. (2025). Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods. The Visual Computer, 41(1), 41-51.

[14] Ilham, A. A., & Nurtanio, I. (2024). Applying LSTM and GRU methods to recognize and interpret hand gestures, poses, and face-based sign language in real time. Journal of Advanced Computational Intelligence and Intelligent Informatics, 28(2), 265-272.

[15] Xi, J., Zhang, W., Xu, Z., Zhu, S., Tang, L., & Zhao, L. (2025). Three-dimensional dynamic gesture recognition method based on convolutional neural network. High-Confidence Computing, 5(1), 100280.

[16] Karsh, B., Laskar, R. H., & Karsh, R. K. (2024). mIV3Net: modified inception V3 network for hand gesture recognition. Multimedia Tools and Applications, 83(4), 10587-10613.

[17] Al Mudawi, N., Ansar, H., Alazeb, A., Aljuaid, H., AlQahtani, Y., Algarni, A., ... & Liu, H. (2024). Innovative healthcare solutions: robust hand gesture recognition of daily life routines using 1D CNN. Frontiers in Bioengineering and Biotechnology, 12, 1401803.

[18] Miah, A. S. M., Hasan, M. A. M., & Shin, J. (2023). Dynamic hand gesture recognition using multi-branch attention based graph and general deep learning model. IEEE Access, 11, 4703-4716.

[19] López, L. I. B., Ferri, F. M., Zea, J., Caraguay, Á. L. V., & Benalcázar, M. E. (2024). CNN-LSTM and post-processing for EMG-based hand gesture recognition. Intelligent Systems with Applications, 22, 200352.

[20] Chanda, Bristy, and Hussain Nyeem. "Depth-Aware Spatiotemporal Fusion for Advancing Dynamic Hand Gesture Recognition." Available at SSRN 5011540.

[21] Biswas, Sougatamoy, et al. "Attention-enabled hybrid convolutional neural network for enhancing human–robot collaboration through hand gesture recognition." Computers and electrical engineering 123 (2025): 110020.

[22] Awaluddin, Baiti-Ahmad, Chun-Tang Chao, and Juing-Shian Chiou. "A hybrid image augmentation technique for user-and environment-independent hand gesture recognition based on deep learning." Mathematics 12.9 (2024): 1393.

[23] Idiap Research Institute. "Gesture Database." Idiap, https://www.idiap.ch/webarchives/sites/www.idiap.ch/resource/gestures. Accessed 24 Nov. 2025.

التنزيلات

منشور

2026-02-15

إصدار

القسم

Articles

كيفية الاقتباس

Hand Gesture Recognition for human-robot interaction Based on BiLSTM-Boosted EfficientNet Network. (2026). المجلة الدولية للبحوث العلمية, 5(2). https://doi.org/10.59992/IJSR.2026.v5n2p9