Real-Time Emotion Detection using Hybrid CNN-BiLSTM Deep Learning Model
Keywords:
Emotion recognition, CNN, Bi-LSTM, Hybrid, FER, Intelligent systemAbstract
Emotion recognition from facial expressions is a critical task in affective computing and human-computer interaction, with applications spanning healthcare, education, surveillance, and entertainment. Traditional convolutional neural networks (CNNs) have shown promising results in extracting spatial features from facial images, but often lack the temporal sensitivity needed to capture nuanced emotional patterns. In this, the hybrid deep learning model is proposed that integrates CNN and Bidirectional Long Short-Term Memory (BiLSTM) layers for real-time facial emotion recognition. The CNN layers effectively extract hierarchical spatial features from the FER-2013 dataset—a benchmark dataset consisting of 48x48 grayscale facial images categorized into seven basic emotion classes. These extracted features are reshaped and passed to BiLSTM units that model temporal dependencies and contextual relevance within spatial encodings. Our method results in notable enhancements in performance over baseline CNN models, with enhanced recognition accuracy, particularly in distinguishing subtle emotions like fear and sadness. Experimental evaluation using confusion matrices and classification reports confirms the robustness of the hybrid architecture. The results suggest that the integration of BiLSTM with CNN offers a more expressive and context-aware solution for real-time emotion detection, paving the way for more adaptive and emotionally intelligent systems.
Downloads
References
Goodfellow, I., et al., "Challenges in representation learning: A report on three machine learning contests", Neural Networks, 2013.
2. Mollahosseini, A., Hasani, B., & Mahoor, M. H., "AffectNet: A database for facial expression, valence, and arousal computing in the wild", IEEE Transactions on Affective Computing, 2017.
3. Chollet, F., "Deep Learning with Python", Manning Publications, 2018.
4. FER-2013 Dataset: https://www.kaggle.com/datasets/msambare/fer2013
5. Mollahosseini, A., Hasani, B., & Mahoor, M. H. (2017). AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild. IEEE Transactions on Affective Computing, 10(1), 18–31. https://doi.org/10.1109/TAFFC.2017.2740923
6. Khorrami, P., Paine, T., & Huang, T. S. (2015). Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition? In Proceedings of the IEEE International Conference on Computer Vision Workshops (pp. 19–27).
https://doi.org/10.1109/ICCVW.2015.11
7. Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Processing Letters, 23(10), 1499–1503. https://doi.org/10.1109/LSP.2016.2603342
8. Li, X., Song, D., Zhang, P., et al. (2018). Hybrid Deep Neural Network for Automatic Facial Expression Recognition. Journal of Visual Communication and Image Representation,62,1–8. https://doi.org/10.1016/j.jvcir.2019.04.001
9. Goodfellow, I., Erhan, D., Luc Carrier, P., Courville, A., Mirza, M., Hamner,B.,&Bengio,Y.(2013), Challenges in Representation Learning: A Report on Three Machine Learning Contests. In Neural Information Processing (pp. 117–124). Springer. https://doi.org/10.1007/978-3-642-42051-1_16
10. Lakshmi, M., & Rajesh, R. (2020).Facial Emotion Recognition Using CNN and BiLSTM. International Journal of Advanced Computer Science and Applications (IJACSA),11(9),508–514. https://doi.org/10.14569/IJACSA.2020.0110964
11. S.Shivaprasad Dr.M Sadanandam “ Dialect Identification using modified features with Deep neural networks” Traitement du Signal, Vol. 38, No. 6, December, 2021, pp. 1793-1799,2021. https://doi.org/10.18280/ts.380622.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.