Mathematical Analysis of Neural network - Theory and Computational Network

Authors

  • J. Satish Kumar
  • B. Archana
  • V. Senthil Kumar

DOI:

https://doi.org/10.52783/jns.v14.3033

Keywords:

Neural networks, mathematical analysis, optimization, computational framework, convergence, stability

Abstract

Neural networks revolutionized artificial intelligence and machine learning because their mathematical basis directly influences their operational efficiency and achievement of desired results. The research examines neural networks through theoretical calculation methods to study mathematical expressions and optimization techniques and stability assessment algorithms. Records of numerical training methods with their respective convergence properties appear in the analysis. Well-designed neural network structures require stringent mathematical models because they produce efficient and resistant network architectures.

Downloads

Download data is not yet available.

References

R. M. Golden, Mathematical Methods for Neural Network Analysis and Design. MIT Press, 1996. Available: https://dl.acm.org/doi/10.5555/548086

W. E, C. Ma, S. Wojtowytsch, and L. Wu, "Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't," arXiv preprint arXiv:2009.10713, 2020. Available: https://arxiv.org/abs/2009.10713

S. Albeverio and B. Tirozzi, "An introduction to the mathematical theory of neural networks," in Fourth Granada Lectures in Computational Physics, P. L. Garrido and J. Marro, Eds. Springer, 1997, pp. 197–221. Available: https://doi.org/10.1007/BFb0105988

J. J. Hopfield, "Neural networks and physical systems with emergent collective computational abilities," Proceedings of the National Academy of Sciences, vol. 79, no. 8, pp. 2554–2558, 1982. Available: https://doi.org/10.1073/pnas.79.8.2554

D. Amit, Modeling Brain Function: The World of Attractor Neural Networks. Cambridge University Press, 1989. Available: https://doi.org/10.1017/CBO9780511623257

S. Haykin, Neural Networks: A Comprehensive Foundation, 2nd ed. Prentice Hall, 1998. Available: https://www.pearson.com/us/higher-education/program/Haykin-Neural-Networks-A-Comprehensive-Foundation-2nd-Edition/PGM332539.html

C. M. Bishop, Neural Networks for Pattern Recognition. Oxford University Press, 1995. Available: https://global.oup.com/academic/product/neural-networks-for-pattern-recognition-9780198538646

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998. Available: https://doi.org/10.1109/5.726791

G. Cybenko, "Approximation by superpositions of a sigmoidal function," Mathematics of Control, Signals and Systems, vol. 2, no. 4, pp. 303–314, 1989. Available: https://doi.org/10.1007/BF02551274

K. Hornik, M. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, vol. 2, no. 5, pp. 359–366, 1989. Available: https://doi.org/10.1016/0893-6080(89)90020-8

Y. Bengio, P. Simard, and P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 157–166, 1994. Available: https://doi.org/10.1109/72.279181

S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. Available: https://doi.org/10.1162/neco.1997.9.8.1735

D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," arXiv preprint arXiv:1412.6980, 2014. Available: https://arxiv.org/abs/1412.6980

S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," in Proceedings of the 32nd International Conference on Machine Learning, 2015, pp. 448–456. Available: https://proceedings.mlr.press/v37/ioffe15.html

X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks," in Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, 2010, pp. 249–256. Available: https://proceedings.mlr.press/v9/glorot10a.html

R. Pascanu, T. Mikolov, and Y. Bengio, "On the difficulty of training recurrent neural networks," in Proceedings of the 30th International Conference on Machine Learning, 2013, pp. 1310–1318. Available: https://proceedings.mlr.press/v28/pascanu13.html

M. Abadi et al., "TensorFlow: Large-scale machine learning on heterogeneous systems," 2015. Available: https://www.tensorflow.org/

A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1097–1105. Available: https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html

J. Berner, P. Grohs, G. Kutyniok, and P. Petersen, "The Modern Mathematics of Deep Learning," arXiv preprint arXiv:2105.04026, 2021. Available: https://arxiv.org/abs/2105.04026

J. Sirignano and K. Spiliopoulos, "Mean Field Analysis of Neural Networks: A Law of Large Numbers," arXiv preprint arXiv:1805.01053, 2018. Available: https://arxiv.org/abs/1805.01053

M. Taheri, F. Xie, and J. Lederer, "Statistical Guarantees for Regularized Neural Networks," arXiv preprint arXiv:2006.00294, 2020. Available: https://arxiv.org/abs/2006.00294

Z. Lu, H. Pu, F. Wang, Z. Hu, and L. Wang, "The Expressive Power of Neural Networks: A View from the Width," in Advances in Neural Information Processing Systems 30 (NIPS 2017), 2017. Available: https://proceedings.neurips.cc/paper/2017/hash/34fbce0bb6b0b3c8ac2ed8bfc9674ff0-Abstract.html

A. Pinkus, "Approximation Theory of the MLP Model in Neural Networks," ActaNumerica, vol. 8, pp. 143–195, 1999. Available: https://doi.org/10.1017/S0962492900002919

L. Grigoryeva and J.-P. Ortega, "Echo State Networks are Universal," Neural Networks, vol. 108, pp. 279–285, 2018. Available: https://doi.org/10.1016/j.neunet.2018.08.025

Downloads

Published

2025-04-04

How to Cite

1.
Kumar JS, B. Archana BA, Kumar VS. Mathematical Analysis of Neural network - Theory and Computational Network. J Neonatal Surg [Internet]. 2025 Apr. 4 [cited 2026 May 25];14(11S):607-14. Available from: https://jneonatalsurg.com/index.php/jns/article/view/3033