Mathematical Analysis of Neural network - Theory and Computational Network
DOI:
https://doi.org/10.52783/jns.v14.3033Keywords:
Neural networks, mathematical analysis, optimization, computational framework, convergence, stabilityAbstract
Neural networks revolutionized artificial intelligence and machine learning because their mathematical basis directly influences their operational efficiency and achievement of desired results. The research examines neural networks through theoretical calculation methods to study mathematical expressions and optimization techniques and stability assessment algorithms. Records of numerical training methods with their respective convergence properties appear in the analysis. Well-designed neural network structures require stringent mathematical models because they produce efficient and resistant network architectures.
Downloads
References
R. M. Golden, Mathematical Methods for Neural Network Analysis and Design. MIT Press, 1996. Available: https://dl.acm.org/doi/10.5555/548086
W. E, C. Ma, S. Wojtowytsch, and L. Wu, "Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't," arXiv preprint arXiv:2009.10713, 2020. Available: https://arxiv.org/abs/2009.10713
S. Albeverio and B. Tirozzi, "An introduction to the mathematical theory of neural networks," in Fourth Granada Lectures in Computational Physics, P. L. Garrido and J. Marro, Eds. Springer, 1997, pp. 197–221. Available: https://doi.org/10.1007/BFb0105988
J. J. Hopfield, "Neural networks and physical systems with emergent collective computational abilities," Proceedings of the National Academy of Sciences, vol. 79, no. 8, pp. 2554–2558, 1982. Available: https://doi.org/10.1073/pnas.79.8.2554
D. Amit, Modeling Brain Function: The World of Attractor Neural Networks. Cambridge University Press, 1989. Available: https://doi.org/10.1017/CBO9780511623257
S. Haykin, Neural Networks: A Comprehensive Foundation, 2nd ed. Prentice Hall, 1998. Available: https://www.pearson.com/us/higher-education/program/Haykin-Neural-Networks-A-Comprehensive-Foundation-2nd-Edition/PGM332539.html
C. M. Bishop, Neural Networks for Pattern Recognition. Oxford University Press, 1995. Available: https://global.oup.com/academic/product/neural-networks-for-pattern-recognition-9780198538646
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324, 1998. Available: https://doi.org/10.1109/5.726791
G. Cybenko, "Approximation by superpositions of a sigmoidal function," Mathematics of Control, Signals and Systems, vol. 2, no. 4, pp. 303–314, 1989. Available: https://doi.org/10.1007/BF02551274
K. Hornik, M. Stinchcombe, and H. White, "Multilayer feedforward networks are universal approximators," Neural Networks, vol. 2, no. 5, pp. 359–366, 1989. Available: https://doi.org/10.1016/0893-6080(89)90020-8
Y. Bengio, P. Simard, and P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 157–166, 1994. Available: https://doi.org/10.1109/72.279181
S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. Available: https://doi.org/10.1162/neco.1997.9.8.1735
D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," arXiv preprint arXiv:1412.6980, 2014. Available: https://arxiv.org/abs/1412.6980
S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," in Proceedings of the 32nd International Conference on Machine Learning, 2015, pp. 448–456. Available: https://proceedings.mlr.press/v37/ioffe15.html
X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks," in Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, 2010, pp. 249–256. Available: https://proceedings.mlr.press/v9/glorot10a.html
R. Pascanu, T. Mikolov, and Y. Bengio, "On the difficulty of training recurrent neural networks," in Proceedings of the 30th International Conference on Machine Learning, 2013, pp. 1310–1318. Available: https://proceedings.mlr.press/v28/pascanu13.html
M. Abadi et al., "TensorFlow: Large-scale machine learning on heterogeneous systems," 2015. Available: https://www.tensorflow.org/
A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems 25, 2012, pp. 1097–1105. Available: https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html
J. Berner, P. Grohs, G. Kutyniok, and P. Petersen, "The Modern Mathematics of Deep Learning," arXiv preprint arXiv:2105.04026, 2021. Available: https://arxiv.org/abs/2105.04026
J. Sirignano and K. Spiliopoulos, "Mean Field Analysis of Neural Networks: A Law of Large Numbers," arXiv preprint arXiv:1805.01053, 2018. Available: https://arxiv.org/abs/1805.01053
M. Taheri, F. Xie, and J. Lederer, "Statistical Guarantees for Regularized Neural Networks," arXiv preprint arXiv:2006.00294, 2020. Available: https://arxiv.org/abs/2006.00294
Z. Lu, H. Pu, F. Wang, Z. Hu, and L. Wang, "The Expressive Power of Neural Networks: A View from the Width," in Advances in Neural Information Processing Systems 30 (NIPS 2017), 2017. Available: https://proceedings.neurips.cc/paper/2017/hash/34fbce0bb6b0b3c8ac2ed8bfc9674ff0-Abstract.html
A. Pinkus, "Approximation Theory of the MLP Model in Neural Networks," ActaNumerica, vol. 8, pp. 143–195, 1999. Available: https://doi.org/10.1017/S0962492900002919
L. Grigoryeva and J.-P. Ortega, "Echo State Networks are Universal," Neural Networks, vol. 108, pp. 279–285, 2018. Available: https://doi.org/10.1016/j.neunet.2018.08.025
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
Terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.