DL背后的理论

深度学习中保证局部最优解可以作为全局最优解近似解的合理性:

Dauphin, Y. et al. Identifying and attacking the saddle point problem in high- dimensional non-convex optimization. In Proc. Advances in Neural Information Processing Systems 27 2933–2941 (2014).

Choromanska, A., Henaff, M., Mathieu, M., Arous, G. B. & LeCun, Y. The loss surface of multilayer networks.

对深层网络来说,初始值的设置不合理会使得网络难以收敛:

K. He, X. Zhang, S. Ren, J. Sun, Delving deep into rectifiers: surpassing human- level performance on ImageNet classification, 2015, /arXiv:1502.01852S.

简单堆叠网络深度测试准确率会更差(degradation problem): R. K. Srivastava, K. Greff, and J. Schmidhuber. Highway networks. arXiv:1505.00387, 2015. Deep Residual Learning for Image Recognition arXiv:1512.03385v1

DL背后的理论

DL背后的理论

results matching ""

No results matching ""