24. července 2023
Speaker · 0 followers
Speaker · 0 followers
Deep neural networks have been demonstratedto achieve phenomenal success in many domains,and yet their inner mechanisms are not well understood. In this paper, we investigate the curvatureof image manifolds, i.e., the manifold deviationfrom being flat in its principal directions. We findthat state-of-the-art trained convolutional neuralnetworks for image classification have a characteristic curvature profile along layers: an initial steepincrease, followed by a long phase of a plateau,and followed by another increase. In contrast,this behavior does not appear in untrained networks in which the curvature flattens. We alsoshow that the curvature gap between the last twolayers has a strong correlation with the generalization capability of the network. Moreover, we findthat the intrinsic dimension of latent codes is notnecessarily indicative of curvature. Finally, weobserve that common regularization methods suchas mixup yield flatter representations when compared to other methods. Our experiments showconsistent results over a variety of deep learningarchitectures and multiple data sets.Deep neural networks have been demonstratedto achieve phenomenal success in many domains,and yet their inner mechanisms are not well understood. In this paper, we investigate the curvatureof image manifolds, i.e., the manifold deviationfrom being flat in its principal directions. We findthat state-of-the-art trained convolutional neuralnetworks for image classification have a characteristic curvature profile along layers: an initial steepincrease, followed by a long phase of a plateau,and follow…
Profesionální natáčení a streamování po celém světě.
Prezentace na podobné téma, kategorii nebo přednášejícího
Biao Liu, …
Yuchao Cai, …
Yuchen Liu, …