AI Seminar - Özlem Aslan

  • University of Alberta, Computing Sciences Centre 3-33, Edmonton, Alberta

Convex Deep Modeling

Abstract: "Training deep predictive models with latent hidden layers poses a hard computational problem: since the model parameters have to be trained jointly with inference over latent variables, the convexity of the training problem is usually destroyed. In this talk, we first present a novel reformulation of supervised training of a two-layer architecture by introducing a latent feature kernel, which allows a rich set of latent feature representations to be captured while still allowing useful convex formulations via semidefinite relaxation. To tackle the resulting computational problem, efficient training algorithms are developed to exploit the specific structure of the problem.

In practice, deeper models have been essential for obtaining state-of-the-art results. Therefore, we then show that the two-layer approach can be extended to handle an arbitrary number of latent layers. To achieve this extension, a novel layer loss is proposed that is jointly convex in the adjacent normalized latent feature kernels. An efficient algorithmic approach is then developed for this extended formulation, yielding promising empirical results. These results demonstrate the first fully convex formulation of training a deep architecture with an arbitrary number of hidden layers."

