Moacir A. Ponti

SIBGRAPI 2017 Tutorial

Everything you wanted to know about Deep Learning for Computer Vision but were afraid to ask

(An overview of Deep Learning methods for Computer Vision with applications)

Authors: Moacir A. Ponti, Leonardo S. F. Ribeiro, Tiago S. Nazare, Tu Bui, John Collomosse

ICMC - Universidade de São Paulo; CVSSP - University of Surrey



Deep Learning methods are currently the state-of-the-art in many Computer Vision and Image Processing problems, in particular image classification. After years of intensive investigation, a few models matured and became important tools, including Convolutional Neural Networks (CNNs), Siamese and Triplet Networks, Auto-Encoders (AEs) and Generative Adversarial Networks (GANs). The field is fast-paced and there is a lot of terminologies to catch up for those who want to adventure in Deep Learning waters. This paper has the objective to introduce the most fundamental concepts of Deep Learning for Computer Vision in particular CNNs, AEs and GANs, including architectures, inner workings and optimization. We offer an updated description of the theoretical and practical knowledge of working with those models. After that, we describe Siamese and Triplet Networks, not often covered in tutorial papers, as well as review the literature on recent and exciting topics such as visual stylization, pixel-wise prediction and video processing. Finally, we discuss the limitations of Deep Learning for Computer Vision.


Source code


  • Tutorial Survey Paper: M.Ponti; L.Ribeiro; T.Nazare; T.Bui; J.Collomosse. Everything you wanted to know about Deep Learning for Computer Vision but were afraid to ask. PDF Preprint, 2017.

  • Journal Paper: T.Bui; L.Ribeiro; M.Ponti; J.Collomosse. Compact descriptors for sketch-based image retrieval using a triplet loss convolutional neural network. Computer Vision and Image Understanding PDF Preprint, Paper web page with code and data, 2017 .

  • Conference Paper: T.Nazare; G.Costa; W.Contato; M.Ponti. Deep Neural Networks and Noisy Images. CIARP. PDF Preprint, 2017 .

  • Submitted Paper: R.F.Mello; M.Dais; M.Ponti. Providing theoretical learning guarantees to Deep Learning Networks. arXiv preprint arXiv:1711.10292. PDF Preprint, 2017 .

  • Book Chapter (in Portuguese) M.Ponti; G.Costa. Como Funciona o Deep Learning. Tópicos em Gerenciamento de Dados e Informações. PDF Book Chapter, 2017 .

Last update October, 2017