Semantic Pyramid for Image Generation (CVPR 2020), lecture by the author


Details
Google Research and Weizmann Institute of Science feature inversion model to generate image space representations from classification classes.
The model provides a unified versatile framework for various image generation and manipulation tasks, including: (a) generating images with a controllable extent of semantic similarity to a reference image, obtained by reconstructing images from different layers of a classification model; (b) generating realistic image samples from unnatural reference image such as line drawings; (c) semantically compositing different images, and (d) controlling the semantic content of an image by enforcing a new, modified class label.
The lecturer is the paper's author.
Lecture abstract:
We present a novel GAN-based model that utilizes the space of deep features learned by a pre-trained classification model. Inspired by classical image pyramid representations, we construct our model as a Semantic Generation Pyramid - a hierarchical framework which leverages the continuum of semantic information encapsulated in such deep features; this ranges from low level information contained in fine features to high level, semantic information contained in deeper features. More specifically, given a set of features extracted from a reference image, our model generates diverse image samples, each with matching features at each semantic level of the classification model. We demonstrate that our model results in a versatile and flexible framework that can be used in various classic and novel image generation tasks. These include: generating images with a controllable extent of semantic similarity to a reference image, and different manipulation tasks such as semantically-controlled in-painting and compositing; all achieved with the same model, with no further training.
https://arxiv.org/abs/2003.06221
Project website: https://semantic-pyramid.github.io/
Presenter BIO:
Assaf Shocher is a deep Learning and Computer Vision researcher, working in Google Research and Weizmann Institute of Science.
Linkedin: https://www.linkedin.com/in/assaf-shocher-271424b7
This is a technical deep learning talk, prior DL knowledge is advised.
** ** Please register through the zoom link right after your RSVP. We will send the links to the zoom event via email only to those who have registered through zoom. ** **
-------------------------
Find us at:
All lectures are uploaded to our Youtube channel ➜ https://www.youtube.com/channel/UCHObHaxTXKFyI_EI8HiQ5xw
Newsletter for updates about more events ➜ http://eepurl.com/gJ1t-D
Sub-reddit for discussions ➜ https://www.reddit.com/r/2D3DAI/
Discord server for, well, discord ➜ https://discord.gg/MZuWSjF
Blog ➜ https://2d3d.ai

Semantic Pyramid for Image Generation (CVPR 2020), lecture by the author