PhD Position F/M: Computer Vision / Deep Learning: Video Generation
Context and advantages of the position
Inria, the French National Institute for computer science and applied mathematics, promotes “scientific excellence for technology transfer and society”. Graduates from the world’s top universities, Inria's 2,700 employees rise to the challenges of digital sciences. With its open, agile model, Inria is able to explore original approaches with its partners in industry and academia and provide an efficient response to the multidisciplinary and application challenges of the digital transformation. Inria is the source of many innovations that add value and create jobs.
Team
The STARS research team combines advanced theory with cutting-edge practice, focusing on cognitive vision systems.
Team website:
Assignment
The Inria STARS team is seeking a Ph.D. researcher with a strong background in computer vision, deep learning, and machine learning.
The candidate is expected to conduct research related to generative models, including the development of computer vision algorithms for image and video generation.
Main activities
Context:
Generative models have attracted increased interest from academia and industry, due to their exceptional capacity to generate highly realistic images. Videos are more complex data, because of the additional temporal dimension. While some research works have shown early results in video generation, many questions in the field remain open.
The thesis will first investigate how to design model architectures for video generation.
Learning 3D-aware models from 2D data has become a popular research topic in image generation. In this thesis, we will go one step further in this direction to explore novel view synthesis in video generation.
Finally, we will aim to design a universal model that is able to generate videos across categories. Most current models focus on generating a single category (e.g., faces, sky…). Currently, there are no models that are able to generate complex multi-category videos (e.g., Kinetics-600). We plan to increase the complexity of video generative models and design a large-scale video generative model. The objective is to study whether big generative models are able to capture the distribution of complex video datasets and create semantically meaningful videos.
Skills
Candidates must hold a Master's degree or equivalent in Computer Science or a closely related discipline by the start date.
The candidate must be grounded in the basics of computer vision and have solid mathematical and programming skills, preferably in Python, OpenCV, and a deep learning framework such as PyTorch or TensorFlow.
The candidate must be committed to scientific research and to producing strong publications.
Benefits
Remuneration
Gross salary per month: 2100€ (years 1 and 2) and 2190€ (year 3)