Quick facts

    • Technopole de Sophia Antipolis

Application deadline: 2025-05-30

PhD Position F/M - Computer Vision / Deep Learning: Video Generation

Published: 2025-03-31

Context and advantages of the position

Inria, the French National Institute for computer science and applied mathematics, promotes “scientific excellence for technology transfer and society”. Graduates from the world’s top universities, Inria's 2,700 employees rise to the challenges of digital sciences. With its open, agile model, Inria is able to explore original approaches with its partners in industry and academia and provide an efficient response to the multidisciplinary and application challenges of the digital transformation. Inria is the source of many innovations that add value and create jobs.

Team

The STARS research team combines advanced theory with cutting-edge practice, focusing on cognitive vision systems.

Team web site :

Assigned mission

The Inria STARS team is seeking a Ph.D. researcher with a strong background in computer vision, deep learning, and machine learning.

The candidate is expected to conduct research related to generative models, including the development of computer vision algorithms for image and video generation.

Main activities

Context:

Generative models have attracted increasing interest from academia and industry, owing to their exceptional capacity to generate highly realistic images. Videos are more complex data because of the additional temporal dimension. While some research has shown early results in video generation, many open questions remain in the field.

  • Model architecture
  • The thesis will first investigate how to design model architectures for video generation.

  • 3D-aware generation
  • Learning 3D-aware models from 2D data has become a popular research topic in image generation. In this thesis, we will go one step further in this direction and explore novel view synthesis in video generation.

  • Generalizability
  • Finally, we will aim to design a universal model that is able to generate videos across categories. Most current models focus on generating a single category (e.g., faces, sky). Currently, no models are able to generate complex multi-category videos (e.g., Kinetics-600). We plan to increase the complexity of video generative models and design a large-scale video generative model. The objective is to study whether large generative models are able to capture the distribution of complex video datasets and create semantically meaningful videos.

    Skills

    Candidates must hold a Master's degree or equivalent in Computer Science or a closely related discipline by the start date.

    The candidate must be grounded in the basics of computer vision and have solid mathematical and programming skills, preferably in Python, OpenCV, and a deep learning framework such as PyTorch or TensorFlow.

    The candidate must be committed to scientific research and to producing strong publications.

    Benefits

  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Contribution to mutual insurance (subject to conditions)

    Remuneration

    Gross salary: 2100€ per month (years 1 and 2) and 2190€ per month (year 3)