Research Scientist, Multimodal Generative AI (Image/Video) at Google DeepMind

Position Research Scientist, Multimodal Generative AI (Image/Video)
Posted 2025 July 09
Expired 2025 August 08
Company Google DeepMind
Location New York City, NY | US
Job Type Full Time
Affiliate Banner

Job Description:

Latest job information from Google DeepMind for the position of Research Scientist, Multimodal Generative AI (Image/Video). If the Research Scientist, Multimodal Generative AI (Image/Video) vacancy in New York City, NY matches your qualifications, please submit your latest application or CV directly through the updated Jobkos job portal.

Please note that applying for a job may not always be easy, as new candidates must meet certain qualifications and requirements set by the company. We hope the career opportunity at Google DeepMind for the position of Research Scientist, Multimodal Generative AI (Image/Video) below matches your qualifications.

Snapshot

The role of the Research Scientist will be to develop state-of-the-art methods for multimodal generative AI models, with a primary focus on image generation and editing

At Google DeepMind, we've built a unique culture and work environment where long-term ambitious research can flourish. Our special interdisciplinary team combines the best techniques from deep learning, reinforcement learning, and systems neuroscience to build general-purpose learning algorithms. We have already made a number of high-profile breakthroughs towards building artificial general intelligence, and we have all the ingredients in place to make further significant progress over the coming year!

About Us

Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The Role

Research Scientists at Google DeepMind lead our efforts in developing novel tools, infrastructure, and algorithms towards the end goal of solving and building Artificial General Intelligence.

Having pioneered research in the world's leading academic and industrial labs, PhDs, post-docs, or professorships, Research Scientists join Google DeepMind to work collaboratively within and across Research fields. They are expected to drive independent research initiatives, work with teams on large scale AI, and develop solutions to fundamental questions in machine learning and AI.

Drawing on expertise from a variety of disciplines including deep learning, computer vision, language modeling, and advanced generative architectures, our Research Scientists are at the forefront of groundbreaking research.

Key responsibilities:
  • Design, rapidly implement, and rigorously evaluate cutting-edge deep learning algorithms and data curation for multimodal generative AI, with a particular emphasis on image synthesis.
  • Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing.
  • Suggest and engage in team collaborations to meet ambitious research goals, while also driving significant individual contributions.
  • Work in collaboration with our Ethics and Governance teams to ensure our advances in intelligence are developed ethically and provide broad benefits to humanity.
About You

In order to set you up for success as a Research Scientist at Google DeepMind,  we look for the following skills and experience:

  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, or equivalent practical experience.
  • Proven experience in deep learning research and development, particularly in generative AI and related to image synthesis. This includes diffusion models and autoregressive generative models. 
  • Exceptional engineering skills in Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems.
  • Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV).

In addition, the following would be an advantage: 

  • Demonstrated experience in multimodal generative modeling, especially combining large language models with visual generation (e.g., text-to-image/video systems, joint autoregressive and diffusion models).
  • A keen eye for visual aesthetics and detail, coupled with a passion for creating high-quality, visually compelling generative content.
  • A real passion for AI!

The US base salary range for this full-time position is between $188,000 - $262,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

 

Job Info:

  • Company: Google DeepMind
  • Position: Research Scientist, Multimodal Generative AI (Image/Video)
  • Work Location: New York City, NY
  • Country: US

How to Submit an Application:

After reading and understanding the criteria and minimum qualification requirements explained in the job information Research Scientist, Multimodal Generative AI (Image/Video) at the office New York City, NY above, immediately complete the job application files such as a job application letter, CV, photocopy of diploma, transcript, and other supplements as explained above. Submit via the Next Page link below.

Next Page »

Similar Job Vacancies

  Enterprise Customer Success ManagerNew di Perplexity AI

Posted: 2025 October 07
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $
Company: Perplexity AI
Location: New York City, NY

  Senior Editor, Commerce di Hearst

Posted: 2025 October 07
Overview (Why This Role?)Are you passionate about the intersection of content, commerce, and culture? Hearst Magazines is seeking a driven and creative Senior E
Company: Hearst
Location: New York City, NY

  Newsroom AI and Automation Engineer di Hearst

Posted: 2025 October 07
Hearst Television (HTV) is hiring a Newsroom AI and Automation Engineer. The ideal candidate will have a journalism background and be passionate about Generativ
Company: Hearst
Location: New York City, NY

  Temporary Freelance Food Editor, ThePioneerWoman.com di Hearst

Posted: 2025 October 07
ThePioneerWoman.com is looking for a freelance temporary food editor to join our food team. The senior food editor will work directly with the site's test kitch
Company: Hearst
Location: New York City, NY

  Director, Finance & Operations di Hearst

Posted: 2025 October 07
What You'll Do Financial Planning & Analysis •    Lead financial planning & analysis (FP&A) including budgeting, forecasting,
Company: Hearst
Location: New York City, NY