Research Engineer position to help advance the state of the art in multimodal AI, and bring its benefits to Google products used by billions of people worldwide.
Our team at Google DeepMind works on cutting-edge research to advance the foundational capabilities of multimodal AI systems. In addition to producing highly-cited research published at top academic venues, our innovations land in flagship models like Gemini, and in Google products used by people every day.
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
You will be part of the REMY (Research & dEvelopment in Multimodal technologY) team in the Media Understanding organization, at Google DeepMind. In this role, you will have the opportunity to push forward state-of-the-art research in multimodal AI representation models, in the context of recent advancements in multimodal foundation models generally. You'll be at the forefront of developing models that power Google products used by billions of people worldwide. Your work will directly impact how these products understand and interact with images, text and video. This is a unique opportunity to shape the future of multimodal AI and its applications in a dynamic and impactful environment.
We are a team of research/software engineers, research scientists, and machine learning experts, working together to enable superhuman understanding of the multimodal world.
You'll be developing the next SOTA models for multimodal understanding. Your work will include researching new modeling techniques, implementing research ideas, running experiments to evaluate improvements, and identifying new opportunities.
As a member of the Media Understanding team, you will be responsible for conducting fundamental and applied research in multimodal AI (computer vision, language understanding, machine learning, and related areas). Your job responsibilities will include:
In order to set you up for success as a Research Engineer at Google DeepMind, we look for the following skills and experience:
We are an applied research team that takes on challenging real-world problems and thrives on finding solutions in the presence of ambiguity. In order to set you up for success as a Research Engineer/Scientist at Google DeepMind, we look for the following skills and experience:
In addition, the following would be an advantage:
The US base salary range for this full-time position is between $141,000 - $202,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
Application deadline: October 13, 2025
Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.