Home
/
Comprehensive
/
Research Scientist, AI Safety and Alignment
Research Scientist, AI Safety and Alignment-March 2024
San Francisco
Mar 18, 2025
ABOUT GOOGLE
Our mission is to organize the world’s information and make it universally accessible and useful.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Research Scientist, AI Safety and Alignment

  Office locations: Also open to Mountain View and London.

  At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

  Snapshot

  Our team is responsible for enabling AI systems to reliably work as intended, including identifying potential risks from current and future AI systems, and conducting technical research to mitigate them. As a Research Scientist, you will design, implement, and empirically validate approaches to alignment and risk mitigation, and integrate successful approaches into our best AI systems.

  About Us

  Conducting research into any transformative technology comes with the responsibility to build mechanisms for safe and reliable development and deployment at every step. Technical safety research at Google DeepMind investigates questions related to evaluations, reward learning, fairness, interpretability, robustness, and generalisation in machine learning systems. Proactive research in these areas is essential to the fulfilment of the long-term goal of Google DeepMind: to build safe and socially beneficial AI systems.

  Research Scientists work on the forefront of technical approaches to designing systems that reliably function as intended while discovering and mitigating risks, in close collaboration with other AI research groups within and outside of Google DeepMind.

  The Role

  Key responsibilities:

  Identify and investigate possible failure modes for foundation models, ranging from sociotechnical harms (e.g. fairness, misinformation) to misuse (e.g. weapons development, criminal activity) to loss of control (e.g. high-stakes failures, rogue AI).

  Develop and implement technical approaches to mitigate these risks, such as benchmarking and evaluations, dataset design, scalable oversight, interpretability, adversarial robustness, monitoring, and more, in coordination with the team’s broader technical agenda.

  Report and present research findings and developments to internal and external collaborators with effective written and verbal communication.

  Collaborate with other internal teams to ensure that Google DeepMind AI systems and products (e.g. Gemini) are informed by and adhere to the most advanced safety research and protocols.

  About You

  You have extensive research experience with deep learning and/or foundation models (for example, a PhD in machine learning).

  You are adept at generating ideas and designing experiments, and implementing these in Python with real AI systems.

  You are keen to address risks from foundation models, and have thought about how to do so. You plan for your research to impact production systems on a timescale between “immediately” and “a few years”.

  You are excited to work with strong contributors to make progress towards a shared ambitious goal. With strong, clear communication skills, you are confident engaging technical stakeholders to share research insights tailored to their background.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Color Matcher III
Seeks assistance on priorities for scheduling work, organizes work efficiently.Verbally communicates routine information to supervisor and others in platform.Is receptive and attentive to instruction
Panda Express In Person Interview Day - Atwater, CA - 02/03 (1768)
Panda Express Open House Interview Day! Thank you so much for your interest in joining our In Person Open House Interview Day. Saturday, February 3rd, 2024, 11AM - 4PM Now Hiring:Service Team, Kitche
Provider Operations, Workforce Management - Manager
Specialty/Competency: Operations Industry/Sector: Health Services Time Type: Full time Travel Requirements: Up to 60% A career within Operations Consulting services, will provide you with the opportu
Assistant Professor - Accountancy
Position Title: Assistant Professor - Accountancy   Location: Big Rapids (Main Campus)   Department: 34200 - Accountancy Finance & Info Systems   Advertised Salary: $115,000 - $125,000. Salary co
Oracle NetSuite - Account Executive - GB East - Mid-market
Job Description About Oracle NetSuite Do you want to advance your career with the world’s first cloud company? Since 1998, Oracle NetSuite has been on a mission to deliver an agile, unified applicati
Full-Time Store Associate
As a Store Associate, you’ll be responsible for merchandising and stocking product, cashiering, and cleaning to keep the store looking its best. You’ll enhance the customer shopping experience by wor
Registered Nurse RN Dialysis
Primary City/State: Mesa, Arizona Department Name: Renal Dialysis-Hosp Work Shift: Day Job Category: Nursing $* 15,000.00 Sign-on incentive; requires a 2 year commitment* Banner Desert Medical Center
UI Developer
About Us: Innovating to solve real-world problems Applied Insight enhances the ability of federal government customers to preserve national security, deliver justice and serve the public with advance
Freight Handler Part-Time
Starting Rate of Pay: $19.94 / hour Work Hours: 5:00pm - 9:00pm POSITION OVERVIEW: Transport freight across dock area to/from trailers for loading to trailers. ESSENTIAL JOB DUTIES/RESPONSIBILITIES:
Director of Category Non-Perishables
Starting Wage: Salary and (annual bonus) Calling all Foodies! Don’t romaine calm, Busch’s is HIRING! Do you love food, fun and people? Are you looking for growth, development and excellent wages? We
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved