Research Engineer, AI Safety & Alignment

Reposted 15 Days Ago
Be an Early Applicant
Redwood City, CA
In-Office
225K-400K Annually
Expert/Leader
Artificial Intelligence • Software • Conversational AI • Generative AI
The Role
As a Research Engineer, you will tackle AI safety challenges, develop evaluation methodologies, conduct adversarial testing, and work on model alignment and interpretability. You'll bridge theoretical and practical applications of AI safety research and collaborate with product teams to implement safety solutions.
Summary Generated by Built In
About the role and team

Joining us as a Research Engineer, you'll be at the forefront of tackling one of the most critical challenges in AI today: safety and alignment. Your work will be pivotal in understanding and mitigating the risks of advanced AI, conducting foundational research to make our models safer, and solving the core technical problems of AI alignment—ensuring our models behave in accordance with human values and intentions.

The Safety team is dedicated to pioneering and implementing techniques that make our models more robust, honest, and harmless. As a Research Engineer, you will bridge the gap between theoretical research and practical application, writing high-quality code to test hypotheses and integrating successful safety solutions directly into our products. Your research will not only protect millions of users but also contribute to the broader scientific community's understanding of how to build safe, beneficial AI.

What you'll do
  • Develop and implement novel evaluation methodologies and metrics to assess the safety and alignment of large language models.

  • Research and develop cutting-edge techniques for model alignment, value learning, and interpretability.

  • Conduct adversarial testing to proactively uncover potential vulnerabilities and failure modes in our models.

  • Analyze and mitigate biases, toxicity, and other harmful behaviors in large language models through techniques like reinforcement learning from human feedback (RLHF) and fine-tuning.

  • Collaborate with engineering and product teams to translate safety research into practical, scalable solutions and best practices.

  • Stay abreast of the latest advancements in AI safety research and contribute to the academic community through publications and presentations.

Who you are
  • Hold a PhD (or equivalent experience) in a relevant field such as Computer Science, Machine Learning, or a related discipline.

  • Write clear and clean production-facing and training code

  • Experience working with GPUs (training, serving, debugging)

  • Experience with data pipelines and data infrastructure

  • Strong understanding of modern machine learning techniques, particularly transformers and reinforcement learning, with a focus on their safety implications.

  • Are passionate about the responsible development of AI and dedicated to solving complex safety challenges.

Nice to Have
  • Experience with product experimentation and A/B testing

  • Experience training large models in a distributed setting

  • Familiarity with ML deployment and orchestration (Kubernetes, Docker, cloud)

  • Experience with explainable AI (XAI) and interpretability techniques.

  • Have research in AI safety, alignment, ethics, or a related area.

  • Knowledge of the broader societal and ethical implications of AI, including policy and governance.

  • Publications in relevant academic journals or conferences in the field of machine learning

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Top Skills

Ai Safety
Data Pipelines
Docker
Explainable Ai
Gpus
Kubernetes
Machine Learning
Reinforcement Learning
Transformers
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
Menlo Park, California
30 Employees
Year Founded: 2021

What We Do

Creating revolutionary open-ended conversational applications through breakthrough research.

Similar Jobs

Deepgram Logo Deepgram

Director, Sales Development Representatives

Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
In-Office
San Francisco, CA, USA
150 Employees
150K-175K Annually

System1 Logo System1

Software Engineer

AdTech • Big Data • Digital Media • Marketing Tech
Easy Apply
Hybrid
2 Locations
300 Employees
133K-200K Annually

BlackRock Logo BlackRock

Quantitative Systems Researcher, Associate / VP

Fintech • Information Technology • Financial Services
In-Office
San Francisco, CA, USA
25000 Employees
145K-215K Annually

BlackRock Logo BlackRock

Managing Director, Global Head of ETF Servicing

Fintech • Information Technology • Financial Services
In-Office
3 Locations
25000 Employees
225K-350K Annually

Similar Companies Hiring

PRIMA Thumbnail
Travel • Software • Marketing Tech • Hospitality • eCommerce
US
15 Employees
Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Idler Thumbnail
Artificial Intelligence
San Francisco, California
6 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account