Director, AI Alignment and Interpretability (Remote)

Posted Yesterday
Be an Early Applicant
Hiring Remotely in USA
Remote or Hybrid
195K-290K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Define your future at CrowdStrike.
The Role
Lead and conduct mechanistic interpretability and alignment research for security-specialized AI. Develop methods to read model internals, detect misuse signals, design training interventions and evaluation frameworks, publish original research, and recruit and mentor a lean research team.
Summary Generated by Built In

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role:

Security-domain AI creates alignment and interpretability challenges without good answers in the existing literature. A model trained on offensive techniques, vulnerability research, and proprietary threat telemetry develops internal representations that matter in ways general-purpose models do not. Understanding what that model knows, how it represents threat concepts, and where its behavior could diverge from intent is the research this role exists to do. Most of it hasn't been figured out yet.

In this role, you'll lead alignment and interpretability research for CrowdStrike's security-domain AI systems. You'll build methods for reading model internals: identifying features and representations tied to offensive security concepts, detecting misuse signal, and closing the gap between what a model is trained to do and what it actually does. You'll translate those findings into training interventions, behavioral constraints, and evaluation protocols that give the team real confidence in how these models behave. This is hands-on research leadership.

The team is lean and the problem space is novel. The right candidate has deep grounding in mechanistic interpretability or a closely related field, clear instincts about what questions matter in a security context, and the ability to advance the state of the art in a space the field is still forming.

What You'll Do:

  • Own the alignment and interpretability research agenda for security-domain AI. Set priorities, personally lead the hardest open problems, and develop methods that explain model behavior mechanistically: not just what models do, but why, and what that implies at the edges of their training distribution.

  • Build and apply techniques for detecting offensive-misuse signal in model internals, including probing for latent representations of vulnerability knowledge, circuit analysis to understand how security-relevant capabilities are encoded, and activation analysis to surface risk that behavioral testing alone would miss. Work closely with the adversarial evaluation team to close the loop between what they find in testing and what you find in the weights.

  • Develop alignment methodology for security-domain AI and own the evaluation framework that makes it measurable. This includes behavioral constraints, training interventions grounded in interpretability findings, deployment guardrails, and the benchmarks and tests that give the team confidence that models operate within intended bounds as a demonstrated property, not an assertion.

  • Contribute original research through publications and external engagement. Interpretability for security-specialized models is understudied. Publishing this work is part of the job.

  • Recruit, develop, and retain a lean team of research scientists. Set a technical bar through your own contributions, not just your expectations.

What You'll Need:

  • MS or PhD in machine learning, computer science, or a related field, with research depth in interpretability, AI alignment, or a closely adjacent area.

  • 8+ years in ML research or engineering, with direct experience doing interpretability or alignment research on large language models.

  • Hands-on expertise with mechanistic interpretability methods (probing classifiers, circuit analysis, activation patching, causal tracing, feature visualization) applied to real models. You've done this work, not just reviewed it.

  • Experience designing and running alignment evaluations: behavioral testing, capability elicitation, red-lining, or similar methodologies rigorous enough to support meaningful safety claims.

  • Track record of leading and growing researchers while remaining an active technical contributor yourself.

Ways to Stand Out:

  • Background in offensive security, vulnerability research, or adversarial ML, with enough depth to recognize what you find in model internals and reason about misuse potential.

  • Published research in mechanistic interpretability, AI alignment, or AI safety.

  • Experience applying interpretability methods to domain-specialized or fine-tuned models, not only general-purpose foundation models.

  • Familiarity with alignment challenges specific to models with dual-use capability: systems that understand and can reason about offensive techniques, and what that means for responsible deployment.

  • History of working closely with adversarial evaluation or red teams, using behavioral findings to motivate internal analysis and vice versa.

#LI-JF1

#LI-Remote

Benefits of Working at CrowdStrike:

  • Market leader in compensation and equity awards

  • Comprehensive physical and mental wellness programs 

  • Competitive vacation and holidays for recharge  

  • Paid parental and adoption leaves

  • Professional development opportunities for all employees regardless of level or role

  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections

  • Vibrant office culture with world class amenities

  • Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at [email protected] for further assistance.

Find out more about your rights as an applicant.

CrowdStrike participates in the E-Verify program.

Notice of E-Verify Participation

Right to Work

CrowdStrike, Inc. is committed to fair and equitable compensation practices. Placement within the pay range is dependent on a variety of factors including, but not limited to, relevant work experience, skills, certifications, job level, supervisory status, and location. The base salary range for this position for all U.S. candidates is $195,000 - $290,000 per year, with eligibility for bonuses, equity grants and a comprehensive benefits package that includes health insurance, 401k and paid time off.

For detailed information about the U.S. benefits package, please click here

Expected Close Date of Job Posting is:08-11-2026

Skills Required

  • MS or PhD in machine learning, computer science, or a related field with research depth in interpretability or AI alignment
  • 8+ years in ML research or engineering with direct experience doing interpretability or alignment research on large language models
  • Hands-on expertise with mechanistic interpretability methods (probing classifiers, circuit analysis, activation patching, causal tracing, feature visualization) applied to real models
  • Experience designing and running alignment evaluations (behavioral testing, capability elicitation, red-lining, or similar rigorous methodologies)
  • Proven track record of leading and growing research teams while remaining an active technical contributor
  • Background in offensive security, vulnerability research, or adversarial ML
  • Published research in mechanistic interpretability, AI alignment, or AI safety
  • Experience applying interpretability methods to domain-specialized or fine-tuned models
  • Familiarity with alignment challenges for dual-use capability models and working with adversarial evaluation or red teams

What the Team is Saying

Andrew C.
Lauren P.
Brian P.
Alexa Z.
Theo K.
Sara I.
Lam N.
Lauren B.
Adeeb C.
Kristan C.
Alena C.
Thaddeus M.
Alyssa J.
KT T.

CrowdStrike Compensation & Benefits Highlights

  • Equity Value & Accessibility Equity is emphasized through RSUs and an ESPP with a lookback discount. Feedback suggests these stock programs are considered meaningful parts of total compensation.
  • Healthcare Strength Health coverage encompasses medical, dental, vision, mental‑health resources, and FSAs/HSAs. Feedback suggests these offerings are positioned as comprehensive across official materials and benefit listings.
  • Leave & Time Off Breadth Time off includes generous or “unlimited” PTO, paid holidays, volunteer time, and “Birthday PTO.” Feedback suggests these policies are presented as standard parts of the package.

CrowdStrike Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Austin, TX
10,000 Employees
Year Founded: 2011

What We Do

CrowdStrike has redefined security with the world’s most advanced cloud-native platform that protects and enables the people, processes and technologies that drive modern enterprise. Tested and proven, the world's largest organizations trust CrowdStrike to stop breaches with unparalleled protection against the most sophisticated cyberattacks. The CrowdStrike culture has been built upon our Core Values since the day we began. We are Fanatical About the Customer, Relentlessly Focused on Innovation and believe that our Limitless Passion drives Unlimited Potential for every CrowdStriker. As a purpose-built remote-first company, we believe cultivating a connected culture for every employee, no matter where they are in the world, is a key ingredient in building a high-performing, diverse team. We don’t have a mission statement. We’re on a mission—to stop breaches. Ready to join a mission that matters?

Why Work With Us

We have a culture that celebrates achievement, encourages flexibility and innovation and thrives on teamwork. We all work towards a single mission: to stop breaches. This common goal drives a sense of community and connection among our people across the globe.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

CrowdStrike Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
HQAustin, TX
Osaka
Aarhus, DK
Arlington, VA
Barcelona, ES
Bengaluru, IN
Brussels, BE
Bucharest, RO
Cheltenham, GB
Copenhagen, DK
Dubai, Dubai
Irvine, CA
Kirkland, WA
Minneapolis, MN
Mumbai, IN
New Delhi, IN
Pune, IN
Reading, GB
Riyadh, SA
Saint Louis, MO
Singapore
Sunnyvale, CA
Sydney, Sydney
Tel Aviv-Yafo, IL
Tokyo, Japan
Learn more

Similar Jobs

CrowdStrike Logo CrowdStrike

Sales Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
New York, NY, USA
10000 Employees
130K-175K Annually

CrowdStrike Logo CrowdStrike

Senior Automation Engineer

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
140K-215K Annually

CrowdStrike Logo CrowdStrike

Technical Support

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
USA
10000 Employees
70K-110K Annually

CrowdStrike Logo CrowdStrike

Marketing Manager

Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Remote or Hybrid
CA, USA
10000 Employees
130K-200K Annually

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account