Lead SageMaker Platform Engineer

Posted Yesterday
2 Locations
In-Office or Remote
73K-171K Annually
Senior level
Information Technology
The Role
Lead the build and production rollout of SageMaker Pipelines: pair with data scientists to debug pipelines, triage failures via AWS logs/telemetry, resolve multi-account IAM and cross-account artifact sync issues, improve CI/CD (Terraform, GitHub Actions), and raise the team's operational knowledge of the platform.
Summary Generated by Built In

 

Job Overview:

Our team is undergoing a large data + ML migration onto AWS SageMaker Pipelines. We deploy via Terraform and GitHub Actions across multiple AWS accounts aligned to our SDLC, sync model artifacts to a shared-services account, and validate models in dedicated testing accounts. Data is sourced primarily from Redshift, including trusted identity propagation.

We’re standing these pipelines up for the first time, and we need an expert who can help us debug and ship them to production quickly and reliably.

 

Responsibilities
  • Pair with our data scientists in live debugging sessions to diagnose and fix broken SageMaker pipelines and get them through the SDLC to prod.

  • Rapidly triage failures using AWS logs and telemetry (CloudWatch, CloudTrail, SageMaker pipeline/execution logs, etc.) and pinpoint root causes.

  • Untangle permissions issues across pipeline execution roles, cross-account access, and CI/CD identity (GitHub Actions OIDC, Terraform-managed IAM).

  • Help debug cross-account model artifact syncing (shared services) and the testing-account validation flow.

  • Level up the team’s mental model for how the platform works and where to look when things break.


 

Qualifications
  • Expert-level AWS operational experience, especially debugging via logs and telemetry (CloudWatch Logs/Metrics, CloudTrail, X-Ray or equivalent) — can move from a vague failure to a root cause fast.

  • Deep IAM / permissions expertise in a multi-account setup: execution roles, assume-role/cross-account access, resource policies, KMS/encryption permissions, and reasoning about “who is allowed to do what, as which principal.”

  • Hands-on SageMaker experience, including SageMaker Studio and SageMaker Pipelines — knows how pipelines are defined, deployed, and executed, and where to look when a step fails. (Operating/debugging, not modeling.)

  • Multi-account AWS experience aligned to an SDLC (dev/test/prod), including cross-account resource sharing and promotion patterns.

  • Comfortable working embedded and hands-on: live pairing, screen-sharing, and debugging under time pressure.

  • Strong communicator who can explain why something broke and how to avoid it next time.

 

Nice to Haves:

  • Terraform experience, especially managing IAM and SageMaker/data infrastructure as code.

  • GitHub Actions CI/CD experience, particularly OIDC-based authentication to AWS (no long-lived keys) and the IAM trust policies behind it.

  • Experience with Amazon Redshift, and ideally trusted identity propagation / IAM Identity Center integration.

  • Some ML/MLOps background — enough to speak the language of model training, artifacts, and deployment (helpful, not required).

  • AWS certifications (e.g., Solutions Architect Pro, DevOps Engineer Pro, ML Specialty) as a signal, though hands-on evidence matters more.

 

WHAT WE BELIEVE

 

At Perficient, we promise to challenge, champion, and celebrate our people. You will experience a unique and collaborative culture that values every voice. Join our team, and you’ll become part of something truly special. We believe in developing a workforce that is as diverse and inclusive as the clients we work with. We’re committed to actively listening, learning, and acting to further advance our organization, our communities, and our future leaders… and we’re not done yet. Perficient, Inc. proudly provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, national origin, age, disability, genetic information, marital status, amnesty, or status as a protected veteran in accordance with applicable federal, state and local laws. Perficient, Inc. complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. Perficient, Inc. expressly prohibits any form of unlawful employee harassment based on race, color, religion, gender, sexual orientation, national origin, age, genetic information, disability, or covered veterans. Improper interference with the ability of Perficient, Inc. employees to perform their expected job duties is absolutely not tolerated. Disability Accommodations: Perficient is committed to providing a barrier-free employment process with reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or accommodation due to a disability, please contact us.

 

The salary range for this position takes into consideration a variety of factors, including but not limited to skill sets, level of experience, applicable office location, training, licensure and certifications, and other business and organizational needs. The new hire salary range displays the minimum and maximum salary targets for this position across all US locations, and the range has not been adjusted for any specific state differentials. It is not typical for a candidate to be hired at or near the top of the range for their role, and compensation decisions are dependent on the unique facts and circumstances regarding each candidate. A reasonable estimate of the current salary range for this position is $73,008 to $170,640. Please note that the salary range posted reflects the base salary only and does not include benefits or any potential variable compensation programs. Information regarding the benefits available for this position are in our benefits overview.

 

 

Disclaimer:  The above statements are not intended to be a complete statement of job content, rather to act as a guide to the essential functions performed by the employee assigned to this classification.  Management retains the discretion to add or change the duties of the position at any time. 

#LI-RS1

 

 

About UsPerficient is the global AI and technology consulting firm disrupting the traditional consulting model. Powered by our 7,000+ advisors, engineers, and designers, Perficient implements AI-first solutions that break conventions and deliver outcomes that matter. Proudly serving clients that represent the world’s most innovative brands, and in collaboration with our powerful technology partner ecosystem, we bring deep industry expertise and data-driven design to redefine how businesses run and succeed. Perficient is different. For real. Learn more at perficient.com.

Skills Required

  • Expert-level AWS operational experience (logs/telemetry: CloudWatch, CloudTrail, X-Ray)
  • Deep IAM and permissions expertise in multi-account setups (execution roles, assume-role, resource policies, KMS)
  • Hands-on SageMaker experience, including SageMaker Studio and SageMaker Pipelines (operating/debugging pipelines)
  • Multi-account AWS experience aligned to SDLC (dev/test/prod), cross-account resource sharing and promotion patterns
  • Comfortable working embedded and hands-on: live pairing, screen-sharing, debugging under time pressure
  • Strong communication skills to explain root causes and preventative measures
  • Terraform experience, especially managing IAM and SageMaker/data infrastructure as code
  • GitHub Actions CI/CD experience, particularly OIDC-based authentication to AWS and IAM trust policies
  • Experience with Amazon Redshift and trusted identity propagation / IAM Identity Center integration
  • Some ML/MLOps background to speak the language of model training, artifacts, and deployment
  • AWS certifications (Solutions Architect Pro, DevOps Pro, ML Specialty) as signals

Perficient Compensation & Benefits Highlights

The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about Perficient and has not been reviewed or approved by Perficient.

  • Retirement Support Retirement offerings are positioned as robust, including a 401(k) with company match, an Employee Stock Purchase Plan, and an option for after-tax contributions (mega backdoor Roth). Eligibility details are described as clear for core benefits, supporting confidence in plan access timing.
  • Parental & Family Support Parental benefits are described as structured, with paid maternity recovery time and paid parental leave for all new parents. Company-paid disability coverage is also highlighted, strengthening the overall family support posture.
  • Fair & Transparent Compensation Compensation is characterized as generally market-aligned for a portion of roles, with examples of pay being viewed as fair or decent in certain contexts (such as remote or region-specific situations). Variable pay potential appears stronger in some tracks, improving perceived competitiveness for those roles.

Perficient Insights

Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Saint Louis, MO
3,295 Employees
Year Founded: 1997

What We Do

Perficient is a leading global digital consultancy. We imagine, create, engineer, and run digital transformation solutions that help our clients exceed customers’ expectations, outpace competition, and grow their business. With unparalleled strategy, creative, and technology capabilities, we bring big thinking and innovative ideas, along with a practical approach to help the world’s largest enterprises and biggest brands succeed.

Similar Jobs

Micron Technology Logo Micron Technology

Development Engineer

Artificial Intelligence • Hardware • Information Technology • Machine Learning
In-Office or Remote
New York, NY, USA
45000 Employees
107K-182K Annually
In-Office or Remote
Chicago, IL, USA
1805 Employees
62K-111K Annually
In-Office or Remote
Chicago, IL, USA
1805 Employees
62K-150K Annually

Dynatrace Logo Dynatrace

Senior Devops Engineer

Artificial Intelligence • Big Data • Cloud • Information Technology • Software • Big Data Analytics • Automation
Remote or Hybrid
Grand Rapids, MI, USA
5600 Employees
127K-191K Annually

Similar Companies Hiring

Scrunch  Thumbnail
Artificial Intelligence • Information Technology • Marketing Tech • Software • SEO
Salt Lake City, Utah
Standard Template Labs Thumbnail
Artificial Intelligence • Information Technology • Software
New York, NY
25 Employees
Golden Pet Brands Thumbnail
Digital Media • eCommerce • Information Technology • Marketing Tech • Pet • Retail • Social Media
El Segundo, California
178 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account