Cloud Support Engineer - Managed Cloud Services

Posted 17 Days Ago
Be an Early Applicant
San Jose, CA, USA
In-Office
137K-254K Annually
Mid level
Artificial Intelligence • Cloud • Hardware • Software • Semiconductor
The Role
The Cloud Support Engineer will provide technical support for cloud-based silicon design environments, focusing on customer satisfaction and efficient resolution of issues related to cloud infrastructure and HPC performance.
Summary Generated by Built In
At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.

Job Summary

We are seeking a highly motivated candidate for the position of Cloud Support Engineer with a strong infrastructure background to support our secure, cloud‑based silicon chip design environments used by external customers for mission‑critical EDA, HPC, and containerized workloads. This role is customer‑facing and service‑oriented, requiring deep technical expertise across Linux, cloud infrastructure, and platform operations, along with a strong commitment to responsiveness, professionalism, and delivering an exceptional customer experience.

This role is well‑suited for engineers with hands‑on experience operating OpenStack and/or OpenShift platforms, along with traditional infrastructure components such as compute, storage, networking, and identity services. Success is measured not only by technical outcomes, but by customer satisfaction, trust, and confidence in the service.

This position involves working with export‑restricted data (ITAR/CUI) and supporting highly secure environments with stringent operational and compliance standards.

Key Responsibilities

Customer Support & Service Excellence

  • Serve as a primary technical support contact for external customers using secure cloud‑based silicon design and HPC platforms

  • Deliver timely, responsive, and high‑quality support, ensuring customer issues are acknowledged, communicated, and resolved effectively

  • Proactively minimize downtime, anticipate customer needs, and resolve issues before they impact workloads

  • Clearly communicate complex technical issues, status updates, and resolutions to customers with varying levels of expertise

  • Build long‑term customer trust through professionalism, ownership, and consistent follow‑through

Platform, Infrastructure & Environment Support

  • Support and troubleshoot Linux‑based infrastructure and cloud environments, including compute, storage, networking, and identity components

  • Operate and support OpenStack‑based private or hybrid cloud platforms, including core services (Nova, Neutron, Cinder, Glance, Keystone, etc.)

  • Support OpenShift / Kubernetes platforms, including cluster operations, workload troubleshooting, networking, storage integration, and upgrades

  • Maintain availability, performance, and reliability of secure multi‑tenant environments

  • Perform system‑level diagnosis across infrastructure layers to identify root cause and remediation paths

  • Partner with internal platform and engineering teams to drive stability and performance improvements

HPC, Licensing & Performance Management

  • Monitor HPC cluster performance, job scheduling, throughput, and queue health

  • Identify and resolve HPC job performance issues, including scheduler configuration, resource contention, I/O bottlenecks, and memory constraints

  • Troubleshoot and resolve license availability, utilization, and checkout issues impacting customer workloads

  • Support distributed resource managers such as Slurm, LSF, SGE, or equivalent schedulers

Automation & Operational Efficiency

  • Design, develop, and maintain automation for recurring operational tasks, including:

    • Infrastructure and platform health monitoring

    • Capacity tracking and alerting

    • User provisioning and de‑provisioning

    • License usage monitoring

    • Detection of abnormal system, container, or job behavior

  • Use Python, shell scripting, Perl, or similar tools to reduce manual effort and improve mean time to resolution (MTTR)

  • Apply AI‑assisted or agentic automation where appropriate to improve operational efficiency and customer experience

Security, Compliance & Operations

  • Operate and support systems containing ITAR‑controlled and CUI data in compliance with regulatory and corporate requirements

  • Follow documented security, access control, auditing, and change management procedures

  • Participate in incident response, post‑incident root cause analysis, and corrective action planning

  • Create and maintain runbooks, knowledge base articles, and customer‑facing documentation

Required Qualifications

Technical Skills

  • Strong hands‑on experience with Linux system administration and troubleshooting

  • Broad infrastructure experience, including compute, storage, networking, and identity services

  • Experience operating and supporting OpenStack and/or OpenShift (Kubernetes) environments

  • Experience supporting HPC or large‑scale compute environments

  • Proficiency in Python, shell scripting, Perl, or similar automation‑focused languages

  • Experience with monitoring, logging, and alerting platforms

  • Familiarity with license management systems (e.g., FlexNet / FLEXlm or equivalent)

Customer Service & Professional Skills

  • Demonstrated ability to deliver excellent customer service in a technical support, SRE, or infrastructure operations role

  • Strong sense of ownership and urgency when addressing customer‑impacting issues

  • Ability to balance deep technical problem‑solving with clear, customer‑friendly communication

  • Highly organized and able to manage multiple concurrent customer issues

Security & Compliance

  • Ability to work with export‑restricted data (ITAR/CUI)

  • U.S. Person status or eligibility as required to support export‑controlled environments

Preferred Qualifications

  • Experience supporting EDA, semiconductor, or silicon design environments

  • Experience with cloud‑based or on‑prem HPC platforms (private, public, or hybrid)

  • Strong background in infrastructure operations, SRE, or platform engineering roles

  • Experience with configuration management or infrastructure‑as‑code tools (e.g., Ansible, Terraform)

  • Experience applying AI‑assisted automation in production operations or support contexts

Education

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience

The annual salary range for California is $136,500 to $253,500. You may also be eligible to receive incentive compensation: bonus, equity, and benefits. Sales positions generally offer a competitive On Target Earnings (OTE) incentive compensation structure. Please note that the salary range is a guideline and compensation may vary based on factors such as qualifications, skill level, competencies and work location. Our benefits programs include: paid vacation and paid holidays, 401(k) plan with employer match, employee stock purchase plan, a variety of medical, dental and vision plan options, and more.

We’re doing work that matters. Help us solve what others can’t.

Skills Required

  • Strong hands-on experience with Linux system administration
  • Broad infrastructure experience, including compute, storage, networking, and identity services
  • Experience operating and supporting OpenStack and/or OpenShift environments
  • Experience supporting HPC or large-scale compute environments
  • Proficiency in Python, shell scripting, Perl, or similar automation-focused languages
  • Experience with monitoring, logging, and alerting platforms
  • Familiarity with license management systems (e.g., FlexNet / FLEXlm or equivalent)
  • U.S. Person status or eligibility as required to support export-controlled environments
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Jose, CA
8,216 Employees
Year Founded: 1988

What We Do

Cadence enables electronic systems and semiconductor companies to create the innovative end products that are transforming the way people live, work and play. Cadence® software, hardware and IP are used by customers to deliver products to market faster. The company's Intelligent System Design strategy helps customers develop differentiated products—from chips to boards to intelligent systems—in mobile, consumer, cloud, data center, automotive, aerospace, IoT, industrial and other market segments. Cadence is listed as one of Fortune Magazine's 100 Best Companies to Work For.

Similar Jobs

Scale AI Logo Scale AI

Staff Software Engineer

Artificial Intelligence • Big Data • Machine Learning
In-Office
2 Locations
523 Employees
252K-315K Annually

Doximity Logo Doximity

Marketing Manager

Healthtech • Information Technology • Mobile • Productivity • Software • Analytics • Telehealth
Easy Apply
In-Office or Remote
San Francisco, CA, USA
740 Employees

Doximity Logo Doximity

Regional Vice President, Pharma

Healthtech • Information Technology • Mobile • Productivity • Software • Analytics • Telehealth
Easy Apply
In-Office or Remote
2 Locations
740 Employees

Samsara Logo Samsara

Mid-market Account Executive

Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Easy Apply
Remote or Hybrid
USA
4000 Employees
152K-190K Annually

Similar Companies Hiring

Bellagent Thumbnail
Artificial Intelligence • Machine Learning • Business Intelligence • Generative AI
Chicago, IL
20 Employees
Kepler  Thumbnail
Fintech • Software
New York, New York
6 Employees
Onshore Thumbnail
Artificial Intelligence • Fintech • Software • Financial Services
New York, New York
60 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account