Customer Support Engineer (GPU Cluster)

Reposted 21 Days Ago
San Francisco, CA, USA
In-Office
160K-230K Annually
Mid level
Artificial Intelligence • Information Technology
The Role
As a Customer Support Engineer, you will resolve technical challenges related to GPU clusters, collaborate with teams, and document processes to enhance customer satisfaction.
Summary Generated by Built In
About the role

As a Customer Support Engineer at a pioneering AI company, you'll be the first line of defense to support customers as they build out training, fine-tuning, and inference solutions with Together AI. You'll dive deep into complex technical challenges, providing swift and effective solutions while serving as a product expert. As part of the Customer Experience organization, you will collaborate closely with product and sales, driving continuous improvement of our offerings. This is an exciting opportunity for a deeply technical professional passionate about AI and customer success to make a significant impact in a fast-paced, innovative environment.

Responsibilities
  • Engage directly with customers to tackle and resolve complex technical challenges involving our cutting-edge Kubernetes GPU clusters; ensure swift and effective solutions every time.
  • Become a product expert in our GPU Cluster service, serving as the last line of technical defense before issues are escalated to Engineering and Product teams.
  • Collaborate seamlessly across Engineering, Research, and Product teams to address customer concerns; work with senior leaders both internally and externally to ensure the highest levels of customer satisfaction.
  • Transform customer insights into action by identifying patterns in support cases and working with Engineering and Go-To-Market teams to drive Together’s roadmap (e.g., future models to support).
  • Maintain detailed documentation of system configurations, procedures, troubleshooting guides, and FAQs to facilitate knowledge sharing with the team and customers.
  • Be flexible in providing support coverage during holidays, nights and weekends as required by business needs to ensure consistent and reliable service for our customers.
Requirements
  • 3+ years of experience in a customer-facing technical role, with at least 1 year in a support function in AI or supporting a mission-critical API in SaaS.
  • Strong technical background, with knowledge of AI, ML, GPU technologies and their integration into high-performance computing (HPC) environments.
  • Familiarity with infrastructure services (e.g., Kubernetes, SLURM), infrastructure as code solutions (e.g., Ansible), high-performance network fabrics, NFS-based storage management, container infrastructure, and scripting and programming languages.
  • Foundational understanding of the installation, configuration, administration, troubleshooting, and securing of compute clusters.
  • Skill in complex technical problem solving and troubleshooting, with a proactive approach to issue resolution.
  • Ability to work cross-functionally with teams such as Sales, Engineering, Support, Product and Research to drive customer success.
  • Strong sense of ownership and willingness to learn new skills to ensure both team and customer success.
  • Excellent communication and interpersonal skills, with the ability to explain complex technical concepts to non-technical stakeholders.
  • Ability to operate in dynamic environments, adept at managing multiple projects, and comfortable with frequent context switching and prioritization.
About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next-generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is $160,000-$230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Please see our Privacy Policy at https://www.together.ai/privacy

Skills Required

  • 3+ years of experience in a customer-facing technical role
  • 1 year in a support function in AI or mission-critical API in SaaS
  • Knowledge of AI, ML, GPU technologies in HPC environments
  • Familiarity with Kubernetes, SLURM, Ansible and other infrastructure services
  • Foundational understanding of compute cluster installation and administration
  • Complex technical problem solving and troubleshooting
  • Ability to work cross-functionally with various teams
  • Excellent communication skills for explaining technical concepts

The Company
San Francisco, California
84 Employees
Year Founded: 2022

What We Do

Together AI is a research-driven artificial intelligence company. We contribute leading open-source research, models, and datasets to advance the frontier of AI. Our decentralized cloud services empower developers and researchers at organizations of all sizes to train, fine-tune, and deploy generative AI models. We believe open and transparent AI systems will drive innovation and create the best outcomes for society.
