Senior Software Engineer - Platform Reliability

Sorry, this job was removed at 01:58 a.m. (CST) on Saturday, Mar 16, 2024
Remote
130K-175K Annually
3-5 Years Experience
Artificial Intelligence • Big Data • Computer Vision • Machine Learning • Analytics
Build, Deploy, and Maintain AI for an Unpredictable World
The Role

As a Software Engineer - Platform Reliability at Striveworks, you will play a crucial role in supporting our customers and their experience using the Chariot platform in complex air-gapped and cloud computing environments for mission-critical National Security systems. As a member of the Customer Experience team, you’ll assist in the planning and technical delivery of installations and upgrades, and you’ll be there to support users when they have questions or run into problems using our platform. The customer’s experience, for both the platform’s users and the economic sponsors, will always be top of mind in every interaction. How you do your work will be as important as what you do to support our customers, and attention to detail in developing best practices, smooth hand-offs, clear planning, and clear communication will be paramount.

You are right for this challenge if you value and possess technical expertise and you enjoy working with customers as much as you enjoy performing the work itself.

Day-to-day Responsibilities:

  • Overseeing the automation of infrastructure-as-code for standing up virtual machines and custom Kubernetes clusters in AWS, Azure, GCP, on-premises, or hybrid cloud environments
  • Triaging bugs reported by platform users and writing code to fix those issues in the code base
  • Working with platform developers to define requirements and build solutions for customer use cases of the platform
  • Deploying software to on-prem unclassified, CUI, Secret, and Top Secret networks
  • Participating in on-call rotations and incident response to swiftly address and resolve critical system issues

What you’ll own and do:

  • Drive the continuous improvement of our support process and customer experience by identifying areas for improvement and implementing changes.
  • Collaborate with the CX team to develop and maintain best technical support processes and procedures.
  • Work directly with customers and customer-facing teams to ensure well-planned and timely delivery of installations and upgrades. 
  • Work directly with customers and Striveworks’ Core Dev and Product teams to ensure customer issues are well documented, well understood, and appropriately championed for resolution.
  • Collaborate with the CX team to build a robust support metrics framework and provide regular reports to management on support metrics performance.

The Software Engineer - Platform Reliability on the Customer Experience (CX) team leads a small team of Site Reliability Engineers who are directly engaged with the customer, predominantly with their networking and platform management teams, to sustain the Chariot platform in air-gapped computing environments. You will contribute directly to the success of mission-critical systems for National Security and Commercial clients. You will be expected to wear multiple hats and step into vacuums where more work is needed, and you will be given the breadth to explore new technologies. You will work side by side daily with software engineers, data scientists, and end users of our products, learning from them so that functional decisions become second nature to you.

The anticipated base pay range for this position is $180,000–$240,000/year. Striveworks’ total compensation package includes a competitive base salary, annual performance-based equity grants, and a lucrative yearly cash bonus. 

This position offers a fully remote work environment, but requires a willingness to travel to customer sites up to 50% of the time. Preferred locations of residency for this role are in Southern Pines, NC or Tampa, FL. Alternatively, you can work hybrid/onsite at our office in northwest Austin, TX.

The Right Fit

We spend a lot of time during our hiring process talking about shared values. 

Why? We passionately believe that fostering an environment where people can self-actualize and pursue greatness is the best way to achieve our individual and collective goals. 

What does this mean for you? We want to provide you with the conditions to thrive in an environment where you can achieve your goals, where you know the team shares your goals, and where you make and accept decisions for the team with humility. At Striveworks, we want your say/do ratio to be 1 and to know that being part of a top-tier team means that there is no smartest person in the room. If that makes sense, we are already on the same page. 

What we're looking for:

  • Top Secret U.S. security clearance
  • 8+ years experience as a Software Engineer, Site Reliability Engineer, or DevOps Engineer
  • 2+ years relevant experience in:
    • Developing for and/or deploying microservices in Kubernetes
    • Programming in Python and Golang
    • Writing and deploying Helm Charts
    • Deploying a web-based application to a DoD/IC air-gapped network
    • Automation and infrastructure-as-code (e.g. Terraform, Ansible)
    • Deploying infrastructure in a cloud such as AWS, Azure, GCP, or OpenStack
  • Understanding of networking concepts, security best practices, and disaster recovery strategies.
  • Excellent communication and collaboration skills to work effectively in a cross-functional team environment
  • Strong problem-solving skills and the ability to troubleshoot complex technical issues

The Wish List

We are very interested in candidates who possess the above qualifications, and we appreciate and consider the addition of:

  • Deploying, maintaining, or contributing to CNCF projects
  • Deploying, managing, and/or supporting enterprise information systems in a DoD environment
  • U.S. federal information system security policies, including Security Technical Implementation Guides (STIGs), NIST 800-171, NIST 800-53, CMMC, ICD 503
  • DevSecOps, CI/CD pipelines, or automated security scanning
  • Administration and deployment of GPU-enabled servers
  • Storage technologies, NAS/SAN tools
  • Directly leading engineering initiatives and/or teams
  • Experience deploying infrastructure to AWS C2S, or similar
  • Service mesh
  • Blue-green and Canary deployments
  • Multi-cloud 

The Benefits

  • Top-of-market salary and total compensation
  • Generous equity plan
  • Health/vision/dental insurance
  • Flexible PTO
  • Parental leave

Build, Deploy, and Maintain AI for an Unpredictable World

AI is driving a new Industrial Revolution. But most AI tools only work when the world looks the same tomorrow as it did yesterday. That's rarely the case.
Striveworks was formed to fix this problem. Our platform lets teams build AI models, deploy them into unpredictable environments, and watch them deliver trustworthy results—day after day. Our approach has transformed AI outcomes for organizations where failure is never an option. As a result, Striveworks was recognized as an exemplar in the National Security Commission on Artificial Intelligence Final Report. 

In 2023, Striveworks placed on the Deloitte Technology Fast 500 as one of the most rapidly growing technology companies in North America. In 2024, Striveworks was honored with a Built In Best Places to Work award—for the third year running.

Striveworks is an Equal Opportunity Employer and does not discriminate in employment on the basis of race, color, religion, belief, sex (including pregnancy and gender identity or expression), national origin, social or ethnic origin, political affiliation, sexual orientation, marital status, disability, genetic information, age, membership in an employee organization, retaliation, parental status, military service, or other non-merit factors. Striveworks will not tolerate discrimination or harassment of any kind.

If you require assistance or a reasonable accommodation in the application process, please contact Operations at [email protected].

Striveworks is a participating employer in the E-Verify program.

The Company
HQ: Austin, TX
110 Employees
Hybrid Workplace
Year Founded: 2018

Why Work With Us

Our people and products shape solutions that directly affect the geopolitical landscape. We are a cross-functional team working together on the cutting edge to provide mission-critical solutions.

Striveworks Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
HQ: Austin, TX
Vass, NC
