Engineer, Cloud Operations (12am - 12pm EST)

| USA +80 more | Remote
Sorry, this job was removed at 12:43 p.m. (CST) on Tuesday, May 14, 2024
Find out who’s hiring remotely Nationwide
See all Remote jobs Nationwide
Apply
By clicking Apply Now you agree to share your profile information with the hiring company.

CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry’s fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.

About the role:

***Please note that this is a night shift role, which operates from 12am - 12pm EST.***

The Cloud Operations Team is the heart of CoreWeave’s operational practice.  As a Cloud Operations Engineer, you’ll work as a team in a Virtual (remote) Operations Center responding to performance and availability issues across the CoreWeave cloud.  This role places you on the front line of operations, bridging the gap between Customer Support and Service Owning teams.  Working in shifts ensuring 24x7 coverage, you’ll develop proactive health monitoring, triage alerts and incidents serving in the commander role during Priority Incident events, and participate in ongoing analysis and reliability improvement practices.   

Collaborating across development and engineering, this role allows you to operate horizontally and vertically within the CoreWeave ecosystem to root out problems, initiate and coordinate responses, and drive lower MTTR and MTTD scores.

This role is ideal for an individual with broad technology and troubleshooting skills who has the desire to expand their knowledge in critical areas such as networking, storage, Kubernetes, automation, and observability.  Candidates in this role likely aspire to a career as an SRE, domain specific engineer or engineering management.

You will work with a team of 8 Operations Engineers and have the opportunity to work on the full gamut of rewarding challenges that come with operating the AI Cloud in a communicative, supportive, and high-performing environment. 

As a member of the Cloud Operations Team you have the opportunity to:

  • With a customer first mindset, proactively identify performance and availability issues in production 
  • Interpret operational and observability data to assess system performance and adherence to Service Level Objectives
  • Investigate, validate and triage alerts and incidents
  • Enable insight into the customer experience by developing and maintaining dashboarding and alerting
  • Provide Tier 2 support for internal and customer facing services
  • Act with autonomy to initiate and coordinate response to priority incidents as Incident Commander
  • Participate in, and/or conduct incident post mortems
  • Draft Post Incident Review documents
  • Identify opportunities and implement solutions to improve response processes
  • Partner with SRE and Service Owning teams to ensure operational readiness for services and applications
  • Create and maintain knowledge articles and documentation
  • Learn and navigate the tools, systems and processes that enable the AI cloud
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and above all, be yourself.

Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk. 

  • You enjoy using observability data to visualize service health, and triangulate proximate cause of performance and availability issues.
  • You have experience in a support capacity with broad understanding and ability to navigate and interact with modern applications and infrastructure
  • You are comfortable managing communication and coordinating multiple engineers during an incident
  • You have a desire to learn or have experience with automation.
  • You are comfortable with the idea of working on the Linux CLI and have a foundational understanding of scripting including elements such as conditionals, variables, and loop structures.
  • You’re comfortable in open source environments.
  • You’re excited to help bootstrap a new team and contribute to developing processes that enable the future scalability of the team.
  • You are open to feedback, coaching, and being an active participant in improving how the team functions
  • You’re excited to join a team with diverse perspectives and backgrounds that believe in tackling challenges, growing hand in hand, and winning together.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $110,000-$140,000 annually. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Hybrid Workplace

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences

Why CoreWeave?

At CoreWeave, we work hard, have fun, and move fast!  We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: 

  • Be Curious at your Core
  • Act like an Owner
  • Empower Employees
  • Deliver Best In-Class Client Experience 
  • Achieve More Together

We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! 

Benefits

We offer a competitive salary and benefits, including:

  • Medical, dental and vision insurance - 100% paid for the employee
  • Company paid Life Insurance 
  • Voluntary supplemental life insurance 
  • Short and long-term disability insurance 
  • Flexible Spending Account
  • Tuition Reimbursement 
  • Mental Wellness Benefits through Spring Health 
  • Family-Forming support provided by Carrot
  • Paid Parental Leave 
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our offices
  • Weekly massages in NJ office
  • A casual work environment
  • Work culture focused on innovative disruption

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.


Read Full Job Description
Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Candidate Location Eligibility:
Albuquerque, NM
Ann Arbor, MI
Atlanta, GA
Austin, TX
Baltimore, MD
Baton Rouge, LA
Birmingham, AL
Boise, ID
Boston, MA
Buffalo, NY
Charleston, SC
Charlotte, NC
Chicago, IL
Cincinnati, OH
Cleveland, OH
Colorado, CO
Columbus, OH
Dallas-Fort Worth, TX
Dayton, OH
Des Moines, IA
Detroit, MI
Fayetteville-Springdale-Rogers, AR
Greensboro, NC
Hampton Roads, VA
Hartford, CT
Houston, TX
Huntsville, AL
Indianapolis, IN
Jacksonville, FL
Kansas City, MO
Las Vegas, NV
Lexington, KY
Lincoln, NE
Little Rock, AR
Los Angeles, CA
Louisville, KY
Madison, WI
Memphis, TN
Miami, FL
Milwaukee, WI
Minneapolis–Saint Paul, MN
Nashville, TN
New Orleans, LA
New York City, NY
Ogden, UT
Oklahoma City, OK
Omaha, NE
Orlando, FL
Other US Location
Palm Bay-Melbourne-Titusville
Pensacola, FL
Peoria, IL
Philadelphia, PA
Phoenix – Mesa – Scottsdale, AZ
Pittsburgh, PA
Portland, ME
Portland, OR
Providence, RI
Provo, UT
Raleigh-Durham, NC
Reno, NV
Richmond, VA
Rochester, NY
Sacramento, CA
Salt Lake City, UT
San Antonio, TX
San Diego, CA
San Francisco, CA
San Luis Obispo, CA
Santa Cruz, CA
Seattle, WA
Spokane, WA
St. Louis, MO
Tallahassee, FL
Tampa Bay, FL
Tucson, AZ
Tulsa, OK
Washington DC
Wichita, KS
Wilmington, NC

Technology we use

  • Engineering
  • Product
  • Sales & Marketing
  • People Operations
    • CSSLanguages
    • GolangLanguages
    • JavascriptLanguages
    • PythonLanguages
    • SqlLanguages
    • TypeScriptLanguages
    • BashLanguages
    • LodashLibraries
    • ReactLibraries
    • ReduxLibraries
    • BootstrapLibraries
    • ApolloLibraries
    • DjangoFrameworks
    • DockerFrameworks
    • ExpressFrameworks
    • GraphQLFrameworks
    • gRPCFrameworks
    • JestFrameworks
    • KubernetesFrameworks
    • Node.jsFrameworks
    • OAuthFrameworks
    • Ruby on RailsFrameworks
    • TensorFlowFrameworks
    • TerraformFrameworks
    • Gitlab CIFrameworks
    • Controller-RuntimeFrameworks
    • PrometheusFrameworks
    • FastAPIFrameworks
    • MemcachedDatabases
    • MySQLDatabases
    • PostgreSQLDatabases
    • RedisDatabases
    • SQLiteDatabases
    • CockroachDBDatabases
    • GitHubServices
    • GitLabServices
    • SlurmServices
    • CiliumServices
    • CumulusServices
    • Teleport Services
    • Google AnalyticsAnalytics
    • FigmaDesign
    • MiroDesign
    • LucidChartDesign
    • Google DriveManagement
    • Google DocsManagement
    • Google SlidesManagement
    • JIRAManagement
    • NotionManagement
    • WebflowCMS
    • DocuSignCRM
    • HubSpotCRM
    • SalesforceCRM
    • SendGridEmail
    • SlackCollaboration
    • ZoomCollaboration

An Insider's view of CoreWeave

What’s the vibe like in the office?

Our office space is an open concept layout with pods. One of my favorite things has always been that everybody sits together. The CEO, junior engineers, the vfx dept, are all in the same room with the ability to ask questions and interact with each other. This leads to increased productivity and the ability to work together to solve problems.

Taylor

Client Relations Specialist

What's the biggest problem your team is solving?

We are making high-powered, reliable, and efficient cloud computing accessible to digital designers, AI platforms, and researchers (to name just a few) all around the world!

Matt

Team Lead, Cloud

How does the company support your career growth?

CoreWeave has provided me with an amazing environment to grow and develop as an engineer. Having joined straight out of college it was a little intimidating to work directly with our senior leadership team but I really enjoyed the challenge and I am grateful for the experience I was able to gain.

Anthony

Frontend Engineer

What are CoreWeave Perks + Benefits

CoreWeave Benefits Overview

We offer a competitive salary and benefits, including medical and dental insurance, 401(k) with a generous employer match, flexible PTO, catered lunch each day in our NJ office and weekly massages, a casual work environment, and a work culture focused on innovation.

Culture
Open door policy
Open office floor plan
In-person all-hands meetings
Flexible work schedule
Remote work program
Diversity
Hiring practices that promote diversity
Health Insurance + Wellness
Flexible Spending Account (FSA)
Disability insurance
Dental insurance
Vision insurance
Health insurance
Life insurance
Wellness programs
Mental health benefits
Transgender health care benefits
Financial & Retirement
401(K)
401(K) matching
Company equity
Child Care & Parental Leave
Childcare benefits
Generous parental leave
Fertility benefits
Vacation + Time Off
Generous PTO
Paid holidays
Paid sick days
Office Perks
Commuter benefits
Free daily meals
Free snacks and drinks
Professional Development
Job training & conferences
Tuition reimbursement
Lunch and learns
Promote from within
Continuing education stipend

More Jobs at CoreWeave

Apply Now
By clicking Apply Now you agree to share your profile information with the hiring company.
Learn more about CoreWeaveFind similar jobs like this