The Role
Design and enhance ML system infrastructure, build dashboards for performance tracking, and collaborate on ML projects in a DevOps environment.
Summary Generated by Built In
Level AI (thelevel.ai) is an Enterprise SaaS startup. Our vision is to build AI tools that augment, not replace, humans. Our first market is in contact centres.
Level AI was founded in 2019 and is a Series C startup headquartered in Mountain View, California. Level AI revolutionises customer engagement by transforming contact centres into strategic assets. Our AI-native platform leverages advanced technologies such as Large Language Models to extract deep insights from customer interactions. By providing actionable intelligence, Level AI empowers organisations to enhance customer experience and drive growth. Consistently updated with the latest AI innovations, Level AI stands as the most adaptive and forward-thinking solution in the industry.
Responsibilities:
- Design, build, and enhance the core components of state-of-the-art machine learning system infrastructure (cloud and on-premise), and architect platforms to create, train, and deploy ML models.
- Build operating dashboards and charts to track system errors and performance and to enable root cause analysis.
- Identify gaps and evaluate relevant tools and technologies as needed to improve processes and systems, leveraging open-source and cloud computing technologies to build effective solutions.
- Collaborate with the AI team to drive ML projects from conception to completion and production monitoring.
Requirements:
- Bachelor's degree or above with a good academic background.
- 2-4 years of meaningful work experience in DevOps handling complex services.
- Strong troubleshooting skills to keep our services highly available.
- Strong expertise and experience with Google Cloud Platform (GCP), Docker, Kubernetes, CI/CD, and Jenkins.
- Extensive experience in designing, implementing, and maintaining infrastructure as code, preferably using Terraform.
- Experience creating and maintaining deployment manifest files for microservices using Helm.
- Having LLMOps or MLOps experience is a bonus.
- Strong expertise in deploying at scale on a Kubernetes cluster using the Horizontal Pod Autoscaler (HPA).
- Broad technical background and experience with architecture, design, and operations of cloud solutions and how to meet security compliance requirements.
- Experience monitoring system health and ensuring security, scalability, and reliability.
- Ability to design, implement, and maintain observability, monitoring, logging, and alerting using tools like Prometheus, Grafana, Promtail, Loki, and Datadog.
Compensation: We offer market-leading compensation, based on the skills and aptitude of the candidate.
To learn more, visit: https://thelevel.ai/
Funding: https://www.crunchbase.com/organization/level-ai
LinkedIn: https://www.linkedin.com/company/level-ai/
Our AI platform: https://www.youtube.com/watch?v=g06q2V_kb-s
Top Skills
CI/CD
Datadog
Docker
Google Cloud Platform
Grafana
Helm
Jenkins
Kubernetes
Loki
Prometheus
Promtail
Terraform
The Company
What We Do
Level AI (https://thelevel.ai) is a startup based in Mountain View, CA, and Delhi, India, innovating in the Voice AI space. We are backed by top VCs, Silicon Valley technologists, and industry experts. Our mission is for AI to augment workers, not replace them. We are innovating in speech AI, NLP, and information retrieval systems to bring customers and businesses closer to one another.
The team has experience from Amazon Alexa, Google, and other leading AI organizations.

