Key Responsibilities
- Define and drive organization-wide performance engineering strategy aligned with business KPIs, customer experience, and cost efficiency
- Architect and build scalable, self-service performance engineering platforms enabling teams to run performance tests and analysis independently
- Design and implement AI-driven performance engineering solutions including anomaly detection, predictive performance insights, adaptive load testing, and automated optimization recommendations
- Lead the design and execution of advanced performance testing strategies for serverless, distributed, and event-driven systems
- Establish and standardize performance benchmarks, SLAs, SLOs, and KPIs across services
- Drive integration of performance testing and validation into CI/CD pipelines to enable continuous performance engineering (shift-left approach)
- Analyze system-wide performance bottlenecks including latency, cold starts, concurrency limits, and resource utilization across distributed systems
- Collaborate with engineering, SRE, and architecture teams to influence system design for scalability, resilience, and performance optimization
- Own performance in production environments by leveraging observability tools, distributed tracing, and real-time monitoring systems
- Implement intelligent observability solutions using tools such as CloudWatch, Datadog, New Relic, and AI-based monitoring platforms
- Lead capacity planning and scalability initiatives for high-throughput and globally distributed systems
- Drive cost-performance optimization strategies in cloud-native environments (FinOps alignment)
- Mentor and guide engineers across teams, promoting a performance-first culture and best practices
- Stay updated with emerging trends in performance engineering, including AI/ML-driven optimization and cloud-native innovations
Desired Skill and Requirements
Must Have
- 8+ years of experience in performance engineering within large-scale SaaS or cloud-native environments
- Performance testing tools - JMeter, Gatling, Locust, or similar
- Serverless architectures - AWS Lambda, API Gateway, event-driven systems
- Performance monitoring and observability tools - CloudWatch, Datadog, New Relic, distributed tracing systems
- Building performance engineering frameworks or platforms at scale
- Performance optimization in distributed and serverless systems - latency, cold starts, concurrency, and scaling behavior
- Integration of performance engineering into CI/CD pipelines
- Programming/scripting - Python (preferred), Java, or similar
- AI/ML-based performance optimization techniques - anomaly detection, predictive analysis, adaptive load modeling
- Cloud platforms (AWS preferred) and performance optimization techniques
- Ability to identify and resolve complex performance bottlenecks
- Large-scale load testing and capacity planning
- Cost-performance optimization in cloud environments
Good To Have
- Kubernetes, containerized, and serverless architectures
- Chaos engineering and resilience testing
- Internal developer platforms and self-service tooling
- FinOps and cloud cost optimization strategies
- Globally distributed and multi-region architectures
- API performance optimization
- Modern distributed data stores - DynamoDB, Aurora Serverless, NoSQL systems
- AIOps platforms and intelligent observability systems
Soft Skills
- Strong problem-solving and analytical thinking
- Ability to influence architectural and technical decisions across teams
- Excellent communication and stakeholder management skills
- Ownership mindset with the ability to drive cross-functional initiatives
- Mentorship and leadership capabilities
- Ability to operate in a fast-paced, high-growth SaaS environment
Experience
- 8+ years of experience in performance engineering in large-scale SaaS or cloud-native environments
- 3+ years of experience in Senior, Lead, or Staff-level performance engineering roles
- 4+ years of experience performance testing large-scale SaaS or distributed systems
- 5+ years of hands-on experience with performance testing tools such as JMeter, Gatling, k6, or Locust
- Experience designing and executing large-scale performance tests in production-like environments
- Experience identifying and resolving performance bottlenecks across application, database, network, and infrastructure layers
- Experience tuning databases for performance at scale
- Experience defining and implementing performance benchmarks, KPIs, and capacity planning strategies
- Experience working with observability and monitoring platforms for performance analysis
- Experience optimizing event-driven and serverless architectures
- Experience influencing architecture and engineering decisions across teams and domains
- Experience operating in fast-paced, high-growth SaaS environments
Education
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
- Equivalent practical experience in performance engineering or cloud-native systems
Skills Required
- 8+ years of experience in performance engineering within large-scale SaaS or cloud-native environments
- Performance testing tools - JMeter, Gatling, Locust, or similar
- Serverless architectures - AWS Lambda, API Gateway, event-driven systems
- Performance monitoring and observability tools - CloudWatch, Datadog, New Relic, distributed tracing systems
- Building performance engineering frameworks or platforms at scale
- Performance optimization in distributed and serverless systems - latency, cold starts, concurrency, and scaling behavior
- Integration of performance engineering into CI/CD pipelines
- Programming/scripting - Python (preferred), Java, or similar
- AI/ML-based performance optimization techniques - anomaly detection, predictive analysis, adaptive load modeling
- Cloud platforms (AWS preferred) and performance optimization techniques
- Ability to identify and resolve complex performance bottlenecks
- Large-scale load testing and capacity planning
- Cost-performance optimization in cloud environments
MontyCloud Compensation & Benefits Highlights
The following summarizes recurring compensation and benefits themes identified from responses generated by popular LLMs to common candidate questions about MontyCloud and has not been reviewed or approved by MontyCloud.
-
Fair & Transparent Compensation — Feedback suggests compensation and benefits are viewed favorably overall, indicating competitive pay positioning for many roles.
-
Healthcare Strength — Job postings indicate medical, dental, and vision coverage as part of a comprehensive package in the U.S.
-
Equity Value & Accessibility — Listings highlight equity participation as a standard component, signaling accessible ownership opportunities for employees.
MontyCloud Insights
What We Do
MontyCloud is a Seattle, WA based intelligent Cloud Management Platform Company. Our customers use MontyCloud DAY2™ to instantly close the cloud skills gap, simplify CloudOps, and reduce the total cost of cloud operations up to 70%, all in just a few clicks. By leveraging the AWS public cloud, AI, and ML, DAY2 ™ simplifies provisioning, security, compliance, cost optimization, and routine operations. DAY2™’s automation first, No-Code approach helps customers immediately derive deep insights and deliver intelligent Cloud Operations in just a few minutes. You can try the platform for free at https://MontyCloud.com








