Site Reliability Engineer

Posted 8 Days Ago
Easy Apply
Be an Early Applicant
Hiring Remotely in India
Remote
Mid level
Software
The Role
The Site Reliability Engineer ensures the reliability and performance of Prophecy's platform, managing Kubernetes, networking, identity, and observability across multi-cloud environments.
Summary Generated by Built In
About Prophecy 

The leader in AI-native data preparation and analysis, Prophecy is revolutionizing how the world’s top enterprises turn data chaos into reliable insights. We introduce the AI-native data lifecycle (generate, refine, deploy) where our industry leading AI agents and humans work hand-in-hand in visual and document interfaces to analyze, transform and prepare data, to ship trusted insights at enterprise scale. To learn more, visit us on LinkedIn

Position Summary

As a Site Reliability Engineer (SRE), you will ensure the reliability, scalability, and performance of Prophecy’s platform across multi-cloud and SaaS environments. You will provide technical expertise in Kubernetes, networking, identity, observability, and automation, working to resolve challenges that impact the availability and resilience of our platform. Customers and internal teams will look to you for solutions ranging from infrastructure troubleshooting to complex architectural designs spanning Kubernetes, cloud-native services, and enterprise security. You will partner closely with product engineering and support teams to deliver a highly reliable experience to our enterprise customers. 

The Impact You Will Have
  • Operate and optimize Kubernetes platforms (EKS, AKS, GKE) with Helm, namespaces, pods, autoscaling, node pools.
  • Manage ingress & networking: NGINX, ALB/AGIC, DNS, TLS/certificates, proxies, VNET/VPC routing, PrivateLink/peering.
  • Implement identity & secrets management: SSO (OIDC/SAML), SCIM, service principals/managed identities, vaults, key rotation.
  • Maintain platform service health across UI, APIs, orchestrators, workflow services using readiness/liveness probes and capacity planning.
  • Enable storage & I/O: object stores (S3, ADLS, GCS), DBFS mounts, IAM roles, access connectors, throughput/quota optimization.
  • Execute release & upgrades: version rollouts, canary/blue-green strategies, rollback automation, image registries, SBOM/vulnerability scanning.
  • Deliver observability: build dashboards, log pipelines, SLO/SLA monitoring with Prometheus, Grafana, CloudWatch, Log Analytics, ELK.
  • Strengthen resilience & DR: multi-AZ architectures, backup/restore, chaos testing, RTO/RPO validation, recovery runbooks.
  • Drive release automation: GitOps (ArgoCD/Flux), pre-flight checks, automated smoke tests, post-upgrade validation suites.
  • Ensure cloud-specific reliability: IAM, private connectivity, security groups, application gateways across AWS, Azure, GCP.
  • Enforce security & compliance: CIS hardening, benchmarks, network segmentation, vulnerability management, auditability.
  • Support high-governance SaaS deployments: dedicated SaaS controls, change control, strict egress policies, artifact provenance, customer-owned KMS.
What We Look For
  • 4-7 years in SRE, platform engineering, or enterprise production support.
  • Strong hands-on experience with Kubernetes and multi-cloud (AWS, Azure, GCP).
  • Expertise in networking, identity, secrets, and platform automation.
  • Proven track record in observability, reliability engineering, and incident management.
  • Familiarity with GitOps/CI/CD pipelines and modern automation practices.
  • Strong problem-solving, ownership, and ability to work in a fast-moving startup culture.
  • Technical degree or the equivalent experience.
What You'll Have At Prophecy
  • Great company culture.
  • Competitive compensation.
  • Fair and Open Equity awards for everyone.
  • Flexible hybrid work environment
  • Private medical insurance.
  • Learning and career development opportunities
  • End-to-end project ownership and high-growth career path

Our Commitment to Diversity and Inclusion

At Prophecy, we hire for merit and foster an inclusive culture where people from diverse backgrounds can excel and do their best work. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Prophecy are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and any other protected characteristics under applicable laws.

Top Skills

Agic
Alb
Argocd
AWS
Azure
Cloudwatch
Elk
Flux
GCP
Gitops
Grafana
Helm
Kubernetes
Log Analytics
Nginx
Prometheus
Tls
Am I A Good Fit?
beta
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: San Francisco, CA
94 Employees
Year Founded: 2017

What We Do

Prophecy.io is the only low-code data engineering platform. Prophecy democratizes the development and deployment of high-quality data pipelines, uniquely combining visual development with agile software engineering best practices. The developed code is open source and is targeted at Apache Spark & Apache Airflow. Prophecy is headquartered in Silicon Valley.

Similar Jobs

Boomi Logo Boomi

Site Reliability Engineer

Cloud • Information Technology • Productivity • Software • Automation
Remote
India
2200 Employees

BlackLine Logo BlackLine

Senior Site Reliability Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Remote or Hybrid
Bengaluru, Bengaluru Urban, Karnataka, IND
1810 Employees

Sleek Logo Sleek

Site Reliability Engineer

Fintech • Financial Services
In-Office or Remote
5 Locations
405 Employees
Remote
Shri Bhrigukshetra, BLR, Uttar Pradesh, IND
15967 Employees

Similar Companies Hiring

Scotch Thumbnail
Software • Retail • Payments • Fintech • eCommerce • Artificial Intelligence • Analytics
US
25 Employees
Milestone Systems Thumbnail
Software • Security • Other • Big Data Analytics • Artificial Intelligence • Analytics
Lake Oswego, OR
1500 Employees
Fairly Even Thumbnail
Software • Sales • Robotics • Other • Hospitality • Hardware
New York, NY

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account