What You’ll Do
- Oversee the reliability, performance, and security of critical production services from design to deployment, ensuring they meet our uptime and performance targets.
- Collaborate with development, QA, and product teams to build and maintain resilient infrastructure and efficient deployment pipelines.
- Automate infrastructure provisioning and software deployments using Infrastructure as Code and CI/CD tools, reducing manual work and errors.
- Participate in and improve our 24×7 on-call process, swiftly troubleshooting incidents and performing root cause analysis to prevent recurrence.
- Document and standardize processes and configurations, sharing knowledge to strengthen the capabilities of the entire engineering team.
Essential Skills & Experience
- 5-7 years of experience in DevOps, SRE, or Software Engineering roles, with increasing responsibility in system design and operations.
- Extensive experience with containerization (Docker) and orchestration (Kubernetes) in production environments, including managing and scaling clusters.
- Proficiency in Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet) to automate infrastructure provisioning.
- Strong coding and scripting skills in languages like Python, Go, or Ruby, with the ability to build automation tools for system management.
- Deep knowledge of cloud platforms (AWS and/or GCP) and their services, with experience designing and operating cloud-based infrastructure at scale.
- Solid understanding of networking and security fundamentals in cloud and on-prem environments.
- Experience setting up and tuning monitoring/alerting systems (Prometheus, Grafana, etc.), and a thorough understanding of SRE best practices (SLIs, SLOs, incident management).
- Strong problem-solving and communication skills, with a track record of working effectively in collaborative team environments.
Preferred (Not Essential)
- Kafka
- MySQL/Postgres
- Redis
- Elasticsearch
If you're a forward-thinking engineer who excels at solving complex infrastructure challenges and is passionate about automation and reliability, we'd love to hear from you. Apply today and help us elevate our infrastructure to the next level!
What We Do
Brandwatch is the world’s leading digital consumer intelligence company, allowing users to analyze and utilize conversations from across the web and social media.
It is the perfect platform to make sense of your consumers, their needs, wants, and interests.
With official access to the Twitter, Reddit, and Tumblr firehoses, plus data from 100 million other sites, our historical archive includes over a trillion conversations, with 501 million new ones added every day.
Our platform then combines queries and AI to help you parse and analyze the data that's useful to you. From there it can be chopped, sliced, and combined to find insights you can put into action.









