Position Title: DevSecOps & Site Reliability Engineer
Position Type: Regular - Full-Time
Position Location: Florenceville GTC
Requisition ID: 32708
JOB PURPOSE:
The SRE Engineer will work alongside our cloud and managed infrastructure stakeholders to ensure McCain systems are operating optimally.
JOB RESPONSIBILITIES:
- Designing, implementing, and administering IT infrastructure to support current and future business requirements, including physical and cloud compute/storage environments, network and communication infrastructure, and endpoint device configuration
- Experience in problem solving and analyzing complex enterprise systems, and navigating enterprise software, deployment and management of workloads on Cloud, on-premise systems
- Drive and influence integrated DevOps solutions across business, product, platform, infrastructure, development, support/DevOps teams that improve the design and operation of systems, making them scalable, reliable, and efficient while ensuring performance and high availability of products/services
- Overseeing and maintaining backup tools, topology, and disaster recovery processes
- Implementing and supporting maintenance and upgrades of system infrastructure such as Host hardware (IBM iSeries/pSeries, HP, Lenovo), SAN technologies, SQL, VMWare, Hyper-V, and cloud technologies
- Spearhead the development of SRE solutions (monitoring and alerting, machine learning anomaly detection, self-healing and reliability testing) for both on-prem and cloud systems
- Performing regular system monitoring, verifying the integrity and availability of all hardware, server resources, systems, and processes for service level integrity and performance
- Improve service reliability through blameless post-incident reviews and using code to prevent or respond to problem recurrence.
- Define and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to maintain high service availability
- Identifying and resolving production capacity, contention, resource, and application deficiencies on both on-prem and cloud systems
- Collaborate with engineering teams to improve availability, reliability, and observability of their services.
- Optimize existing on-premises systems and eliminate toil through automation, optimizing deployment processes, and enhancing the scalability of our infrastructure..
- Implementing observability, AIOPS across complex cloud workloads and technology stacks.
- Manage daily operations and functionality of site reliability solutions and applications.
- Conduct post-incident analysis to identify root causes, implement corrective actions, and prevent similar issues in the future
- Scripting/Automation - Python, Scripting YAML, Bash, Terraform, Power shell
- Proficiency in test framework automation, test design, test data management
- Oversee enterprise patch management and CMDB updates
- Perform application production support role and troubleshoot incidents
- Implement security controls at every stage of the deployment pipeline to detect and mitigate vulnerabilities.
- Develop and maintain automated processes for security testing, deployment, and infrastructure provisioning.
- Implement Infrastructure as Code (IaC) practices to ensure consistent and secure infrastructure configurations.
- Establish and maintain continuous monitoring processes to detect and respond to security incidents promptly.
- Collaborate with incident response teams to investigate and address security breaches or incidents.
- Security Audits and Compliance:
- Deploying and Configuring Azure Firewalls, Azure VPN Gateways and NVAs
- Manage and monitor security health of platforms to ensure that issues and risk are quickly identified and resolved.
- Collaborate with the IT operations and development teams to plan and execute system changes e.g., security and audit controls as required by the business or compliance requirements.
- Automate build and release manual activities using DevSecOps best practices.
KEY QUALIFICATION & EXPERIENCES:
- 7 + years' experience in IT administration/engineering roles
- 3 - 5 years' experience in cloud engineering, SRE roles
- Bachelor's degree in computer science, information systems or other related field (or equivalent work experience)
- Strong understanding and working experience of CI/CD and GitOps.
- Broad exposure to IT infrastructure and application landscape with technical depth in Cloud platforms
- Extensive experience with documenting and optimizing operational processes
- Extensive experience engineering both cloud and on-prem environments
- Proficiency with container orchestration tools such as Kubernetes and Helm.
- Strong understanding of networking concepts and protocols.
OTHER INFORMATION
- Key internal relationships: Director, IT Operations & Platform Support , Data & Analytics Teams, IT Application Support, IT Architects, ITSM Manager, Network Services, IT Security, IT Operations (internal and external).
- Key external relationships: External vendors, partners and service providers.
- Travel: as required.
- Job is primarily performed in a standard office environment.
McCain Foods is an equal opportunity employer. We see value in ensuring we have a diverse, antiracist, inclusive, merit-based, and equitable workplace. As a global family-owned company we are proud to reflect the diverse communities around the world in which we live and work. We recognize that diversity drives our creativity, resilience, and success and makes our business stronger.
McCain is an accessible employer. If you require an accommodation throughout the recruitment process (including alternate formats of materials or accessible meeting rooms), please let us know and we will work with you to meet your needs.
Your privacy is important to us. By submitting personal data or information to us, you agree this will be handled in accordance with the Global Privacy Policy
Job Family: Information Technology
Division: Global Digital Technology
Department: IT Operations and Platform Support
Location(s): CA - Canada : New Brunswick : Florenceville-Bristol || CA - Canada : Ontario : Toronto
Company: McCain Foods (Canada)
Top Skills
What We Do
The power it has to uplift and bring people, Guided by our purpose - Celebrating real connections through delicious, planet-friendly food - we believe that working together with our teams, business and community partners will bring sustainable growth and positive change - today, tomorrow and for generations to come.
As a privately owned family company with over 60 years of experience, a presence in over 160 countries and a global team of 22,000 people, our values and culture are at the heart of everything we do. Our product quality, people and customer dedication help us achieve global sales in excess of CDN $10 billion. Through our investment and innovation, we continue to be a global leader in prepared potato products, including our famous French Fries and appetizers.
We are passionate about supporting and developing our people-providing opportunities to grow and learn in their roles, as well as building careers for the long term.
Why Work With Us
We are working to bring digital tools and data into our processes to drive efficiency, automation and data-driven insights. From connecting our business, enabling our supply chain, supporting our customers, to reinventing agriculture. So if you are a tech expert looking to join a company transforming technology, think of McCain.
Gallery
McCain Foods Offices
Hybrid Workspace
Employees engage in a combination of remote and on-site work.