Observability Engineer

Sorry, this job was removed at 06:29 a.m. (CST) on Wednesday, Dec 25, 2024
Be an Early Applicant
Gurgaon, Gurugram, Haryana
In-Office
Food • Retail • Agriculture • Manufacturing
At McCain Foods we know the importance that food plays in people's lives.
The Role

Position Title: Observability Engineer
Position Type: Regular - Full-Time
Position Location: Gurgaon
Grade: Grade 05
Requisition ID: 33010
The IT Infrastructure, Engineering and Operations team is looking for an Observability Engineer ideally with expertise in Enterprise Network, Systems, and Application monitoring and logging development.
JOB RESPONSIBILITIES:
• Develop and improve instrumentation for monitoring and logging the health and availability of services.
• Proactively monitor systems, networks, and applications to provide input in improving the stability, security, efficiency, and scalability of systems.
• Develop and maintain Monitoring and Logging Frameworks for all of ITX Take personal responsibility for the quality, reliability and availability of global IT corporate infrastructure.
• Own operations documentation of monitoring and logging for global IT production infrastructure.
• Participate in rotating on-call incident response on the weekdays and on the weekends. Improve operational efficiencies via scripting, bots and integrations.
• Participate cross functionally with vendors and other IT engineering teams to ensure smooth service delivery.
• Network and systems troubleshooting, fault analysis, and resolution.
• Collaborate with Incident and Problem Management to reduce MTTR and Incident volume.
• Design, implement, and maintain AIOps solutions to monitor and analyze IT systems, applications, and networks.
• Deploy machine learning algorithms for anomaly detection, root cause analysis, and incident prediction.
• Configure and manage observability tools and platforms to gain real-time visibility into system health and performance.
• Develop monitoring dashboards, alerts, and reports to provide comprehensive insights into the IT environment.
• Conduct root cause analysis for incidents using data from AIOps and observability tools to identify underlying issues.
• Work closely with software engineers to instrument applications with appropriate logging, metrics, and tracing capabilities
• Continuously analyze monitoring data to identify trends, anomalies, and opportunities for optimization.
• Stay updated with industry trends and advancements in AIOps and observability practices, and recommend new tools or methodologies for adoption
• Designing, developing, and implementing AI models and algorithms utilizing state-of-the-art techniques such as GPT, VAE, and GANs.
• Collaborating with cross-functional teams to define AI project requirements and objectives, ensuring alignment with overall business goals.
• Conducting research to stay up-to-date with the latest advancements in generative AI, machine learning, and deep learning techniques and identify opportunities to integrate them into our products and services.
• Optimizing existing generative AI models for improved performance, scalability, and efficiency.
• Developing and maintaining AI pipelines, including data preprocessing, feature extraction, model training, and evaluation.
• Developing clear and concise documentation, including technical specifications, user guides, and presentations, to communicate complex AI concepts to both technical and non-technical stakeholders.
• Contributing to the establishment of best practices and standards for generative AI development within the organization.
• Providing technical mentorship and guidance to junior team members.
• Apply trusted AI practices to ensure fairness, transparency, and accountability in AI models and systems
• Drive DevOps and MLOps practices, covering continuous integration, deployment, and monitoring of AI
• Utilize tools such as Docker, Kubernetes, and Git to build and manage AI pipelines
• Implement monitoring and logging tools to ensure AI model performance and reliability
• Collaborate seamlessly with software engineering and operations teams for efficient AI model integration and deployment.
• Familiarity with DevOps and MLOps practices, including continuous integration, deployment, and monitoring of AI models.

KEY QUALIFICATION & EXPERIENCES:
• Minimum 10 years of experience in Observability/Monitoring tools
• Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field.
• 5+ years of industry experience in software development.
• In-depth experience designing at scale monitoring and logging for corporate infrastructure services.
• Expert level experience in monitoring and logging technologies, both open source and closed source (e.g. AppDynamics, Newrelic, Datadog, Prometheus, Grafana, LogicMonitor, SumoLogic, ELK)
• Experience in implementing Metrics, Logs and Tracing for E2E observability
• Experience in RBAC and user based security services such as ISE, Radius, LDAP, and AD.
• Must have strong automation/scripting skills - proficiency in Python or Golang is a plus.
• Proficient in developing and maintaining technical documentation, runbooks, and procedures.
• A working knowledge in Network is needed. Fundamental knowledge of TCP/IP stack, application protocols (DHCP/DNS/HTTPs) and networking concepts (HSRP/NAT/VPN/VLANs/802.1x/Wireless/Clustering/High Availability/Load Balancing).
• Understanding of enterprise networks using Cisco IOS/NXOS with a working knowledge of IP Protocols (TCP/UDP/ICMP) and Routing Protocols (BGP/OSPF/IS-IS).
• Technology understanding of Cisco, Cloud Native Firewalls, including Firewall Policy Rules, URL-Filtering, App-ID, User-ID, etc.
• Experience interacting with Telco and Global ISPs (WAN/DIA) and the monitoring of those services.
• A working knowledge of systems is needed. Fundamental knowledge of Configuration Management and Automation tools, with experience in: * Terraform, Ansible, Chef, Puppet, Jenkins
* Designing and implementing CI/CD pipelines * Infrastructure provisioning and management
• Strong in troubleshooting incidents in production environment.
• A strong ownership attitude and a track record of taking responsibility for problems and pushing through to resolution.
• Bachelor's degree in Computer Science or EE, or relevant industry experience is required.
• Ability to communicate and coordinate with cross-functional engineering teams across multiple geographic regions.
• Experience with AIOps and machine learning is highly desirable.
• Knowledge of OpenTelemetry is an added advantage.
• Experience with other monitoring tools like Prometheus, Grafana, etc.
• Experience with Observability solutions like Dynatrace, DataDog, Instana etc. is highly desirable
• Experience working with mainframe systems is a plus (willingness to learn is also acceptable).
• Excellent problem-solving and analytical skills.
• Strong communication and collaboration skills.
• Ability to work independently and manage multiple projects simultaneously.
• Passion for learning new technologies and continuous improvement.
• In-depth knowledge of machine learning, deep learning, and generative AI techniques
• Knowledge and experience in Generative AI
• Proficiency in programming languages such as Python, R, and frameworks like TensorFlow or PyTorch
• Strong understanding of NLP techniques and frameworks such as BERT, GPT, or Transformer models
• Familiarity with computer vision techniques for image recognition, object detection, or image generation
• Experience with cloud platforms such as Azure or AWS
• Knowledge of IT operations concepts and processes, such as monitoring, incident management, root cause analysis, remediation.
Nice To Have:
• Ability to take lead in an operations environment.
• Contributed to Open Source - your public Git repos/contributions show good examples of giving back to the community.
• Architected a monitoring and logging infrastructure that was technology agnostic for a production infrastructure environment.
• Knowledge of revision control software such as GIT.
• Familiarity with REST APIs scripting, i.e. with PAN OS API / Infoblox WAPI.
McCain Foods is an equal opportunity employer. We see value in ensuring we have a diverse, antiracist, inclusive, merit-based, and equitable workplace. As a global family-owned company we are proud to reflect the diverse communities around the world in which we live and work. We recognize that diversity drives our creativity, resilience, and success and makes our business stronger.
McCain is an accessible employer. If you require an accommodation throughout the recruitment process (including alternate formats of materials or accessible meeting rooms), please let us know and we will work with you to meet your needs.
Your privacy is important to us. By submitting personal data or information to us, you agree this will be handled in accordance with the Global Employee Privacy Policy
Job Family: Information Technology
Division: Global Digital Technology
Department: I and O Project Delivery
Location(s): IN - India : Haryana : Gurgaon
Company: McCain Foods(India) P Ltd

What the Team is Saying

Areej
Sandra
Peter
Chuk

Similar Jobs

McCain Foods Logo McCain Foods

L&D Specialist

Food • Retail • Agriculture • Manufacturing
In-Office
Gurugram, Haryana, IND
20000 Employees

McCain Foods Logo McCain Foods

Site Reliability Engineer

Food • Retail • Agriculture • Manufacturing
In-Office
Gurugram, Haryana, IND
20000 Employees
60K-100K Annually

McCain Foods Logo McCain Foods

Process Release Management Engineer

Food • Retail • Agriculture • Manufacturing
In-Office
Gurugram, Haryana, IND
20000 Employees

McCain Foods Logo McCain Foods

Support Engineer

Food • Retail • Agriculture • Manufacturing
In-Office
Gurugram, Haryana, IND
20000 Employees
Get Personalized Job Insights.
Our AI-powered fit analysis compares your resume with a job listing so you know if your skills & experience align.

The Company
HQ: Florenceville-Bristol, NB
20,000 Employees
Year Founded: 1957

What We Do

The power it has to uplift and bring people, Guided by our purpose - Celebrating real connections through delicious, planet-friendly food - we believe that working together with our teams, business and community partners will bring sustainable growth and positive change - today, tomorrow and for generations to come.

As a privately owned family company with over 60 years of experience, a presence in over 160 countries and a global team of 22,000 people, our values and culture are at the heart of everything we do. Our product quality, people and customer dedication help us achieve global sales in excess of CDN $10 billion. Through our investment and innovation, we continue to be a global leader in prepared potato products, including our famous French Fries and appetizers.

We are passionate about supporting and developing our people-providing opportunities to grow and learn in their roles, as well as building careers for the long term.

Why Work With Us

We are working to bring digital tools and data into our processes to drive efficiency, automation and data-driven insights. From connecting our business, enabling our supply chain, supporting our customers, to reinventing agriculture. So if you are a tech expert looking to join a company transforming technology, think of McCain.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

McCain Foods Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Flexible
Company Office Image
HQFlorenceville
Company Office Image
HQOakbrook Terrace, IL
Company Office Image
HQToronto, ON
McCain Foods Australia & New Zealand
Potato Processing Technology Centre
McCain Foods South Africa
McCain Foods GB Limited
Taman Tasik Indah, Kuala Lumpur
Learn more

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account