Job Description:
- Manage infrastructure scalability and reliability.
- Monitor system performance.
- Handle incident response and postmortems.
- Site Reliability Engineering, Analytics, Elastic Stack, CI/CD pipelines
- Deep understanding of monitoring tools like Grafana, ELK
- Knowledge of incident response and troubleshooting complex distributed systems
- Familiarity with infrastructure as code (IaC) tools like Terraform
- Proactive problem-solving to preemptively address infrastructure issues
- Machine learning and ML model deployment skills to integrate ML-based monitoring and alerting
- Understanding of ML frameworks (e.g., TensorFlow, PyTorch) and ML Ops for model reliability in production environments
Recruitment fraud is a scheme in which fictitious job opportunities are offered to job seekers, typically through online services such as fake websites or through unsolicited emails claiming to be from the company. These emails may ask recipients to provide personal information or to make payments as part of an illegitimate recruiting process. DXC does not make offers of employment via social media networks, and DXC never asks applicants for any money or payments at any point in the recruitment process, nor asks a job seeker to purchase IT or other equipment on our behalf. More information on employment scams is available here.
What We Do
DXC Technology is a Fortune 500 global IT services leader. Our more than 130,000 people in 70-plus countries are entrusted by our customers to deliver what matters most. We use the power of technology to deliver mission-critical IT services across the Enterprise Technology Stack to drive business impact. DXC is an employer of choice with strong values and fosters a culture of inclusion, belonging and corporate citizenship. We are DXC.