Senior Site Reliability Engineer - Network - Remote

Posted 22 Days Ago
Hiring Remotely in United States
Remote
5-7 Years Experience
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
We deliver end-to-end risk and compliance solutions through our software and domain expertise.
The Role
Senior Site Reliability Engineer responsible for maintaining fast, stable, and optimized networks in SaaS products. Champions SRE culture, designs secure networks, implements monitoring and alerting, troubleshoots issues, and automates system operations. Collaborates with IT to integrate SaaS products into broader network topologies.
Summary Generated by Built In

Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions at every stage of your business and investment lifecycles. As markets fluctuate, regulations evolve and technology advances, we're there. And through it all, we deliver confidence with the right solutions in moments that matter.
Summary:
We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.
The Senior Site Reliability Engineer - Network is responsible for ensuring the networks in our SaaS products are fast, stable and optimized for our customers. SREs at DFIN take on availability, performance, change management, monitoring, and response, and are the guardians of non-functional requirements.
You either have a network infrastructure background with a programmatic, automated mindset, or you come from a software engineering background with extensive network infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate the manual work needed to keep our products up, running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.
Responsibilities:

  • Champion and implement a culture of SRE to maintain a reliable and performant network infrastructure in DFIN SaaS products
  • Design and implement secure, redundant, fault-tolerant networks in DFIN SaaS products, drawing on an understanding of networking protocols and network elements and of how they integrate to create resilient networks
  • Choose and configure common network elements in SaaS product network topologies including load balancers, routers, DNS, etc.; provision route tables and routing paths in DFIN SaaS products so development teams do not have to
  • Define, lead the implementation of, and maintain SaaS product network monitoring and alerting to prevent client-impacting issues and ensure network availability, performance and scalability to maintain SLOs and SLAs
  • Identify and remediate issues in SaaS product network infrastructure (high latency, timeouts, dropped connections, etc.) using diagnostic tooling and network traces; perform thorough Root Cause Analysis (RCA); drive vendor partners (Microsoft) to provide quality assurances by requiring immediate defect fixes, software updates, etc., as necessary to ensure an ideal customer experience
  • Serve as a senior escalation point for SaaS product network issues and collaborate with DFIN IT to integrate SaaS products into broader DFIN network topologies
  • Automate everything including system operational runbooks
  • Dive deep into technology and stay on the forefront of the latest network analysis tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
  • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable to expectations
  • Learn continuously and apply lessons learned
  • Evangelize best practices, eliminate bottlenecks, and improve process
  • Participate in on-call duties 24/7/365 and lead the triage and RCA of production incidents


Qualifications:

  • BS in Computer Science or equivalent work experience
  • Thorough understanding of common networking protocols including IP, TCP/IP, ICMP, DNS, DHCP, ARP, SSL, TLS and how to diagnose network issues by isolating problems at the protocol layer within specific network elements
  • 5+ years of experience with Azure network design and network element configuration, including provisioning of routing tables
  • 5+ years of experience monitoring and preventing issues in SaaS network topologies in Azure
  • 5+ years of experience implementing network performance, availability, and scalability monitoring and alerting using tooling such as SolarWinds
  • 5+ years of experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
  • 5+ years of experience as a global admin of Azure, including cloud cost management
  • 5+ years of experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows or Linux environments
  • 5+ years of experience supporting public, client-facing, revenue-generating systems
  • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
  • Experience planning, coordinating, developing and executing all stages of post-deployment verification test scripts
  • Experience securing Windows or Linux systems in a 24x7 production environment
  • Experience with containerization and managing Kubernetes clusters (AKS or EKS)


It is the policy of Donnelley Financial Solutions to select, place and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to [email protected].

The Company
HQ: Chicago, IL
2,600 Employees
Hybrid Workplace
Year Founded: 2016

What We Do

DFIN is a leading global risk and compliance solutions company. We provide domain expertise, software and data analytics for every stage of our clients’ business and investment lifecycles. Markets fluctuate, regulations evolve, technology advances, and through it all, DFIN delivers confidence with the right solutions in moments that matter.

Why Work With Us

DFIN is shaping global markets and is an environment where you can bring your whole self to work and do your best work every day. We are a values-based culture in which you can build a rewarding career.

DFIN Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We operate in a fully flexible work environment. Our employees can continue to work remotely, and our offices remain open and available for collaboration.

Typical time on-site: Flexible
HQ: Chicago, IL
Located in the heart of downtown Chicago’s financial district, we are steps from all Metra stations, good eats and entertainment.
