Site Reliability Engineering Manager

Sorry, this job was removed at 5:00 p.m. (CST) on Wednesday, November 17, 2021

DAT is looking for a Site Reliability Manager to join our team based in our Beaverton, Oregon office.
About the team
As a Site Reliability Manager, you will drive system reliability and developer productivity and reduce time to market by reducing the technical debt of the services your SRE team supports. We seek a manager who is passionate about system reliability to influence and drive the strategic SRE mission.
As a Systems Reliability Manager working on critical services, your mission will be to ensure our services are fast, highly available, scalable, and able to withstand unprecedented increases in load. The Systems Reliability Engineering group will be at the heart of solving production problems and driving proactive initiatives to prevent outages and security issues. Your scope runs from the kernel to the application. The position requires the flexibility to take a holistic approach to troubleshooting and the ability to delve deeply into technical details.
The Systems Reliability Manager leads a team of centralized specialists and engineers co-located with the various application development teams. This team is key to building automation tools for system health and for deployment to cloud and datacenter environments. The team is also responsible for ensuring the system is well instrumented and highly fault tolerant. It owns our problem process and ensures that, when failures do occur, reliability issues are remediated so they don't recur.
About the role

  • The successful candidate will possess an outstanding record of professional experience and will thrive in an environment that demands accountability. They must possess significant technology management and product development experience. They must also have strong planning, organizational, and communication skills, and be a key driver in helping the team understand the big picture.
  • Proven leader of technology solutions in a high volume transaction environment.
  • Accomplished leader with 5+ years managing regional and global areas.
  • Have excellent time management, communication, decision-making, presentation, and organizational skills.
  • Maintain excellent written and verbal communications with clients, employees, and management chain, including status reports, project plans, presentations, etc.
  • Ability to lead across functions and motivate a matrix staff


What You'll Do

  • Engage, influence, and evangelize SRE practices with development, operational and product groups to align technology service/solution delivery.
  • Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring actions are followed up.
  • Manage the availability, latency, scalability, and efficiency of DAT applications by instilling reliability engineering into our development life cycle, with a focus on fault-tolerant approaches.
  • Drive capacity planning, performance analysis, instrumentation and other non-functional systems requirements.
  • Define and report progress on strategic initiatives and project-level tasks to all stakeholders, including senior executives and clients, using effective communication approaches for each constituency.
  • Implement metrics-driven processes to ensure service quality targets are met.


The Skills You'll Need

  • Understanding of how to influence peers and other leaders to build a culture around reliability and transparency
  • Strong management skills, with a servant leadership mindset.
  • Expert knowledge of all aspects of designing, developing, and managing large real-time systems.
  • Project and process management
  • Prior successful experience as a systems performance or site/systems reliability engineer.
  • Mastery of Linux/Unix.
  • Knowledge of Windows operating systems
  • Knowledge of NodeJS, Python or Golang Programming
  • Mastery of fault-tolerant approaches in large-scale distributed environments and high-performance systems.
  • Demonstrated experience working in large, complex systems environments.
  • Deep understanding of internet and networking protocols.
  • A passion for performance excellence and robustness, and an engineering mindset


About DAT
DAT is a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 43 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on DAT for the most relevant data and most accurate insights to help them make smarter business decisions and run their companies more profitably. We operate the largest marketplace of its kind in North America, with 226 million freight posts in 2020, and a database of $110 billion of annual global shipment market transaction data. We have co-headquarters in Portland, OR and Denver, CO, and additional offices in MO, TX, and Bangalore, India.
For additional information, see www.DAT.com/company
DAT embraces the value of a diverse workforce, and believes it is a core strength of our company that we encourage those values in every DAT employee, at every level of our organization, regardless of tenure or rank. We provide equal employment opportunities (EEO) to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state, and local laws.
DAT offers competitive compensation and an excellent benefit package that includes medical, dental, and vision coverage, flexible savings accounts, 401K, Life and AD&D insurance, a comprehensive Paid Leave program, and a Tuition Reimbursement program.
All referrals and résumés are managed exclusively through the Human Resources Department.
DAT will not consider unsolicited résumés from vendors including search firms, fee-based referral services, and/or recruitment agencies.

More Information on DAT Freight & Analytics
DAT Freight & Analytics operates in the eCommerce industry. The company is located in Beaverton, OR and Denver, CO. DAT Freight & Analytics was founded in 1978. It has 500 total employees. It offers perks and benefits such as volunteering in the local community, partnerships with nonprofits, an open door policy, an OKR operational model, team-based strategic planning, and pair programming.