Lead Site Reliability Engineer with Java

Sorry, this job was removed at 06:22 p.m. (CST) on Friday, Apr 18, 2025
San Antonio, TX
Information Technology • Consulting
The Role

Company Description

Derex Technologies Inc specializes in IT consulting, staffing solutions, and software services. Headquartered in Harrison, New Jersey, since 1996, Derex delivers the highest-quality technology professionals and an array of customized IT talent solutions designed to improve productivity and drive results for global clients throughout North America.

With over two decades of unparalleled experience, Derex supports clients across industries such as Systems Integration, Banking and Finance, Telecommunications, Pharmaceutical and Life Sciences, Energy, Healthcare, Technology, Transportation, and local and federal government agencies.

Job Description

Role: Lead Site Reliability Engineer with Java

Location: San Antonio, Texas

Relevant Experience: 15+ Years

 

 

Job Description & Key Responsibilities:

As a Lead Site Reliability Engineer (SRE), you will leverage your extensive experience in SRE practices to maintain and enhance the reliability, performance, and scalability of mission-critical systems. You will play a crucial role in ensuring the continuous availability and optimal functioning of our services.

Key Responsibilities:

  • Senior-Level SRE Expertise: Apply your deep understanding of SRE principles to lead efforts in improving system reliability and operational efficiency.
  • Incident Management: Provide expert-level support during incidents, ensuring swift resolution with minimal service disruption. Lead post-incident reviews to drive continuous improvement.
  • Monitoring & Alerting: Design, implement, and optimize monitoring, alerting, and incident response processes. Ensure the effectiveness of these systems to proactively address potential issues.
  • Automation: Drive the automation of manual processes to enhance operational efficiency, reduce human error, and increase overall system resilience.
  • CI/CD Pipeline Management: Develop, maintain, and improve automated CI/CD pipelines using tools such as GitLab CI/CD and Jenkins, ensuring seamless and reliable deployment processes.
  • Cross-Functional Collaboration: Work closely with cross-functional teams to ensure the reliability, performance, and scalability of our infrastructure. Foster a culture of collaboration and knowledge sharing.
  • Support Across Time Zones: Provide support across all U.S. time zones, with the flexibility to work weekends, rotational shifts, and overtime as required to maintain service continuity.

 

Required Skills & Qualifications:

  • Java Programming: Advanced proficiency in Java, with a deep understanding of contemporary software development practices.
  • Kubernetes & Containerization: Extensive hands-on experience with Kubernetes, including containerization technologies like Docker and Kubernetes storage solutions such as Portworx.
  • Linux/Unix Systems: Strong command of Linux/Unix operating systems and shell scripting (Bash), with a focus on system reliability and automation.
  • Functional Programming: Proficiency in functional and logic programming languages such as Haskell, OCaml, and Prolog.
  • Scripting & Automation: Experience with Python or Go, particularly in the context of scripting and automation tasks.
  • Virtualization: In-depth knowledge of VMware and other virtualization platforms, with a focus on optimizing virtual environments for reliability and performance.
  • Streaming Technologies: Expertise with Kafka Stream Generator, ksqlDB, cluster federation, and Spark Streams, including experience in managing and optimizing streaming data architectures.
  • Service Mesh & Networking: Familiarity with Istio and Anthos Service Mesh, with the ability to manage and optimize service meshes for complex environments.
  • Performance Monitoring & Debugging: Proficiency in using eBPF (extended Berkeley Packet Filter) for performance monitoring and debugging.
  • Monitoring & Logging Tools: Experience with industry-standard monitoring and logging tools such as Splunk, Prometheus, Datadog, and Kiali.
  • Load Balancing: Familiarity with Nginx Controller and Seesaw for effective load balancing and traffic management.
  • Infrastructure-as-Code (IaC): Competence in using Terraform for managing cloud infrastructure, ensuring consistency and scalability across environments.

 

Additional Requirements:

 

  • Flexibility: Willingness to work weekends and rotational shifts and to provide 24/7 support as necessary to maintain service reliability and meet project deadlines.
  • Required: Kubernetes

 

 

 

Regards,

 

Manoj

Derex Technologies INC

Contact: 973-834-5005 Ext. 206

Additional Information

All your information will be kept confidential according to EEO guidelines.


The Company
HQ: Harrison, NJ
24 Employees
On-site Workplace
Year Founded: 1996

What We Do

DEREX Technologies, Inc., established in 1996, provides professional computer services nationally, including management consulting. We provide expert services nationwide to Fortune 500 companies and other private and public organizations in the United States. Derex provides a multi-faceted portfolio of products and services to its clients, including complete IT solutions.
