Site Reliability Engineer (SRE)

Sorry, this job was removed at 05:35 p.m. (CST) on Wednesday, Aug 21, 2024
Bulverde, TX
3-5 Years Experience
Database • Cybersecurity

The Role

The Site Reliability Engineer (SRE) will play a crucial role in ensuring the reliability, scalability, and performance of our systems and services. Working closely with cross-functional teams, the SRE will design, implement, and maintain tools and processes to monitor, manage, and automate our infrastructure. The ideal candidate is passionate about building robust and resilient systems, with a strong focus on automation and continuous improvement.

Responsibilities:

1. System Monitoring and Incident Response:

  • Design and implement monitoring solutions to detect and mitigate system issues proactively.
  • Respond to alerts and incidents promptly, troubleshoot issues, and implement effective solutions to minimize downtime.

2. Infrastructure Automation:

  • Develop and maintain automation scripts and tools to streamline deployment, configuration, and scaling of infrastructure components.
  • Implement Infrastructure as Code (IaC) practices to manage and provision infrastructure resources efficiently.

3. Performance Optimization:

  • Identify performance bottlenecks and inefficiencies in the system and work collaboratively with development teams to optimize performance.
  • Conduct capacity planning and scalability assessments to ensure our systems can handle current and future demands.

4. Reliability Engineering:

  • Design and implement fault-tolerant and resilient architectures to ensure high availability of services.
  • Conduct post-mortem analysis of incidents to identify root causes and implement preventive measures.

5. Continuous Improvement:

  • Stay current with industry best practices and emerging technologies related to site reliability and infrastructure automation.
  • Drive initiatives to continuously improve the reliability, scalability, and performance of our systems.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
  • Proven experience as a Site Reliability Engineer or DevOps Engineer, or in a similar role.
  • Proficiency in scripting and automation using languages such as Python, Bash, or PowerShell.
  • Strong understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and container orchestration technologies (e.g., Kubernetes).
  • Experience with configuration management tools (e.g., Ansible, Puppet, Chef) and version control systems (e.g., Git).
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Excellent problem-solving skills and the ability to troubleshoot complex issues in a production environment.
  • Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.

Benefits

  • Health, dental, vision, life, and short/long-term disability insurance
  • Paid vacation, holidays, and sick leave
  • Competitive compensation and opportunities for advancement
  • Retirement plan with employer contribution match
  • Welcoming, family-style corporate culture uniquely suited to fast-paced, entrepreneurial, and motivated individuals
  • One of San Antonio's "Best Places to Work" for nine consecutive years

The Company

HQ: Bulverde, TX
116 Employees
On-site Workplace
Year Founded: 1981

What We Do

For more than 40 years, Futurex has been a globally recognized provider of enterprise-class data encryption solutions. More than 15,000 customers worldwide have trusted Futurex's innovative technology to provide market-leading solutions for the secure encryption, storage, and transmission of sensitive data. Futurex maintains an unyielding commitment to offering advanced, standards-compliant data encryption solutions, including:

• Hardware Security Modules for secure, reliable data encryption, information management, and key generation
• Remote key management and injection platforms
• Certificate Authority issuance and management
• Secure, hand-held devices for configuration, management, and compliant key loading
• High availability solutions for load balancing, monitoring, and disaster recovery
• Secure storage and access of sensitive data

Throughout every facet of our organization, we maintain a focus on providing exceptional customer service, best-in-class technology, and cost-effective solutions for our customers. Our dedication to meeting the growing business needs of our global customers and partners is exhibited by the continuous expansion of our innovative products and services. Through our results-oriented engineering culture, we have provided organizations worldwide with custom solutions supporting aggressive timelines to market.
