Site Reliability Engineer Manager
Heredia (Heredia) IT development
Job description
Introduction
The IBM Software is seeking a talented and motivated SRE Manager professional to lead and manage a team of engineers focused on a global Cloud Platform solution servicing multiple IBM offering.
Your Role and Responsibilities
• Manage and lead a team of SRE engineers. This involves hiring, training, and mentoring team members, assigning tasks, setting goals, and conducting performance evaluations.
• Provide guidance to the engineering team on architectural directions
• Empower the engineering team to achieve a high level of technical productivity, reliability, and simplicity
• Collaborate across various engineering teams, product, and other cross-functional stakeholders to effectively deliver the best solutions
Overall, an SRE Manager plays a crucial role in aligning engineering and operations to achieve reliable software systems. Combine technical expertise with leadership and management skills to drive continuous improvement and ensure high-quality service delivery.
Required Technical and Professional Expertise
• 3+ years of experience in managing DevOps and SRE engineers
• 5+ years of experience in DevOps, SRE, or related roles, with a focus on cloud-based infrastructure (AWS, Azure, and IBM Cloud) and automation
• Linux system administration
• Experience hiring and retaining top talent
• Problem solving and incident management
• A genuine enjoyment of learning about how things work, with the ability to ask engineers good questions about architecture and product decisions
• Extreme customer focus, committed to investing in partnerships with other engineering teams to establish empathy and understand their use-cases
Preferred Technical and Professional Expertise
• AWS Certification(s), Kubernetes, Redhat/Openshift, and Ansible
• Experience on IBM Cloud
• Kubernetes
• SOC2 Controls