Offers “IBM”

Expires soon IBM

Site Reliability Engineer

  • Bengaluru (Bangalore Urban)
  • IT development

Job description

Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.

Your Role and Responsibilities
Site Reliability Engineering (SRE) professionals are engineers who specialize in reliability and resiliency with the right mix of knowledge and skills in software and systems, responsible to analyze business needs, problem determination, advise & design, build, test, deploy, changes and maintenance of a well-engineered information system and ecosystems.

Duties of SRE:
·  Build, deploy, and manage solutions for company-wide infrastructure challenges.
·  Ensure high availability for underlying platforms and infrastructure for application use.
·  Provide engineering teams with tooling and guidance to monitor their service availability against pre-determined SLOs.
·  Implement monitoring and alerting in our production environments.
·  Identify, evaluate, and recommend opportunities for automation.
·  Collaborate with other software engineers around patterns and practices for highly available, fault tolerant, and resilient applications.

Required Technical and Professional Expertise

·  5 – 11 years of relevant industry experience
·  Proven level Linux Skills with a strong background in troubleshooting
·  2+ years overall experience in Operations, Production, Development, or Engineering experience.
·  Cloud experience; IaaS and PaaS for on-prem and/or public clouds (IBM, AWS, or Azure)
·  Fast learner of technology and willing to experiment to find the best solutions
·  Monitoring & Event Management of complex systems
·  Specific skills in one of the technical SRE Squads
·  Compute (Power Servers)
·  Storage (FlashSystem 9000 products)
·  Network (Enterprise class Cisco and SAN)
·  Infrastructure Monitoring
·  Operations Lead (Cloud & Technology expertise)

Preferred Technical and Professional Expertise

·  Expert Level Linux Skills with a strong background in troubleshooting
·  2-3 years of Cloud experience; IaaS and PaaS for on-prem and/or public clouds (IBM, AWS, or Azure)
·  5+ years overall experience in Operations, Production, Development, or Engineering experience.

Make every future a success.
  • Job directory
  • Business directory