Entry Level Site Reliability Engineer
Heredia (Heredia) IT development
Job description
Introduction
As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you will have a unique opportunity to gain first-hand experience with the latest technologies and be supported by a global team of IBMers to grow your own technical skills and develop your career. This role may require some attention after normal working hours in order to troubleshoot and resolve production issues experienced by clients. This role may also require shifting working hours as needed basis for follow-the-sun coverage.
Your Role and Responsibilities
As a SRE Engineer, you will:
•Be the point of contact for clients, responsible for the management and ownership of client service requests.
•Identify and investigate issues, using troubleshooting techniques in order to provide advice and guidance to clients.
•Work in a global team located in US, Canada, Ireland, China, India, and Australia, collaborating with IBMers to share recommendations, solutions and ideas.
•Look for enhancements and innovative solutions to help the services scale and improve existing technical support tools, procedures, or processes.
•Develop and enhance your technical knowledge via projects and assignments, as well as through IBM’s world class learning platform.
•Be on on-duty rotation including weekend and holiday support as needed basis”.
Required Technical and Professional Expertise
- 0-2 years of experience in 1a software development and delivery role
- 0-2 years of experience in Cloud/DevOps engineering and/or Linux administration
- Familiar with at least one major public cloud provider or large scale private/hybrid cloud using container orchestration
- Production familiarity with one or more monitoring frameworks (Nagios, Prometheus, etc.)
- Familiar with source control management such (git, subversion, etc.)
- Understanding of software development life cycle and delivery process
- Ability to manage multiple projects, while ensuring that commitments and timetables are met
- Ability to partner with internal stakeholders to design operational solutions
- Goal oriented, forward thinker that can provide solutions for complex technical problems
- Grafana, New Relic, Prometheus, Datagod..
Preferred Technical and Professional Expertise
-Production Kubernetes/OpenShift experience strongly preferred.
- Experience with change management workflows.
- Experience with ELK/EFK stack (ElasticSearch, Logstash/Fluentd, and Kibana).
- Experience with distributed event streaming platform (Kafka, etc).
- Experience with SQL and/or NoSQL datastores (DB2 and Oracle data services).
- Familiarity with application load balancing concepts (F5, ELB, etc)”.