Site Reliability Engineer
Bangalore, India
Job description
Position Description:
Site Reliability Engineer
Project context
As a Site Reliability Engineer (SRE) working in a 24/7 shift rotation, you will be responsible for ensuring the reliability, availability, and performance of critical systems and services. You will combine strong technical skills with operational excellence to proactively monitor, troubleshoot, and resolve issues. Your expertise in observability will help maintain robust monitoring, alerting, and incident response processes, ensuring seamless operations around the clock.
This role requires working 24/7 shifts on a monthly rotation.
Goals and deliverables
➔ 24/7 Operations & Incident Management
◆ Monitor production systems and services using observability tools (logs, metrics, traces, dashboards).
◆ Respond to incidents, alerts, and outages in real time, ensuring rapid resolution and minimal impact.
◆ Participate in a rotating on-call schedule, providing support during nights, weekends, and holidays.
➔ Observability & Monitoring
◆ Design, implement, and maintain observability solutions (e.g., Prometheus, Grafana, ELK, and similar tools).
◆ Develop and refine dashboards, alerts, and automated health checks for critical infrastructure and applications.
◆ Analyze system performance and reliability data across the full stack, from infrastructure to application layers, to identify trends and prevent future incidents.
➔ Technical Operations
◆ Collaborate with development, infrastructure, application, and security teams to ensure system reliability and scalability.
◆ Automate operational tasks and incident response processes using scripting and configuration management tools.
◆ Document procedures, runbooks, and incident reports for knowledge sharing and continuous improvement.
➔ Continuous Improvement
◆ Conduct post-incident reviews and root cause analysis to drive improvements in reliability and response.
◆ Propose and implement enhancements to monitoring, alerting, and operational processes.
Education and experience
● Bachelor's degree in Information Technology, Computer Science, Business Administration, or a related field. A master's degree or relevant certifications would be a plus.
● 2 to 5 years of experience in cloud engineering and operations engineering
● Proven experience with Azure services; experience with AWS and GCP is an advantage
● Hands-on experience with Infrastructure-as-Code (IaC) tools such as Terraform.
● Strong scripting skills in Python, Bash or PowerShell for automation tasks
● Familiarity with GitLab CI/CD tools and experience integrating them with Azure
● Proficiency in monitoring and logging tools, such as native cloud tools, OpenMetrics, and OpenTelemetry
Skills and behavioral competencies
● Excellent problem solving and troubleshooting abilities
● Result orientation, influence & impact
● Empowerment & accountability with the ability to work independently
● Team spirit, building relationships, collective accountability
● Excellent oral and written communication skills for documenting and sharing information with technical and non-technical stakeholders
Language skills - Fluent English
Skills:
· Infrastructure as Code
· DevOps
· Python
What you can expect from us:
Together, as owners, let’s turn meaningful insights into action.
Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…
You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.
Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.
You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.
Come join our team—one of the largest IT and business consulting services firms in the world.