Monitoring Tools Engineer
Bangalore, India
Job description
Position Description:
Monitoring Tools Engineer
A monitoring tools engineer designs, implements, and maintains systems that monitor the performance and health of networks, applications, and infrastructure. Key responsibilities include configuring monitoring tools, creating dashboards and alerts, automating processes, analyzing data to identify issues, and providing technical support and documentation. The role requires strong technical skills in scripting, observability (metrics, logs, traces), and a range of operating systems and cloud environments.
Core responsibilities
· Tool management: Install, configure, and maintain monitoring tools and platforms across different environments (e.g., cloud, on-premises).
· Monitoring and alerting: Establish comprehensive monitoring to track system performance, application availability, and infrastructure health, including setting up actionable alerts.
· Automation and scripting: Develop scripts and automated processes to streamline tasks like agent deployment, data collection, and reporting.
· Data analysis: Analyze metrics, logs, and traces to identify performance bottlenecks, troubleshoot issues, and perform root cause analysis for incidents.
· Incident response: Act as a point of escalation for monitoring-related issues, collaborating with other teams to resolve major incidents and ensure proper documentation and follow-up.
· Documentation: Create and maintain detailed documentation, standard operating procedures (SOPs), and knowledge base articles.
· Collaboration and support: Work with application and infrastructure teams to define monitoring requirements, integrate observability into CI/CD pipelines, and provide training and support to other teams.
· Reporting: Generate and distribute performance and status reports based on collected monitoring data.
Required skills and qualifications
· Technical expertise: Experience with monitoring tools such as SolarWinds, Azure native tools, Infoblox for DDI (DNS, DHCP, IPAM), and Orion; operating systems (Windows, Linux); networking protocols (TCP/IP, SNMP); and cloud platforms (AWS, Azure).
· Scripting and automation: Proficiency in scripting languages such as Python or PowerShell.
· Observability: Experience with Application Performance Monitoring (APM) and the "three pillars of observability": metrics, logs, and traces.
· Troubleshooting: Strong analytical and problem-solving skills to diagnose and resolve complex technical issues.
· Communication: Excellent verbal and written communication skills for interacting with technical teams and stakeholders.
· Soft skills: Ability to work independently, manage time effectively, and collaborate with others in a team environment.
· Certifications: ITIL, Windows, or cloud-related certifications are a plus.
Skills:
· Network
What you can expect from us:
Together, as owners, let’s turn meaningful insights into action.
Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…
You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.
Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.
You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.
Come join our team—one of the largest IT and business consulting services firms in the world.