AIOps Engineer - AI First & Autonomous Operations (m/f/d)
Sofia, BULGARIA
Job description
Position Description:
We’re establishing a brand-new technology and innovation hub in Sofia – and you can be part of it from day one.
Here, you won’t just be shaping our services for German clients; you’ll also help us build a strong presence in the Bulgarian market. You’ll collaborate closely with our teams in Germany, bring in your ideas, and influence the tools, processes, and culture of our new Sofia location from the ground up.
In short: We’re blending the energy of a fresh start with the reliability of a global player. If you’re excited about building something new both locally and internationally, this is the place to make your mark. Apply now and be part of our success story in Sofia! #BePartofourStory
Your future duties and responsibilities:
You will be part of our journey toward closed-loop, human-less operations , where systems detect, diagnose, and remediate issues automatically—and humans step in only for novel, high risk, or policy-driven situations.
· Design, architect, and evolve AIOps platforms including:
· event ingestion & correlation
· anomaly detection & dynamic baselining
· probable root-cause analysis
· decision automation & automated remediation flows
· Reduce operational noise and improve MTTA/MTTR through smarter event processing, topology modeling, enrichment, filtering, and dependency mapping.
· Operationalize runbook automation using automation/orchestration frameworks (e.g., Ansible, Rundeck, StackStorm, Jenkins , or cloud-native automation tools).
This includes dry-run capabilities, safe-guards, approvals, and full audit trails.
· Integrate AIOps with ITSM processes/tools (e.g., ServiceNow, Jira Service Management, or equivalent), ensuring seamless incident/problem/change workflows.
· Build and maintain service models / topologies that support RCA, impact analysis, and dependency-aware operations.
· Implement observability & reliability standards :
SLIs/SLOs, error budgets, golden signals, health dashboards, automated insights after incidents.
· Develop and operate iPaaS or integration flows (e.g., MuleSoft, Boomi, Workato, SnapLogic) connecting monitoring, ITSM, ChatOps, CI/CD, and business systems.
· Champion AI-first delivery , leveraging LLMs for:
· incident summarization and contextualization
· knowledge grounding/RAG
· agentic workflows and operational copilots
Always with a focus on safety, governance, and measurable outcomes.
· Connect the ecosystem through agents, collectors, APIs/webhooks, and data pipelines across cloud and on-prem infrastructure.
Required qualifications to be successful in this role:
· 3+ years of experience in SRE, DevOps, Platform Engineering, Observability, or Production Operations .
· Practical experience with AIOps/Monitoring/Observability ecosystems , ideally including at least two of:
· Event management platforms
· Monitoring/metrics/logging/tracing stacks (Prometheus, Grafana, Elastic, Splunk, Dynatrace, Datadog, New Relic, etc.)
· CMDB, discovery or service mapping tools
· Automation & orchestration solutions
· Experience running self-managed or hybrid platforms :
· Linux administration
· containers, Kubernetes/OpenShift
· networking fundamentals
· TLS/certificates, backups/DR
· Strong automation skills: Python and/or Bash , JSON/YAML, API integrations, Git-based pipelines.
· Experience with iPaaS or integration platforms , including error handling, transformations, retries, secrets management, and governance.
· Solid grounding in ITIL/ITSM (Incident/Problem/Change) and Site Reliability Engineering principles.
· Excellent English communication skills ; German is a plus or willingness to learn.
Nice to have:
· Experience with AIOps-specific or enterprise automation platforms (including BMC solutions, on-prem or cloud-based).
· MLOps or data engineering exposure (Kafka, event-driven architectures, stream processing, model monitoring).
· Experience modernizing or building service maps/topology models.
· ChatOps and on-call tooling (PagerDuty, Opsgenie, MS Teams/Slack integrations).
· Certifications such as:
· Kubernetes (CKA/CKAD)
· Cloud (AWS/Azure/GCP)
· ITIL v4
· Integration/iPaaS certifications
What We Offer:
· Team Culture: You’ll find colleagues here who make collaboration enjoyable. We interact openly, use first names across all levels, and don’t think in hierarchies or silos.
· Contract & Working Time: Permanent, full-time contract with a standard 8-hour workday.
· Flexible Work Setup: Enjoy the best of both worlds with our hybrid work model – combining office presence in Sofia with remote work from home.
· Working Hours: Flexible working hours tailored to project needs and client requirements.
· Leave Policy: 25 days of paid vacation.
· Volunteer Opportunities: Make a positive impact with 8 hours of paid leave for voluntary work.
· Career Development: Benefit from clear career progression paths and opportunities to grow within the company.
· Continuous Learning: Access our internal learning platform, CGI Academia, to shape your skills based on your interests. Learn up to 9 languages via goFLUENT.
· Referral Rewards: Extend your network – refer friends and earn bonuses through our rewarded referral program.
· Health & Wellbeing: Access our Health and Wellbeing Portal for resources that support your physical and mental health.
· Workplace Equipment: Receive a budget for noise-cancelling headphones, ergonomic office equipment, and optical glasses for screen work.
· Office Environment: Work in a state-of-the-art co-working space in Sofia, with complimentary coffee, tea, and water to keep you refreshed throughout the day.
Skills:
· English
What you can expect from us:
Together, as owners, let’s turn meaningful insights into action.
Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…
You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.
Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.
You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.
Come join our team—one of the largest IT and business consulting services firms in the world.