Offers “CEA”

New CEA

STAGE: Self-explainable model for audio identification of bird species H/F

  • Stage
  • Grenoble (Isère)
  • IT development

Job description

Vacancy details

General information

Organisation

The French Alternative Energies and Atomic Energy Commission (CEA) is a key player in research, development and innovation in four main areas :
• defence and security,
• nuclear energy (fission and fusion),
• technological research for industry,
• fundamental research in the physical sciences and life sciences.

Drawing on its widely acknowledged expertise, and thanks to its 16000 technicians, engineers, researchers and staff, the CEA actively participates in collaborative projects with a large number of academic and industrial partners.

The CEA is established in ten centers spread throughout France

Reference

2024-32911

Description de l'unité

Among other activities, CEA LIST Software Safety and Security Laboratory (LSL) research teams design and implement automated analysis in order to make software systems more trustworthy, to exhaustively detect their vulnerabilities, to guarantee conformity to their specifications, and to accelerate their certification. The lab recently extended its activities on the topic of AI trustworthiness and gave birth to a new research group: AISER (Artificial Intelligence Safety, Explainability and Robustness).

Position description

Category

Mathematics, information, scientific, software

Contract

Internship

Job title

STAGE: Self-explainable model for audio identification of bird species H/F

Subject

Self-explainable model for audio identification of bird species H/F

Contract duration (months)

5-6 months

Job description

Through the recent developments of AI, the use of models produced by machine learning has become widespread, even in industrial settings. However, studies are flourishing showing the dangers that such models can bring, in terms of safety, privacy or even fairness. To mitigate these dangers and improve trust in AI, one possible avenue of research consists in designing methods for generating *explanations* of the model behaviour. Such methods, regrouped under the umbrella term "eXplainable AI" (XAI), empower the user by providing them with relevant information to make an informed choice to trust the model (or not).

In particular, rather than attempting to explain the model behaviour after it has been
trained (post-hoc explanations), some XAI methods propose to enforce explainability
constraints directly during the design phase of the machine learning process, resulting
in so-called self-explainable models. In this regard, case-based reasoning is currently
considered a viable alternative to more opaque “black box” convolutional neural networks (CNN), with works such as (Chen et al. 2019) or (Nauta, Bree, and Seifert 2021). In case-based reasoning, new instances of a problem are solved by drawing comparisons with examples encountered before, such that the decision is taken and motivated by the fact that the new instance resembles some known cases (also called prototypes).
CEA-LIST has already developed an open-source library for case-based reasoning net-
works, called CaBRNet(Xu-Darme et al. 2024), and wish to extend its application to new domains and modalities.

This internship focuses on the use of machine learning models for the recognition and
classification of bird songs. Audio clips are often encoded in the form of spectrograms, i.e. 2D representations of the intensity of the signal at various frequencies, across a given period of time. Since spectograms can be interpreted as images, a common practice consists in processing them using deep CNNs originally designed for computer vision (Kahl et al. 2021). Hence, the goal of the internship is to extend the case-based reasoning approach to this new task, by adapting existing computer vision methods to learn audio prototypes. In particular, the new approach will take into account the temporal specificities of audio samples. Indeed, contrary to computer vision models which are spacially invariant (the nature of an object remains identical regardless of its position inside the image), spatial location is crucial in spectograms as it corresponds to different frequency ranges and different periods of times.
In practice, the internship will be split in several subtasks as follows:

  • Establish a baseline using the reference BirdNET model.
  • Identify a body of existing works on self-explainable models for audio classification
  • Design and train a case-based reasoning model for audio classifiation, using the CaBRNet framework.

Methods / Means

Explainable AI, Case-based Reasoning, Convolutional Neural Networks

Applicant Profile

As it is not realistic to be expert in machine-learning, computer vision and XAI, we encourage candidates that do not meet the full qualification requirements to apply nonetheless. We strive to provide an inclusive and enjoyable workplace. We are aware of discriminations based on gender (especially prevalent on our fields), race or disability, we are doing our best to fight them.

Minimal qualifications:

  • Master student or equivalent (2nd/3rd engineering school year) in computer science
  • knowledge of Python and the Pytorch framework
  • ability to work in a team, some knowledge of version control

Preferred:

  • notions of AI and neural networks
  • notions of Computer Vision
  • notions of explainable AI

Position location

Site

Grenoble

Job location

France, Auvergne-Rhône-Alpes, Isère (38)

Location

Grenoble

Candidate criteria

Languages

  • English (Fluent)
  • French (Fluent)

Prepared diploma

Bac+5 - Master 2

Recommended training

Master degree in AI, Machine-Learning or Data Science

PhD opportunity

Oui

Requester

Position start date

01/02/2025


Make every future a success.
  • Job directory
  • Business directory