Short Bio

I am currently an AI research scientist at Thales cortAIx-Labs, in Paris region. My research interests span explainable machine learning (XAI), causal inference, tree ensembles, decision aid, deep learning, uncertainty quantification, and active learning. I am also driven by practical applications and software development.

I have ten years of experience as a machine learning researcher and data scientist in aeronautic/aerospace and tech industries, with also a strong interest in healthcare applications. I hold a PhD in machine learning and mathematical statistics, on the topic of explainable and interpretable ML (XAI). In my research, I address both the design of new algorithms, and the theoretical analysis of their mathematical properties.

My resume is available here.


Research interests

  • Explainable Machine Learning - XAI (functional decompositions, variable importance, rule learning)
  • Causal Inference (heterogeneous treatment effects, causal forests, causal discovery)
  • Tree Ensembles (random forests, boosted trees)
  • Decision Aid (MCDA, Neur-HCI)
  • Deep Learning (CNN, VLM, GNN)
  • Uncertainty Quantification (Bayesian inference, conformal predictions)
  • Active Learning (design of experiments, Bayesian optimization)


Publications

Preprints

Published/Accepted papers

Book

Academic publications


Software

  • treehfd: A python module to explain xgboost models with the Hoeffding functional decomposition.
  • Kernax: Kernax is a package that implements KSD-based algorithms for post-processing MCMC outputs. It is based on JAX and works on CPU as well as GPU.
  • Lagun: platform providing a user-friendly interface to methods and algorithms dedicated to the exploration of numerical simulations and the analysis of datasets (design of experiments, conditional gaussian processes, sensitivity analysis, uncertainty propagation, optimization).
  • sirus (Stable and Interpretable RUle Set): a regression and classification algorithm based on random forests, which takes the form of a short list of rules. SIRUS combines the simplicity of decision trees with a predictivity close to random forests.
  • shaff (SHApley eFfects via random Forests): a fast and accurate algorithm to estimate Shapley effects.
  • sobolMDA: the Sobol-MDA is a variable importance measure for random forests, fixing the flaws of Breiman’s MDA.
  • vimp-causal-forests: a variable importance measure for causal forests.


Talks

  • October 2025, MobiliT.AI 2025, Toulouse, France.
  • March 2024, SIAM UQ 2024, Trieste, Italy.
  • December 2023, NeurIPS@Paris, Sorbonne Université, Paris.
  • December 2023, 2nd Nice Workshop on Interpretability, Laboratoire J.A. Dieudonné, Université Côte d’Azur, Nice.
  • August 2023, EcoSta 2023, Tokyo, Japan.
  • March 2023, CSMA Junior, Paris.
  • December 2022, CMStatistics 2022, King’s College, London, UK.
  • March 2022, AISTATS 2022, virtual conference.
  • March 2022, Séminaire de Probabilités et Statistiques, Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay, Orsay.
  • January 2022, Séminaire de Probabilités et Statistiques, Laboratoire J.A. Dieudonné, Université Côte d’Azur, Nice.
  • September 2021, École Thématique sur les Incertitudes en Calcul Scientifique (ETICS2021), Erdeven, France.
  • July 2021, ENBIS Workshop: Interpretability of Industry 4.0, Naples, Italy.
  • June 2021, 52èmes Journées de Statistiques 2021, Nice, France.
  • April 2021, GDR Mascot-Num, virtual conference.
  • April 2021, SIMPAS Group meeting, Centre de Mathématiques Appliquées, Ecole Polytechnique, Saclay, France.
  • April 2021, AISTATS 2021, virtual conference.
  • June 2019, 51èmes Journées de Statistiques 2019, Nancy, France.


Teaching

  • Instructor in statistics and optimization for mechanical engineers, 3-day sessions multiple times a year since 2018 (machine learning, uncertainty quantification, design of experiments, constrained optimization…)