Short Bio

I am currently an AI research scientist at Thales Research, in Paris region. My research interests span explainable machine learning (XAI), causal inference, tree ensembles, decision aid, deep learning, uncertainty quantification, and active learning. I am also driven by practical applications and software development.

I have ten years of experience as a machine learning researcher and data scientist in aeronautic/aerospace and tech industries, with also a strong interest in healthcare applications. I hold a PhD in machine learning and mathematical statistics, on the topic of explainable and interpretable ML (XAI). In my research, I address both the design of new algorithms, and the theoretical analysis of their mathematical properties.

My resume is available here.


Research interests

  • Explainable Machine Learning - XAI (variable importance, functional decomposition, rule learning)
  • Causal Inference (heterogeneous treatment effects, causal forests, causal discovery)
  • Tree Ensembles (random forests, boosted trees)
  • Decision Aid (MCDA, Choquet integral)
  • Deep Learning (GNN, Neur-HCI)
  • Uncertainty Quantification (Bayesian inference, conformal predictions)
  • Active Learning (design of experiments, Bayesian optimization)


Publications

Preprints

Published/Accepted papers

Book

Academic publications


Software

  • Kernax: Kernax is a package that implements KSD-based algorithms for post-processing MCMC outputs. It is based on JAX and works on CPU as well as GPU.
  • Lagun: platform providing a user-friendly interface to methods and algorithms dedicated to the exploration of numerical simulations and the analysis of datasets (design of experiments, conditional gaussian processes, sensitivity analysis, uncertainty propagation, optimization).
  • sirus (Stable and Interpretable RUle Set): a regression and classification algorithm based on random forests, which takes the form of a short list of rules. SIRUS combines the simplicity of decision trees with a predictivity close to random forests.
  • shaff (SHApley eFfects via random Forests): a fast and accurate algorithm to estimate Shapley effects.
  • sobolMDA: the Sobol-MDA is a variable importance measure for random forests, fixing the flaws of Breiman’s MDA.
  • vimp-causal-forests: a variable importance measure for causal forests.


Talks

  • March 2024, SIAM UQ 2024, Trieste, Italy.
  • December 2023, NeurIPS@Paris, Sorbonne Université, Paris.
  • December 2023, 2nd Nice Workshop on Interpretability, Laboratoire J.A. Dieudonné, Université Côte d’Azur, Nice.
  • August 2023, EcoSta 2023, Tokyo, Japan.
  • March 2023, CSMA Junior, Paris.
  • December 2022, CMStatistics 2022, King’s College, London, UK.
  • March 2022, AISTATS 2022, virtual conference.
  • March 2022, Séminaire de Probabilités et Statistiques, Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay, Orsay.
  • January 2022, Séminaire de Probabilités et Statistiques, Laboratoire J.A. Dieudonné, Université Côte d’Azur, Nice.
  • September 2021, École Thématique sur les Incertitudes en Calcul Scientifique (ETICS2021), Erdeven, France.
  • July 2021, ENBIS Workshop: Interpretability of Industry 4.0, Naples, Italy.
  • June 2021, 52èmes Journées de Statistiques 2021, Nice, France.
  • April 2021, GDR Mascot-Num, virtual conference.
  • April 2021, SIMPAS Group meeting, Centre de Mathématiques Appliquées, Ecole Polytechnique, Saclay, France.
  • April 2021, AISTATS 2021, virtual conference.
  • June 2019, 51èmes Journées de Statistiques 2019, Nancy, France.


Teaching

  • Instructor in statistics and optimization for mechanical engineers, 3-day sessions multiple times a year since 2018 (machine learning, uncertainty quantification, design of experiments, constrained optimization…)