About me

I am currently a postdoctoral researcher at LISN (Université Paris-Saclay), working on the evaluation of LLMs for clinical applications in the PARTAGES project.

My research lies at the intersection of Natural Language Processing and health, with a focus on the extraction and structuring of clinical information from unstructured text, and on ensuring the reliability and trustworthiness of AI systems in real-world clinical applications.

I am particularly interested in the evaluation of LLMs in clinical contexts, the design of robust benchmarks, and the development of methods that combine linguistic knowledge, statistical learning, and domain expertise.

Research Interests

Biomedical NLP Information Extraction Clinical Text Generation LLM Evaluation

Evaluation of LLMs for Clinical Text Processing

Understanding how and when LLMs can be reliably used for clinical information extraction and generation.

Model Usage

Applicability of LLMs to information extraction and generation tasks, including prompt design and the role of external knowledge and context.

Evaluation

Design of robust evaluation frameworks combining quantitative metrics and qualitative analysis, including LLM-as-a-judge approaches.

Reliability

Calibration, uncertainty estimation, and bias analysis in clinical settings.

Data

Construction of reliable ground truth datasets in complex and costly clinical annotation environments.

Description

This research investigates the use of LLMs for clinical text processing, with a particular focus on their applicability to information extraction and generation tasks. Key questions include how to effectively prompt LLMs for clinical applications, the role of external knowledge and contextual information, and the extent to which these models can be reliably deployed in practice.

A central aspect of this work is the evaluation of LLM outputs: designing robust evaluation frameworks that combine quantitative metrics and qualitative analysis, exploring the use of LLMs as evaluators (LLM-as-a-judge), and addressing challenges related to calibration, uncertainty estimation, and bias. Particular attention is given to the construction of reliable ground truth data in clinical settings, where annotation is costly and inherently complex.

Information Extraction in the Biomedical Domain

Extracting structured clinical knowledge from heterogeneous biomedical texts.

Tasks

Named entity recognition, relation extraction, and normalization to medical ontologies.

Data Sources

Scientific literature, electronic health records, and medical prescriptions, each with different linguistic challenges.

Models

Transformer-based architectures, LLMs, and knowledge graph integration.

Linguistics

Leveraging lexical, syntactic, and semantic features to improve extraction performance.

Description

This line of research focuses on the extraction of structured information from biomedical and clinical texts, including tasks such as named entity recognition, relation extraction, and normalization to medical ontologies. It considers a variety of data sources, ranging from scientific literature to electronic health records and medical prescriptions, each with distinct linguistic and domain-specific challenges.

The work explores the use of different modeling approaches, including transformer-based architectures, large language models, and knowledge graph integration. It also investigates the contribution of linguistic features (lexical, syntactic, and semantic) to improve extraction performance. A key objective is to better understand what types of information can be reliably extracted, under which conditions, and with what level of accuracy for downstream clinical use.

Professional Experience

Postdoctoral Researcher — LISN, Paris-Saclay

04/2026 – current
  • Participation in the PARTAGES project
  • LLM evaluation for different clinical use cases

Data Scientist — Doctolib

11/2025 – 03/2026
  • Development and calibration of LLM evaluation scorers for clinical summaries
  • Prompt engineering and A/B testing for information extraction & structuration
  • Guidelines design for annotation campaigns for clinical data
  • Deployment of LLM-based systems on AWS, GCP, ArgoCD

Data Scientist — Posos

04/2024 – 05/2025
  • Fine-tuning transformer models for biomedical NER
  • Entity linking with medical terminologies (e.g. SNOMED, LOINC)
  • Development of production API for automatic medical annotation
  • Authored scientific and popular science articles on biomedical NLP

PhD Researcher — LORIA (Université de Lorraine, CNRS, Inria)

11/2019 – 12/2023

Teaching

Temporary Teaching and Research Associate

Total HEQTD: 165.5h

Télécom Nancy — LORIA (Université de Lorraine, CNRS, Inria)

Nov 2022 – Aug 2023
C Programming TD: 24h • TP: 2h • 1st year
Data Structures TD: 40h • TP: 4h • 1st year
Assemblers TP: 14h • 1st year
Projects supervising HEQTD: 5h • 1st year
Artificial Intelligence TP: 4h • 2nd year
Data Mining & Knowledge Extraction CM: 15h • TD: 50h • TP: 12h • 3rd year

Publications

2023

Scientific Activities

Co-organizer CORIA-TALN 2026

TAL@Santé Workshop

Natural Language Processing for healthcare

Co-organizer CORIA-TALN 2025

MLP-LLM Workshop

Medical Language Processing in the era of Large Language Models

Scientific Committee & Annotation AACL-IJCNLP 2025

SHROOM-CAP: Shared Task

Hallucinations and Related Observable Overgeneration Mistakes in Crosslingual Analyses of Publication