Laura Zanella

About me

I am currently a postdoctoral researcher at LISN (Université Paris-Saclay), working on the evaluation of LLMs for clinical applications in the PARTAGES project.

My research lies at the intersection of Natural Language Processing and health, with a focus on the extraction and structuring of clinical information from unstructured text, and on ensuring the reliability and trustworthiness of AI systems in real-world clinical applications.

I am particularly interested in the evaluation of LLMs in clinical contexts, the design of robust benchmarks, and the development of methods that combine linguistic knowledge, statistical learning, and domain expertise.

Research Interests

Biomedical NLP Information Extraction Clinical Text Generation LLM Evaluation

Evaluation of LLMs for Clinical Text Processing

Understanding how and when LLMs can be reliably used for clinical information extraction and generation.

Model Usage

Applicability of LLMs to information extraction and generation tasks, including prompt design and the role of external knowledge and context.

Evaluation

Design of robust evaluation frameworks combining quantitative metrics and qualitative analysis, including LLM-as-a-judge approaches.

Reliability

Calibration, uncertainty estimation, and bias analysis in clinical settings.

Data

Construction of reliable ground truth datasets in complex and costly clinical annotation environments.

Description

This research investigates the use of LLMs for clinical text processing, with a particular focus on their applicability to information extraction and generation tasks. Key questions include how to effectively prompt LLMs for clinical applications, the role of external knowledge and contextual information, and the extent to which these models can be reliably deployed in practice.

A central aspect of this work is the evaluation of LLM outputs: designing robust evaluation frameworks that combine quantitative metrics and qualitative analysis, exploring the use of LLMs as evaluators (LLM-as-a-judge), and addressing challenges related to calibration, uncertainty estimation, and bias. Particular attention is given to the construction of reliable ground truth data in clinical settings, where annotation is costly and inherently complex.

Information Extraction in the Biomedical Domain

Extracting structured clinical knowledge from heterogeneous biomedical texts.

Tasks

Named entity recognition, relation extraction, and normalization to medical ontologies.

Data Sources

Scientific literature, electronic health records, and medical prescriptions, each with different linguistic challenges.

Models

Transformer-based architectures, LLMs, and knowledge graph integration.

Linguistics

Leveraging lexical, syntactic, and semantic features to improve extraction performance.

Description

This line of research focuses on the extraction of structured information from biomedical and clinical texts, including tasks such as named entity recognition, relation extraction, and normalization to medical ontologies. It considers a variety of data sources, ranging from scientific literature to electronic health records and medical prescriptions, each with distinct linguistic and domain-specific challenges.

The work explores the use of different modeling approaches, including transformer-based architectures, large language models, and knowledge graph integration. It also investigates the contribution of linguistic features (lexical, syntactic, and semantic) to improve extraction performance. A key objective is to better understand what types of information can be reliably extracted, under which conditions, and with what level of accuracy for downstream clinical use.

Professional Experience

Postdoctoral Researcher — LISN, Paris-Saclay

04/2026 – current

Participation in the PARTAGES project
LLM evaluation for different clinical use cases

Data Scientist — Doctolib

11/2025 – 03/2026

Development and calibration of LLM evaluation scorers for clinical summaries
Prompt engineering and A/B testing for information extraction & structuration
Guidelines design for annotation campaigns for clinical data
Deployment of LLM-based systems on AWS, GCP, ArgoCD

Data Scientist — Posos

04/2024 – 05/2025

Fine-tuning transformer models for biomedical NER
Entity linking with medical terminologies (e.g. SNOMED, LOINC)
Development of production API for automatic medical annotation
Authored scientific and popular science articles on biomedical NLP

PhD Researcher — LORIA (Université de Lorraine, CNRS, Inria)

11/2019 – 12/2023

Domain: Biomedical AI, NLP, Information Extraction
Thesis: Biomedical Event Extraction based on Transformers and Knowledge Graphs

Teaching

Temporary Teaching and Research Associate

Total HEQTD: 165.5h

Télécom Nancy — LORIA (Université de Lorraine, CNRS, Inria)

Nov 2022 – Aug 2023

C Programming TD: 24h • TP: 2h • 1st year

Data Structures TD: 40h • TP: 4h • 1st year

Assemblers TP: 14h • 1st year

Projects supervising HEQTD: 5h • 1st year

Artificial Intelligence TP: 4h • 2nd year

Data Mining & Knowledge Extraction CM: 15h • TD: 50h • TP: 12h • 3rd year

Publications

2025

Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection

15th Language Resources and Evaluation Conference (LREC)

Automatic Posology Structuration: What role for LLMs?

32nd MLP-LLM / CORIA-TALN

Is Mistral’s Confidence Justified? Assessing Self-Evaluation in Biomedical QA

32nd CORIA-TALN

2023

How Much do Knowledge Graphs Impact Transformer Models for Extracting Biomedical Events?

61st Annual Meeting of the Association for Computational Linguistics (ACL)

Adding Linguistic Information to Transformer Models Improves Biomedical Event Detection?

18th Conference on Computer Science and Intelligence Systems (FedCSIS)

Fine-tuning Pre-trained Transformer Language Models for Biomedical Event Trigger Detection

DL4NLP - Extraction et Gestion de Connaissances (EGC)

Scientific Activities

Co-organizer CORIA-TALN 2026

TAL@Santé Workshop

Natural Language Processing for healthcare

Co-organizer CORIA-TALN 2025

MLP-LLM Workshop

Medical Language Processing in the era of Large Language Models

Scientific Committee & Annotation AACL-IJCNLP 2025

SHROOM-CAP: Shared Task

Hallucinations and Related Observable Overgeneration Mistakes in Crosslingual Analyses of Publication

About me

Research Interests

Evaluation of LLMs for Clinical Text Processing

Model Usage

Evaluation

Reliability

Data

Description

Information Extraction in the Biomedical Domain

Tasks

Data Sources

Models

Linguistics

Description

Professional Experience

Postdoctoral Researcher — LISN, Paris-Saclay

Data Scientist — Doctolib

Data Scientist — Posos

PhD Researcher — LORIA (Université de Lorraine, CNRS, Inria)

Teaching

Temporary Teaching and Research Associate

Publications

2025

Confabulations from ACL Publications (CAP): A Dataset for Scientific Hallucination Detection

Automatic Posology Structuration: What role for LLMs?

Is Mistral’s Confidence Justified? Assessing Self-Evaluation in Biomedical QA

2023

How Much do Knowledge Graphs Impact Transformer Models for Extracting Biomedical Events?

Adding Linguistic Information to Transformer Models Improves Biomedical Event Detection?

Fine-tuning Pre-trained Transformer Language Models for Biomedical Event Trigger Detection

Scientific Activities