Expert Meeting on Statistical Data Confidentiality

Name: Expert Meeting on Statistical Data Confidentiality
Start: 2025-10-15T08:45:00+02:00
End: 2025-10-17T16:00:00+02:00
Location: Poblenou Campus Auditorium

15–17 Oct 2025

Poblenou Campus Auditorium

Europe/Zurich timezone

Chris Jones

jonesc@un.org

A formal model for reasoning about output disclosure risks and mitigations

16 Oct 2025, 12:20

14m

In-Person

Poblenou Campus Auditorium, Barcelona, Spain

Poblenou Campus Auditorium

Roc Boronat, 138 08018 Barcelona

Machine Learning and Artificial Intelligence versus Disclosure Control

Jim Smith (University of the West of England Bristol)

The practice of Output Statistical Disclosure Control has developed largely by consensus, a situation which is being challenged by a number of factors. First of these is the almost Cambrian Explosion in the number and scope of Trusted Research Environments as many domains move away from the ‘download’ model of enabling research. A second challenge is the accompanying proliferation of different forms of outputs requested (including AI models trained on sensitive data). The final driver is a growth in tools for (semi) automated assistance in the OSDC process, which because they arise from different domains, often differ in the types of risks they check for, and the range of mitigations they apply.
In this paper we describe the development of a formal specification of queries, risks, and mitigations. This leverages the taxonomy described in the ‘Statbarn’ framework, but also provides a basis for encompassing the risks posed by machine learning models. This specification has several features that we hope will assist the OSDC community. First, it provides easy-to-understand graphical representations that we hope will spark debate, encourage consensus-building, and be useful for training purposes.
Second, it uses an extension of the W3C ‘Data Privacy Vocabulary’ that means it is both human-readable and machine-actionable. We will describe how this has been used to create a ‘reference implementation’ via a refactoring of the SACRO toolkit.
Third, it creates a rigourous basis for making systematic and grounded comparisons between various OSDC tools (such as Tau/Mu-Argus, SACRO, DataSHIELD etc) and the mitigations offered by various ‘privacy preserving’ technologies.

Jim Smith (University of the West of England Bristol) Dr Trupti Padiya (University of the West of England) Prof. Felix Ritchie (University of the West of England) Dr Elizabeth Green (University of the West of England) Dr Amy Tilbrook (Edinburgh University)

SDC2025_Sc_UnivWoE_Smith.pdf

Expert Meeting on Statistical Data Confidentiality

Chris Jones

A formal model for reasoning about output disclosure risks and mitigations

Poblenou Campus Auditorium

Speaker

Description

Authors

Presentation materials