Applied Bioinformatics Group

Lars C Gleim, Md. R Karim, Lukas Zimmermann, Holger Stenzhorn, Stefan Decker, and Oya Beyan (2018)

Schema Extraction for Privacy Preserving Processing of Sensitive Data

In: MEPDaW-SeWeBMeDA-SWeTI 2018 Joint Proceedings of MEPDaW, SeWeBMeDA and SWeTI 2018, vol. 2112, pp. 36-47. CEUR Workshop Proceedings.

Lars Christoph Gleim, Md. Rezaul Karim, Lukas Zimmermann, Oliver Kohlbacher, Holger Stenzhorn, Stefan Decker, Oya Beyan

Sharing privacy-sensitive data across organizational boundaries is commonly not a viable option due to legal and ethical restrictions. Regulations such as the EU General Data Protection Regulation (GDPR) impose strict requirements concerning the protection of personal data. Therefore, new approaches are emerging that use data directly in their original repositories without giving third parties direct access, such as the Personal Health Train initiative [16]. Circumventing limitations of previous systems, this paper proposes an automated schema extraction approach compatible with existing Semantic Web-based technologies. The extracted schema enables ad-hoc query formulation against privacy-sensitive data sources without requiring data access, and subsequent execution of that query in a secure enclave under the data provider's control. The developed approach permits us to extract structural information from non-uniform resources and merge it into a single schema to preserve the privacy of each data source. Initial experiments show that our approach overcomes the reliance of previous approaches on agreeing a priori upon a shared schema and encoding, in favor of more flexible schema extraction and introspection.
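The extract-and-merge idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: it assumes the data are available as plain (subject, predicate, object) tuples, that classes are declared with an `rdf:type` predicate, and that any object without a declared class is treated as a literal. The names `extract_schema`, `merge_schemas`, and the `ex:` identifiers are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

RDF_TYPE = "rdf:type"  # assumed marker for class membership


def extract_schema(triples):
    """Reduce instance triples to (subject class, predicate, object kind) patterns.

    Only structural information is kept; identifiers and values are dropped.
    """
    # First pass: collect the declared classes of every typed resource.
    types = defaultdict(set)
    for s, p, o in triples:
        if p == RDF_TYPE:
            types[s].add(o)

    # Second pass: replace each subject and object by its class or literal kind.
    schema = set()
    for s, p, o in triples:
        if p == RDF_TYPE:
            continue
        subject_classes = types.get(s) or {"UNTYPED"}
        if isinstance(o, str) and o in types:
            object_kinds = types[o]          # typed resource: use its classes
        else:
            object_kinds = {type(o).__name__}  # assumption: anything else is a literal
        for sc in subject_classes:
            for ok in object_kinds:
                schema.add((sc, p, ok))
    return schema


def merge_schemas(*schemas):
    """Union per-source schemas so no pattern can be traced to one source."""
    return set().union(*schemas)


# Two hypothetical data sources with different structure.
site_a = [
    ("ex:p1", RDF_TYPE, "ex:Patient"),
    ("ex:p1", "ex:age", 54),
    ("ex:p1", "ex:diagnosis", "ex:d1"),
    ("ex:d1", RDF_TYPE, "ex:Diagnosis"),
]
site_b = [
    ("ex:p9", RDF_TYPE, "ex:Patient"),
    ("ex:p9", "ex:name", "Jane Doe"),
]

merged = merge_schemas(extract_schema(site_a), extract_schema(site_b))
```

A query author would see only `merged`, which contains patterns such as `("ex:Patient", "ex:age", "int")` but no patient identifiers or values, and could formulate a query against it before the query is executed on the provider's side.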