News

Contact

Head

Prof. Dr. Wolfram Wingerath

Office

Claudia Martsfeld

Mailing Address

Arbeitsgruppe Data Science
Universität Oldenburg
Fakultät II – Department für Informatik
Escherweg 2, 26121 Oldenburg, Germany

News

It is our pleasure to announce that our paper „Similarity-driven Schema Transformation for Test Data Generation“ has been accepted for publication and presentation at this year’s EDBT 2022.

The preprint is available here.

Abstract

A flexible and versed generation of test data is an important aspect in benchmarking algorithms for data integration. This includes the generation of heterogeneous schemas, each representing another data source of the integration benchmark. In this paper, we present our ongoing research on a novel approach for similarity-driven generation of schemas, which takes as input an arbitrary dataset, extracts its schema, and derives a set of output schemas from it. In contrast to previous solutions, we do not focus on structural transformations of relational or XML schemas, but extend the scope to contextual transformations and NoSQL data models, where the required schema information is often only implicitly defined within the data and must first be extracted. In addition, we utilize a novel method that generates multiple schemas based on user-defined heterogeneity constraints making the generation process configurable even for non-experts.

(Changed: 11 Feb 2026) | Kurz-URL:Shortlink: https://uol.de/p92184n6384en

Sprache wechseln

Change Language

Hell-/Dunkelmodus

Light mode / Dark mode

News

Contact

Head

Office

Mailing Address

News

EDBT 2022 Research Paper on Test Data Generation for Duplicate Detection

Abstract