CompERBench: A collection of 21 complete benchmark tasks for entity matching.
Item Type: | Dataset |
---|---|
Title: | CompERBench: A collection of 21 complete benchmark tasks for entity matching. |
Alternative Title: | CompERBench: Complementing Entity Matching Benchmark Tasks |
Date: | 31 July 2020 |
Creator: | Primpeli, Anna ; Bizer, Christian ORCID: 0000-0003-2367-0237 |
Divisions: | School of Business Informatics and Mathematics > Wirtschaftsinformatik V (Bizer) |
DDC Classification: |
004 Computer science, internet |
---|---|
Keywords: | entity matching, benchmarking, reproducibility |
Abstract: | Entity Matching is the task of determining which records from different data sources describe the same real-world entity. It is an important task for data integration and has been the focus of many research works. A large amount of entity matching tasks for benchmarking have been developed and made publicly available for evaluating, comparing, reproducing and showing the strengths of different matching methods. However, the lack of fixed development and test sets, correspondence sets including both matching and non-matching record pairs as well as baseline results, hinders reproducibility and comparability. In an effort to enhance the reproducibility and comparability of matching methods, we complement existing benchmark tasks for entity matching with fixed development and test sets. We provide 21 complete benchmark tasks for entity matching for public download. The selected tasks are highly diverse and include data sets of different sizes, amounts of attributes, density, attribute data types as well as number of sources from which the originate. |
URL: | https://madata.bib.uni-mannheim.de/348/ |
---|---|
DOI: | https://doi.org/10.7801/348 |
Availability (Controlled): | Download |
Availability: | You can download our datasets by navigating to: http://data.dws.informatik.uni-mannheim.de/benchmarkmatchingtasks/index.html#toc2 |
Publication(s) (MADOC): |
Primpeli, Anna und Bizer, Christian (2020), Profiling entity matching benchmark tasks |
DOI (External): |
https://doi.org/10.1145/3340531.3412781 |
Reference URL (External): |
http://data.dws.informatik.uni-mannheim.de/benchma... |
File | Filename / Infos | Link |
---|---|---|
Archive
Filename: compERbench.zip |
Download (132MB)
|
Depositing User: | Anna Primpeli |
---|---|
Date Deposited: | 23 Nov 2020 10:17 |
Last Modified: | 29 Feb 2024 20:34 |
Actions (login required)
View Item |