SOTAB V2 for SemTab 2023
| Item Type: | Dataset |
|---|---|
| Title: | SOTAB V2 for SemTab 2023 |
| Date: | 2023 |
| Creator: |
Korini, Keti ORCID: 0000-0002-2158-0070 ; Peeters, Ralph ORCID: 0000-0003-3174-2616 ; Bizer, Christian ORCID: 0000-0003-2367-0237
|
| Divisions: | School of Business Informatics and Mathematics > Information Systems V: Web-based Systems (Bizer 2012-) School of Business Informatics and Mathematics > Information Systems III: Enterprise Data Analysis (Ponzetto 2016-) |
| DDC Classification: |
004 Computer science, internet |
|---|---|
| Abstract: | SOTAB V2 for SemTab 2023 includes datasets used to evaluate Column Type Annotation (CTA) and Columns Property Annotation (CPA) systems in the 2023 edition of the SemTab challenge. The datasets for both rounds of the challenge were down-sampled from the full train, test and validation splits of the SOTAB V2 (WDC Schema.org Table Annotation Benchmark version 2) benchmark, so that the datasets of the first round have a smaller vocabulary of 40 and 50 labels for CTA and CPA respectively corresponding to easier/more general domains, and the datasets of the second round include the full vocabulary size of 80 and 105 labels and are therefore considered to be harder to annotate. The columns and the relationships between columns are annotated using the Schema.org and DBpedia vocabulary. SOTAB V2 for SemTab 2023 contains the splits used in Round 1 and Round 2 of the challenge. Each round includes a training, validation and test split together with the ground truth for the test splits and the vocabulary list. The ground truth of the test sets of both rounds are manually verified. |
| External Identifier for Data: | https://doi.org/10.5281/zenodo.8422037 |
| URL: | https://madata.bib.uni-mannheim.de/531/ |
|---|---|
| Access (Controlled): | Only Metadata |
| License (Controlled): | Creative Commons: CC-BY | Attribution 4.0 (recommended) |
Full text not available from this repository.
| Date Deposited: | 13 Mar 2026 12:38 |
|---|---|
| Last Modified: | 13 Mar 2026 12:38 |
You have found an error? Please let us know about your desired correction here: E-Mail
Actions (login required)
![]() |
View Item |

