Web Data Commons - Web Table Corpus 2012 / Relational Data
| Item Type: | Dataset |
|---|---|
| Title: | Web Data Commons - Web Table Corpus 2012 / Relational Data |
| Alternative Title: | Web relational table corpus extracted from the August 2012 Common Crawl |
| Date: | August 2012 |
| Creator: |
Bizer, Christian ORCID: 0000-0003-2367-0237 ; Meusel, Robert ; Ristoski, Petar ; Paulheim, Heiko ; Lehmberg, Oliver ; Diete, Alexander ; Heist, Nicolas ; Krstanovic, Sascha ; Knöller, Thorsten Andre
|
| Divisions: | School of Business Informatics and Mathematics > Information Systems V: Web-based Systems (Bizer 2012-) |
| DDC Classification: |
004 Computer science, internet |
|---|---|
| Keywords: | web tables ; relational tables |
| Abstract: | The subset consists of 147 million relational tables. In relational tables, a set of entities is described with one or more attributes. |
| URL: | https://madata.bib.uni-mannheim.de/210/ |
|---|---|
| DOI: | https://doi.org/10.7801/210 |
| Access (Controlled): | Download |
| Access: | Web Tables in CSV format and metadata in JSON format. The relational corpus 2012 is available here : http://webdatacommons.org/webtables/2012/downloadInstructions.html |
| Related Publication(s) in MADOC: | Lehmberg Oliver und Ritze Dominique und Meusel Robert und Bizer Christian (2016), A large public corpus of web tables containing time and context metadata |
| External URL for Other Related Materials: |
http://webdatacommons.org/webtables/2012/relationa...
http://webdatacommons.org/webtables/2012/downloadI... |
| Project: |
Project Title: Web Data Commons - Web Tables Project Description: The Web contains vast amounts of HTML tables. Most of these tables are used for layout purposes, but a fraction of the tables is also quasi-relational, meaning that they contain structured data describing a set of entities, and are thus useful in application contexts such as data search, table augmentation, knowledge base construction, and for various NLP tasks. The WDC Web Tables data set consists of millions of relational Web tables that are contained in HTML tables found in the Common Crawl. |
Full text not available from this repository.
| Date Deposited: | 15 May 2017 15:53 |
|---|---|
| Last Modified: | 05 Mar 2024 13:56 |
You have found an error? Please let us know about your desired correction here: E-Mail
Actions (login required)
![]() |
View Item |

