Microformat, Microdata and RDFa data from the November 2015 Common Crawl web corpus. We found structured data within 541 million HTML pages out of the 1.77 billion pages contained in the crawl (30%).These pages originate from 2.72 million different pay-level-domains out of the 14.41 million pay-level-domains covered by the crawl (19%). Altogether, the extracted data sets consist of 24.38 billion RDF quads.