Data set for reproduction purposes (tabular data is stored using Apache's Parqet format).