The data consists of 2.402 pages of lists of references from books and chapters together with the labeled boxes for each entry in the list of references. The XML files contain the coordinates of the boxes and for each box a label (box or incomplete).