The data consists of 515 pages of lists of references from books and chapters together with the labeled boxes for each entry in the list of references. The XML files contain the coordinates of the 10.722 boxes and for each box a label (box or incomplete).