Replication data for: Validating estimates of latent traits from textual data using human judgement as a benchmark