UB-Mannheim/reichsanzeiger-gt: 1.0.0
| Item Type: | Dataset |
|---|---|
| Title: | UB-Mannheim/reichsanzeiger-gt: 1.0.0 |
| Alternative Title: | Reichsanzeiger-GT: Ground truth OCR dataset for German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–1945) |
| Date: | 15 November 2023 |
| Creator: | Schmidt, Thomas (ORCID: 0000-0003-3620-3355); Kamlah, Jan (ORCID: 0000-0002-0417-7562); Weil, Stefan (ORCID: 0000-0002-0524-9898); Shigapov, Renat (ORCID: 0000-0002-0331-2558) |
| Divisions: | Zentrale Einrichtungen > University Library |
| DDC Classification: | 004 Computer science, internet |
| Keywords: | OCR, Text recognition, Ground truth, Historical newspapers |
| Abstract: | Ground truth dataset for the German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (German Imperial Gazette and Prussian Official Gazette), which was published under changing names from 1819 to 1945 (https://digi.bib.uni-mannheim.de/periodika/reichsanzeiger/ausgaben). The ground truth is provided as PAGE-XML files, together with URLs for the corresponding newspaper scans. Use the provided Bash script to download the images. The data paper is published at https://doi.org/10.1016/j.dib.2024.110274. |
| URL: | https://madata.bib.uni-mannheim.de/445/ |
| Availability (Controlled): | Download |
| Availability: | The dataset is freely available on Zenodo (https://doi.org/10.5281/zenodo.10144428) and GitHub (https://github.com/UB-Mannheim/reichsanzeiger-gt). The data paper is published at https://doi.org/10.1016/j.dib.2024.110274. |
| DOI (External): | https://doi.org/10.5281/zenodo.10144428 ; https://doi.org/10.1016/j.dib.2024.110274 |
| Reference URL (External): | https://github.com/UB-Mannheim/reichsanzeiger-gt |

| File | Filename / Infos | Link |
|---|---|---|
| Archive | Filename: reichsanzeiger-gt-1.0.0.zip | Download (26 MB) |

| Notes: | This dataset is originally stored at GitHub (https://github.com/UB-Mannheim/reichsanzeiger-gt) and archived at Zenodo: Schmidt, T., Kamlah, J., Weil, S., & Shigapov, R. (2023). UB-Mannheim/reichsanzeiger-gt: 1.0.0 (1.0.0). Zenodo. https://doi.org/10.5281/zenodo.10144428 |
|---|---|
| Depositing User: | Renat Shigapov |
| Date Deposited: | 15 Mar 2024 06:37 |
| Last Modified: | 17 Feb 2025 14:27 |
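
Since the ground truth is distributed as PAGE-XML, a reader may want to pull the line transcriptions out of a file. The following is a minimal sketch, not the authors' tooling. It assumes the usual PAGE-XML layout in which each `TextLine` element carries its transcription in a direct `TextEquiv/Unicode` child. The function names and the embedded sample document are hypothetical and are not taken from the dataset. Tags are matched by local name so that the sketch does not depend on a particular PAGE schema version.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def line_text(text_line: ET.Element) -> str:
    """Return the transcription stored directly on a TextLine element."""
    # Only direct children are inspected, so word- or glyph-level
    # TextEquiv elements nested deeper in the line are ignored.
    for child in text_line:
        if local_name(child.tag) == "TextEquiv":
            for sub in child:
                if local_name(sub.tag) == "Unicode":
                    return sub.text or ""
    return ""


def read_text_lines(page_xml: str) -> list[tuple[str, str]]:
    """Return (line id, transcription) pairs in document order."""
    root = ET.fromstring(page_xml)
    return [
        (elem.get("id", ""), line_text(elem))
        for elem in root.iter()
        if local_name(elem.tag) == "TextLine"
    ]


# Hypothetical, hand-written sample document (assumption: not from the dataset).
SAMPLE = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="example.jpg" imageWidth="100" imageHeight="100">
    <TextRegion id="r1">
      <TextLine id="l1">
        <TextEquiv><Unicode>Deutscher Reichsanzeiger</Unicode></TextEquiv>
      </TextLine>
      <TextLine id="l2">
        <TextEquiv><Unicode>und Preußischer Staatsanzeiger</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""

if __name__ == "__main__":
    for line_id, text in read_text_lines(SAMPLE):
        print(line_id, text)
```

To apply this to a real file from the archive, read the file contents into a string and pass it to `read_text_lines`. File names and directory layout inside the archive are not described on this page, so check them against the repository.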