@@ -5,7 +5,7 @@ These datasets have been publised in [*Recognition and information extraction in
The 3 datasets are called "Generic dataset", "Belleville", and "Chaussée d'Antin" and contains lines made from the extracted rows of census tables from 1926. Each table in the Paris census contains 30 rows, thus each page in these datasets corresponds to 30 lines.
This repository is a Git LFS repository containing the image files, the labels are stored in [another repository](https://github.com/Shulk97/POPP-datasets/).
This repository is a Git LFS repository containing the image files, the labels are stored in [another repository](https://github.com/Shulk97/POPP-datasets/). The datasets are also available [on Zenodo](https://zenodo.org/record/6581158).
The scructure of each dataset is the following:
- double-pages : images of the double pages
...
...
@@ -28,6 +28,7 @@ The split for the *Generic Dataset* and *Belleville* have been made at the doubl