@@ -5,7 +5,7 @@ These datasets have been published in *Recognition and information extraction in
...
@@ -5,7 +5,7 @@ These datasets have been published in *Recognition and information extraction in
The 3 datasets are called "Generic dataset", "Belleville", and "Chaussée d'Antin" and contain lines made from the extracted rows of census tables from 1926. Each table in the Paris census contains 30 rows, thus each page in these datasets corresponds to 30 lines.
The 3 datasets are called "Generic dataset", "Belleville", and "Chaussée d'Antin" and contain lines made from the extracted rows of census tables from 1926. Each table in the Paris census contains 30 rows, thus each page in these datasets corresponds to 30 lines.
This repository is a Git LFS repository which contains the image files.
This repository is a Git LFS repository which contains the image files; the annotations are stored in [another repository](https://github.com/Shulk97/POPP-datasets).
The double pages were scanned at a resolution of 200 dpi and saved as PNG images with 256 gray levels.
The double pages were scanned at a resolution of 200 dpi and saved as PNG images with 256 gray levels.
The line and page images are shared in the TIFF format, also with 256 gray levels.
The line and page images are shared in the TIFF format, also with 256 gray levels.