Skip to content
Snippets Groups Projects
Commit 957ae4b1 authored by Thomas CONSTUM's avatar Thomas CONSTUM
Browse files

Update README.md

parent ca74d710
Branches
Tags
No related merge requests found
......@@ -5,7 +5,7 @@ These datasets have been publised in [*Recognition and information extraction in
The 3 datasets are called "Generic dataset", "Belleville", and "Chaussée d'Antin" and contains lines made from the extracted rows of census tables from 1926. Each table in the Paris census contains 30 rows, thus each page in these datasets corresponds to 30 lines.
This repository is a Git LFS repository containing the image files, the labels are stored in [another repository](https://github.com/Shulk97/POPP-datasets/).
This repository is a Git LFS repository containing the image files, the labels are stored in [another repository](https://github.com/Shulk97/POPP-datasets/). The datasets are also available [on Zenodo](https://zenodo.org/record/6581158).
The scructure of each dataset is the following:
- double-pages : images of the double pages
......@@ -28,6 +28,7 @@ The split for the *Generic Dataset* and *Belleville* have been made at the doubl
| Generic | 3840 (128 pages)| 480 (16 pages) | 480 (16 pages)| 80 |
| Belleville | 1140 (38 pages)| 150 (5 pages) | 180 (6 pages)| 1 |
| Chaussée d'Antin | 625 | 78 | 77 | 10 |
## Generic dataset
- This dataset is made 4800 annotated lines extracted from 80 double pages of the 1926 Paris census.
......
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment