Alternative title
ACROBAT
Creator/Principal investigator(s)
Mattias Rantalainen
- Karolinska Institutet, Department of Medical Epidemiology and Biostatistics
Johan Hartman
- Karolinska Institutet, Department of Oncology-Pathology
Description
The ACROBAT data set consists of 4,212 whole slide images (WSIs) from 1,153 female primary breast cancer patients. The WSIs in the data set are available at 10X magnification and show tissue sections from breast cancer resection specimens stained with hematoxylin and eosin (H&E) or immunohistochemistry (IHC). For each patient, one WSI of H&E stained tissue and at least one one, and up to four, WSIs of corresponding tissue stained with the routine diagnostic stains ER, PGR, HER2 and KI67 are available. The data set was acquired as part of the CHIME study (chimestudy.se) and its primary purpose was to facilitate the ACROBAT WSI registration challenge (acrobat.grand-challenge.org). The histopathology slides originate from routine diagnostic pathology workflows and were digitised for research purposes at Karolinska Institutet (Stockholm, Sweden). The image acquisition process resembles the routine digital pathology image digitisation workflow, using three different Hamamatsu WSI scanners, specifically one NanoZoomer S360 and two NanoZoomer XR. The WSIs in this data set are accompanied by a data ta
... Show more..Language
English
Research principal
Responsible department/unit
Department of Medical Epidemiology and Biostatistics
Contributor(s)
Leena Latonen
- University of Eastern Finland, Institute of Biomedicine
Constance Boissin
- Karolinska Institutet, Department of Medical Epidemiology and Biostatistics
Yanbo Feng
- Karolinska Institutet, Department of Medical Epidemiology and Biostatistics
Philippe Weitz
- Karolinska Institutet, Department of Medical Epidemiology and Biostatistics
Dusan Rasic
- Zealand University Hospital, Department of Surgical Pathology
Data contains personal data
No
Ethics Review
Stockholm - Ref. 2017/2106-31
Amendment: 2018/1462-32
Unit of analysis
Population
Anonymised female primary breast cancer patients from the Stockholm region
Study design
Observational study
Sampling procedure
Time period(s) investigated
2012 – 2018
Geographic spread
Geographic location: Stockholm County
Research area
SCIENCE AND TECHNOLOGY, Information technology
(CESSDA Topic Classification)
Medical Image Processing, Medical and Health Sciences, Cancer and Oncology
(The Swedish standard of fields of research 2011)
Weitz, P. et al., (2022). ACROBAT -- a multi-stain breast cancer histological whole-slide-image data set from routine diagnostics for computational pathology. doi:10.48550/ARXIV.2211.13621
DOI:
https://doi.org/10.48550/ARXIV.2211.13621
If you have published anything based on these data, please notify us with a reference to your publication(s). If you are responsible for the catalogue entry, you can update the metadata/data description in DORIS.
Download data
Description
The data set consists of three subsets, the training, validation and test set, based on the ACROBAT WSI registration challenge. There are 750 cases in the training set, for each of which one H&E WSI and one to four IHC WSIs are available, with 3406 WSIs in total. The validation set consists of 100 cases with 200 WSIs in total and the test set of 303 cases with 606 WSIs in total. Both for the validation and test set, one H&E WSI as well as one randomly selected IHC WSI is available.
WSIs were a
Version 1
https://doi.org/10.48723/w728-p041
Citation
Download citation
Data format / data structure
Still image
Creator/Principal investigator(s)
Mattias Rantalainen
- Karolinska Institutet, Department of Medical Epidemiology and Biostatistics
Johan Hartman
- Karolinska Institutet, Department of Oncology-Pathology
Number of individuals/objects
1153