loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Rainer Schnell and Sarah Redlich

Affiliation: University of Duisburg-Essen, Research Methodology Group, Forsthausweg 2, 47057 Duisburg and Germany

Keyword(s): Administrative Data, Web Data, Mortality, Undercoverage, Big Data, Obituaries.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Since access to real-world data is often tedious, web scraping has gained popularity. A health research example is the monitoring of mortality rates. We compare the results of local online death notices and print-media obituaries to administrative mortality data. The web scraping process and its problems are being described. The resulting estimates of death rates and demographic characteristics of the deceased are statistically different from known population values. Scaped data resulted in a sample that is more male, older and contains less foreign nationals. Therefore, using web scraped data instead of administrative data cannot be recommended for the estimation of death rates at this time for Germany.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.147.42.168

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Schnell, R. and Redlich, S. (2019). Web Scraping Online Newspaper Death Notices for the Estimation of the Local Number of Deaths. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - HEALTHINF; ISBN 978-989-758-353-7; ISSN 2184-4305, SciTePress, pages 319-325. DOI: 10.5220/0007382603190325

@conference{healthinf19,
author={Rainer Schnell. and Sarah Redlich.},
title={Web Scraping Online Newspaper Death Notices for the Estimation of the Local Number of Deaths},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - HEALTHINF},
year={2019},
pages={319-325},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007382603190325},
isbn={978-989-758-353-7},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - HEALTHINF
TI - Web Scraping Online Newspaper Death Notices for the Estimation of the Local Number of Deaths
SN - 978-989-758-353-7
IS - 2184-4305
AU - Schnell, R.
AU - Redlich, S.
PY - 2019
SP - 319
EP - 325
DO - 10.5220/0007382603190325
PB - SciTePress