Published on in Vol 10 (2024)

This is a member publication of University of Toronto

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/50379, first published .
Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis

Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis

Generating Contextual Variables From Web-Based Data for Health Research: Tutorial on Web Scraping, Text Mining, and Spatial Overlay Analysis

Pablo Galvez-Hernandez   1, 2 , PhD ;   Angelina Gonzalez-Viana   3 , PhD ;   Luis Gonzalez-de Paz   4, 5 , PhD ;   Ketan Shankardass   6, 7 , PhD ;   Carles Muntaner   1, 8 , PhD

1 Lawrence S Bloomberg Faculty of Nursing, University of Toronto, Toronto, ON, Canada

2 Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada

3 Public Health Agency of Catalonia, Health Department, Barcelona, Spain

4 Primary Healthcare Transversal Research Group, Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain

5 Consorci d'Atenció Primària de Salut Barcelona Esquerra, Barcelona, Spain

6 Department of Heath Sciences, Wilfrid Laurier University, Waterloo, ON, Canada

7 MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, St Michael’s Hospital, Toronto, ON, Canada

8 Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada

Corresponding Author:

  • Pablo Galvez-Hernandez, PhD
  • Institute of Health Policy, Management and Evaluation
  • Dalla Lana School of Public Health
  • University of Toronto
  • Health Sciences Building, 4th Fl.
  • 155 College St
  • Toronto, ON, M5T 3M6
  • Canada
  • Phone: 1 6475752195
  • Email: pau.galvez@utoronto.ca