Published on in Vol 11 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/62939, first published .
Leveraging Administrative Health Databases to Address Health Challenges in Farming Populations: Scoping Review and Bibliometric Analysis (1975-2024)

Leveraging Administrative Health Databases to Address Health Challenges in Farming Populations: Scoping Review and Bibliometric Analysis (1975-2024)

Leveraging Administrative Health Databases to Address Health Challenges in Farming Populations: Scoping Review and Bibliometric Analysis (1975-2024)

Authors of this article:

Pascal Petit1 Author Orcid Image ;   Nicolas Vuillerme1, 2 Author Orcid Image

Original Paper

1Laboratoire AGEIS, Université Grenoble Alpes, La Tronche Cedex, France

2Institut Universitaire de France, Paris, France

Corresponding Author:

Pascal Petit, PhD

Laboratoire AGEIS

Université Grenoble Alpes

Bâtiment Jean Roget, UFR de Médecine

Domaine de La Merci

La Tronche Cedex, 38706

France

Phone: 33 4 76 63 71 04

Email: pascal.petit@univ-grenoble-alpes.fr


Background: Although agricultural health has gained importance, to date, much of the existing research relies on traditional epidemiological approaches that often face limitations related to sample size, geographic scope, temporal coverage, and the range of health events examined. To address these challenges, a complementary approach involves leveraging and reusing data beyond its original purpose. Administrative health databases (AHDs) are increasingly reused in population-based research and digital public health, especially for populations such as farmers, who face distinct environmental risks.

Objective: We aimed to explore the reuse of AHDs in addressing health issues within farming populations by summarizing the current landscape of AHD-based research and identifying key areas of interest, research gaps, and unmet needs.

Methods: We conducted a scoping review and bibliometric analysis using PubMed and Web of Science. Building upon previous reviews of AHD-based public health research, we conducted a comprehensive literature search using 72 terms related to the farming population and AHDs. To identify research hot spots, directions, and gaps, we used keyword frequency, co-occurrence, and thematic mapping. We also explored the bibliometric profile of the farming exposome by mapping keyword co-occurrences between environmental factors and health outcomes.

Results: Between 1975 and April 2024, 296 publications across 118 journals, predominantly from high-income countries, were identified. Nearly one-third of these publications were associated with well-established cohorts, such as Agriculture and Cancer and Agricultural Health Study. The most frequently used AHDs included disease registers (158/296, 53.4%), electronic health records (124/296, 41.9%), insurance claims (106/296, 35.8%), population registers (95/296, 32.1%), and hospital discharge databases (41/296, 13.9%). Fifty (16.9%) of 296 studies involved >1 million participants. Although a broad range of exposure proxies were used, most studies (254/296, 85.8%) relied on broad proxies, which failed to capture the specifics of farming tasks. Research on the farming exposome remains underexplored, with a predominant focus on the specific external exposome, particularly pesticide exposure. A limited range of health events have been examined, primarily cancer, mortality, and injuries.

Conclusions: The increasing use of AHDs holds major potential to advance public health research within farming populations. However, substantial research gaps persist, particularly in low-income regions and among underrepresented farming subgroups, such as women, children, and contingent workers. Emerging issues, including exposure to per- and polyfluoroalkyl substances, biological agents, microbiome, microplastics, and climate change, warrant further research. Major gaps also persist in understanding various health conditions, including cardiovascular, reproductive, ocular, sleep-related, age-related, and autoimmune diseases. Addressing these overlooked areas is essential for comprehending the health risks faced by farming communities and guiding public health policies. Within this context, promoting AHD-based research, in conjunction with other digital data sources (eg, mobile health, social health data, and wearables) and artificial intelligence approaches, represents a promising avenue for future exploration.

JMIR Public Health Surveill 2025;11:e62939

doi:10.2196/62939

Keywords



Background

Public health research seeks to identify and understand the factors that influence population health to effectively prevent diseases and promote health and well-being for all [1,2]. A broad range of environmental determinants can impact health across the life span. One of the core areas of public health research, known as the exposome, investigates how cumulative environmental influences contribute to disease etiology and pathogenesis [3-18]. The exposome, which complements genomic research, refers to the comprehensive examination of all environmental exposures experienced throughout an individual’s lifetime, including physical, chemical, biological, psychosocial, and behavioral factors, from conception to death [3-18]. The exposome classically includes 3 overlapping domains: the general external exposome (eg, climate and built environment); the specific external exposome (eg, chemical exposure, lifestyle, and occupations); and the internal exposome (eg, aging, oxidative stress, metabolism, and gut microbiome) [8,14,16,17,19,20]. Understanding the exposome is crucial for enabling both population-wide and precision prevention [3,21-23]. However, fully describing the exposome is challenging due to the vast diversity and the temporal and spatial variability of environmental factors [3]. Public health research in this area requires data on both risk factors and adverse health outcomes to progress effectively [3,14,24,25].

The volume of data collected has grown exponentially as the world becomes increasingly reliant on technology and digitization [26,27]. Data are omnipresent in our everyday lives, leading science toward data-driven research [27,28], in particular in the health field. The digital transformation in health care has enabled unprecedented data availability, collection, storage, and analysis capabilities, leading to a paradigm shift in health care systems, with entire care pathways becoming digitized [29,30]. Health-related data now represent approximately 6% of all digital data globally, a figure that continues to rise [31]. This explosion of data has transformed research, providing new opportunities, especially in public health, to enhance disease understanding and evaluate intervention effectiveness [27,28,32-35]. The integration of digital technologies and digital data in public health has led to the emergence of “digital public health,” an evolving field focused on using digital data to achieve public health goals [33,36-41]. Public health research is moving from isolated data systems to more integrated, accessible, and reusable data resources [42]. Reusing data allows researchers to explore various health determinants, including environmental, occupational, behavioral, and organizational factors, fostering a holistic approach to disease prevention and health promotion strategies [14].

Within the digital public health framework, 2 main types of data are being used, namely primary and secondary data. Primary data are tailor-made, designed for a specific purpose, and often used once or repeatedly for the same goal [43-45]. Primary data are the cornerstone of traditional public health policy and decision-making. These data are derived from several types of studies [46-50], in particular observational cohorts (eg, the Framingham Cohort study [51,52]) [46-50,53,54], case-control studies [46-50], cross-sectional surveys (eg, the China Health and Retirement Longitudinal Study [55,56]) [46-50,53], and experimental studies [46-50]. Primary data have many advantages [46-50]. They are rich, of high quality, and are designed to answer specific research questions for public health and epidemiological purposes. Primary data are usually available at the individual level and are derived from studies that control for certain biases. By contrast, they are cumbersome, time-consuming, and costly to set up and maintain [53,54,57]. The representativeness of primary data is also limited in size, geographic scope, and temporal coverage and can erode with time [46-50,53]. Primary data are not free from bias, such as selection, healthy worker, recall, or prevarication biases [53,58].

Unlike traditional public health, digital public health does not rely solely on primary data but takes advantage of the myriad of existing digital data that have not been generated originally for research purposes (ie, secondary data) to overcome some limitations intrinsic to primary data and complement them [28,43,44,53,59-68]. Indeed, some data can have an additional impact when used beyond the context for which they were originally created [68,69]. Secondary data are collected for purposes other than public health or epidemiology and include contextual data (eg, air quality and climate data) [14,24,26,29,70-74], person-generated data (eg, social media, crowdsourcing, and mobile health) [2,24,26,31,43,61,62,73,75-86], synthetic data (eg, digital twin) [87-91], and administrative health databases (AHDs) [26,64,68,81,92-102].

AHD is a broad term encompassing a wide range of routinely collected data on individuals’ health and sociodemographic information collected for registration, billing, record keeping, and other administrative purposes [26,64,81,93,95,98,100-102]. For this review, based on previous works [93,95,96,103-105], AHDs included population registers, claims databases, disease registers, electronic health or medical records, and hospital discharge databases that were collected at a local, regional, national, or international level (Table 1) [26,61,62,93,95,96,100,103-110].

Table 1. Definition and characteristics of administrative health databases included in this review.

Population registerClaims databaseDisease registerElectronic health or medical recordHospital discharge database
DefinitionDigital sociodemographic information on the residents of a countryRoutinely collected digital information on individual data regarding reimbursement, records of health services, medical procedures, prescriptions, and medical diagnosesA continuous and exhaustive digital collection of individual data regarding 1 or more health events in a geographically defined populationSystematized digital record of a patient’s medical information collected in real timeDigital records of service use with information about patients, their care, and their stay in the hospital
SourceLocal or national authoritiesInsurance programs or schemes and health care providersHealth care institutions (eg, hospitals)Hospitals, physicians, health care centers, and institutionsHospitals
PopulationAll individuals residing in a countryAll individuals covered by an insurance program or schemeAll individuals diagnosed with a specific health event in a population on a geographically defined scaleAll patients using the health care systemAll patients from a hospital
Purpose or finalityFor the administrative purposes of government: to provide reliable informationTo store financial and administrative information for medical insurers’ and providers’ useFor clinical and research purposes: to collect information about people diagnosed with a specific health eventFor clinical and billing purposes: to document patients’ clinical conditionFor billing or accounting purposes
Health eventNoneHealth events covered by insurance or a health care providerSpecific health events (eg, cancer)Health events requiring care that are reported in medical recordsHealth events from hospital admission

AHDs offer many advantages for research. Such data are collected as part of routine administrative processes, reducing additional costs for researchers. Therefore, AHDs offer relatively inexpensive access to a large number of individuals who can be tracked with time for several years, guaranteeing the representativeness of the populations studied [26,54,78,93,95,104,105,111-114]. Data recorded within AHDs are structured, coded in a standardized way, and less affected by participation and recall biases [54,58,95,106,113,115]. AHDs enable the study of rare events and populations underrepresented in studies using only primary data [95,111-113]. AHDs have limitations inherent to their nature, such as the absence of some confounding factors, the limited granularity of certain information, the data complexity, and confidentiality issues [73,78,93,95,115-125].

Rationale

AHDs are increasingly used in population-based health research due to their complementarity with traditional sources of public health and epidemiological data (ie, primary data) [42,59,64,87,93,95,96,126-128]. The reuse of AHDs, referring to their application beyond their original or intended purpose, holds major potential to advance public health and epidemiological research, offering insights that can guide public health decision-making [42,59,64,87,93,95,96,105,126-131]. Although several reviews have previously explored the general use of AHDs in research [42,95,107,129,132-135], others have focused on their application within specific countries [96,136], examined individual AHDs [108,137], or investigated their role in studying specific diseases and adverse health outcomes [44,93,104,138-142]. However, to the best of our knowledge, no study has synthesized how AHDs are reused for epidemiological and public health research within a specific population group.

To address this gap, we conducted a comprehensive scoping review and bibliometric analysis aimed at identifying how AHDs are used to address health issues in a specific population. We selected farming populations as an illustrative example because they present unique health and disease patterns [143-147]. Globally, approximately 27% of the workforce is engaged in occupational farming, and this group is exposed to numerous risk factors (ie, exposomes), including pesticides, biological agents, and limited access to health care [148]. These exposures put them at heightened risk for a wide range of adverse health outcomes [143,145,147,149]. Although agricultural safety and health have become a major public health issue in recent decades, most research on the health of farming populations has relied on traditional epidemiological and community-based studies, which often face limitations in terms of sample size, geographic scope, temporal coverage, and the range of health events examined [145,150,151].

In this context, AHDs offer valuable opportunities to enhance public health and epidemiological research in farming populations by providing broader insights, identifying at-risk subgroups, and informing health services and policy development [152]. The primary objectives of this scoping review were two-fold: (1) to summarize the current state of AHD-based research in farming populations by examining which types of AHDs are used and why, whether AHDs are integrated with other data sources, which farming populations have been studied, and what exposures and health outcomes have been explored and (2) to identify key areas of interest and potential research gaps and unmet needs in this field.


Overview

This scoping review was conducted and reported according to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) and evidence maps guidelines (Table S1 in Multimedia Appendix 1) [153] following a single screening approach. The protocol of this study was not registered. A 7-step procedure was used: research question formulation, identifying relevant publications, title review, abstract review, full-text review, data extraction, and data analysis.

To formulate our research question, we followed the Joanna Briggs Institute guidelines, using the population, concept, and context criteria framework [154]. Our population included all individuals engaged in farming and all individuals exposed to farming-related exposures. The concepts included all possible public health and epidemiological research works that involved the study of a health outcome of interest. The context was the use, in any setting, of at least one of the AHDs, as defined in Table 1.

Search Strategy and Selection Criteria

To develop and validate the search strategy, previous reviews that examined the reuse of AHDs for population-based research were identified and refined [93,96,103,104]. Our initial search revealed that electronic health records (EHRs) are often interchangeably referred to as electronic medical records (EMRs). A distinction between EHR and EMR is sometimes made, with EMR describing patients’ care from only 1 practice (eg, specific encounters in hospitals), which is contrary to EHR [105]. In that case, EMR serves as a data source for EHR. This distinction was not considered in this paper. In addition, to ensure comprehensiveness, the search terms were broadened by searching for their synonyms. For example, search terms such as “electronic health record,” “digital health record,” “electronic medical record,” “EHR,” or “EMR” were used as synonyms for electronic health or medical records. A total of 72 terms pertaining to 2 categories (farming and AHDs) were used (Textbox 1). The search terms were reflective of our research topic and question.

Textbox 1. Search terms.

Farming

  • husbandry* OR agriculture* OR farming OR farm* OR agricultural* OR farmworker*

Administrative health databases (combined using AND)

  • “health record” OR “health records” OR “digital record” OR “digital records” OR “health administrative register” OR “health administrative registry” OR “health register” OR “health registry” OR “medical register” OR “medical registry” OR “electronic health record” OR “electronic health records” OR “EHR” OR “EMR” OR “electronic medical record” OR “electronic medical records” OR “digital medical record” OR “digital medical records” OR “digital health record” OR “digital health records” OR “health administrative data” OR “health administrative database” OR “ health administrative dataset” OR “ health administrative datasets” OR “health administrative databases” OR “administrative health data” OR “administrative health database” OR “administrative health dataset” OR “administrative health datasets” OR “administrative health databases” OR “insurance data” OR “insurance database” OR “insurance databases” OR “insurance dataset” OR “insurance claim” OR “insurance claims” OR “cancer registry” OR “cancer register” OR “health insurance” OR “health surveillance program” OR “health surveillance programs” OR “Mutualite Sociale Agricole” OR “MSA” OR “health insurance system” OR “record-linkage” OR “population register” OR “population registry” OR “insurance scheme” OR “social security scheme” OR “hospital discharge” OR “administrative claim” OR “administrative claims” OR “medical claims” OR “medical claim” OR “electronic claim” OR “electronic claims” OR “mortality register” OR “mortality registry” OR “mortality record” OR “mortality records” OR “disease register” OR “disease registry” OR “illness register” OR “illness registry” OR “disorder register” OR “disorder registry”

To develop the eligibility criteria, an initial search of the literature was conducted on PubMed, with a review of the first 100 articles that used AHDs for public health and epidemiological research. In our pilot run, disease and morbidity registers were initially not considered as AHD because they were created for clinical and research purposes [47-50,53,76,155,156]. However, because disease registers contain some information derived from medical records, we decided to consider them as AHD for this review. The eligibility criteria are presented in Textbox 2. The search was restricted to original peer-reviewed records (all types were included) written in English or French but not constrained by the year of publication [93,106,157]. Publications that examined partly farming populations, with, for instance, studies reporting health risks for various sectors of activity, were included.

Textbox 2. Eligibility criteria for selection of publications.

Inclusion criteria for articles

  • Data had to originate at least partly from the administrative health database (AHD)
  • The study had to pertain at least partly to the farming population
  • The study had to relate to public health or epidemiological research
  • Original peer-reviewed publications
  • Publications in English or French

Exclusion criteria for articles

  • Publications not describing the use of an AHD
  • Animal or in vitro studies
  • Publications not in English or French

The final literature search was conducted on both PubMed and Web of Science Core Collection databases. Regarding the Web of Science Core Collection, a topic search was performed. To reduce the bias induced by daily database changes, all data collection (literature retrieval and data download) was conducted and completed on the same day, that is, April 15, 2024. Titles, abstracts, and full-text publications were screened based on pre-established inclusion and exclusion criteria. The inclusion criteria for each phase of the literature search are provided in Table S1 in Multimedia Appendix 1. When abstracts did not contain enough information about correspondence to inclusion or exclusion criteria, the article was considered for full-text review. Reference lists of included publications were not searched, although they might have also yielded new relevant studies.

Data Collection and Processing

A total of 29 metadata were extracted from each publication included in the scoping review (Table 2).

The data underwent rigorous manual validation, cleaning, and harmonization following a structured 5-step process. First, duplicate items (eg, keywords and institutions) were removed. Second, leading and trailing white spaces were eliminated. Third, items were standardized by converting text to lower case, with only the first letter capitalized. In the fourth step, items were harmonized to either singular or plural forms consistently. Finally, synonyms or terms with similar meanings (eg, “illness” and “disease”) were unified under a single term. For instance, “Pesticide,” “Pesticide exposure,” and “Pesticide use” were standardized to “Pesticide,” while “Pulmonary disease copd,” “Copd,” and “Chronic obstructive pulmonary disease” were unified as “COPD.” For cancer-related keywords, the International Classification of Diseases, eleventh revision, was used to consolidate varied terms (eg, “lung cancer,” “lung cancer risk,” “lung and bronchus cancer,” “lung tumor,” “lung tumour,” “lung neoplasm,” and “basal cell carcinoma of the lung”) into standard categories (eg, lung cancer). Quality appraisals were not performed because they were beyond the aim of this review [106,157].

Table 2. List of metadata of interest to collect from the literature search.
MetadataFictional example
Publication year2024
Publication typeArticle
Study nameProject X
Goal of the studyTo study the association between farming and health outcome
Study typeEcological study
Is the study nationwide?Yes
Digital data usedInsurance claims
Goal of the digital data usedTo identify farmers
Is active data used?Yes
Active data usedClinical examination
Farming exposure consideredFarming activity and pesticide compounds
Farming activities studied, n10
Pesticide compounds studied, n29
PopulationAdults
SexFemale
Participants included, n100 to 1000
CountryFrance
Oldest data used (year)1991
Most recent data used (year)2020
Data follow-up period (years)4
Years between the most recent data used and publication year, n7
Disease or health eventsParkinson disease
Authors’ namesGauthier J
Authors’ keywordsPesticide
Authors’ countryFrance
Authors’ institutionUniversité Grenoble Alpes
JournalEnvironmental Health Perspectives
Funding bodyMIAI@Grenoble Alpesa
Citations, n14

aMIAI@Grenoble Alpes: Multidisciplinary Institute in Artificial Intelligence at the Université Grenoble Alpes.

Data Analysis

To analyze the research directions (ie, hot spots and gaps) on the use of AHDs for public health and epidemiological research in farming populations, a bibliometric approach was conducted [158-160]. This analysis examined the number of publications, countries of publications, most active journals, institutions, authors, funding bodies, subject areas, citations of publications, and keywords of publications. Seven bibliometric metrics were computed, including the h-, g-, m-, and Y-indices; dominance factor; annual growth rate (AGR); and fractionalized frequency (Table S3 in Multimedia Appendix 1). The h-index attempts to measure both the productivity and citation impact of the published body of work of an entity (eg, author, institution, and journal) [161,162]. It refers to the total number of publications by a particular entity with at least the same number of citations. The m-index is calculated by dividing the h-index by the number of years of an entity’s productive life (eg, researcher) [161]. The g-index of an entity corresponds to the largest number g such that the top g publications have at least ≥g2 citations together [162]. The Y-index refers to the sum of both the total number of first-authored publications and the total number of corresponding author publications [163]. The dominance factor refers, for a particular researcher, to the proportion of multiauthored publications as a specific author’s rank to the total number of multiauthored publications [164]. The fractionalized frequency intends to reflect an author’s contribution. The AGR refers to the variable’s change in percentage as a year-over-year statistic [165]. The most up-to-date journals’ impact factors and ranks were retrieved manually using the Journal Citation Report in April 2024.

Spearman correlations were calculated to examine the association between the number of publications and gross domestic product (GDP); population size [166]; and the total labor force, the number of researchers in research and development (per million people), fertilizer consumption (in both % of fertilizer production and kilograms per hectare of arable land), agricultural land (km2), agricultural land (% of land area), land under cereal production (hectares), permanent cropland (% of land area), cereal production (metric tons), crop production index, food production index, livestock production index, cereal yield (kilogram per hectare), female individual employment in agriculture (% of female employment), male individual employment in agriculture (% of male employment), employment in agriculture (% of total employment); and agriculture, forestry, and fishing, value added (% of GDP). These country characteristics were obtained from the World Bank. The most recent country characteristic (eg, GDP) was used when available.

Research directions, including hot spots and gaps, were investigated with keyword frequency, co-occurrence (counting of paired keywords), and thematic mapping analyses. Thematic mapping and keyword co-occurrence network are 2 complementary but distinct approaches that serve different purposes and offer different insights. In summary, thematic mapping focuses more on the strategic positioning of research themes within a field, while keyword co-occurrence networks emphasize the relationships and connections between specific keywords in the literature [158,167]. Both methods complement each other and are usually used to provide a more comprehensive understanding of research landscapes. The co-occurrence of 2 keywords was defined by the frequency with which they appear together in publications and was quantified using association strength (AS) or equivalent index, calculated as , where cij is the number of publications in which keywords i and j co-occur, while ci and cj are the number of publications in which each keyword appears, respectively [158,167]. AS measures how close 2 keywords are to each other. An AS value of 1 indicates keywords always appear together, while 0 indicates they never co-occur. These keyword co-occurrences can be visualized using a co-occurrence network graph, where a vertex or node represents a keyword, the size of the node represents the keyword frequency, and the edge represents the association between 2 keywords [158,167]. On the basis of the keyword co-occurrence network graph, a community detection procedure can be used to identify groups of words highly associated with each other [158,167]. In other words, equivalent keywords based on AS can be grouped together to identify research themes [158,167]. A strategic diagram or thematic map is based on Callon centrality (x-axis) and Callon density (y-axis) [158,167]. Callon centrality measures the degree of interaction of a theme with other themes. It is defined as , where k is a keyword belonging to a theme and h is a keyword belonging to another theme [158,167]. Callon centrality can be interpreted as an indicator of the importance of a particular topic within the broader research landscape. Callon density measures the internal strength of a theme. It is defined as , where i and j are keywords belonging to the same theme and w is the total number of keywords in a theme [158,167]. Callon density serves as a metric for assessing the progression and maturation of that topic [158,167]. A strategic diagram is divided into 4 quadrants according to Callon centrality and density values, which correspond to 4 types of topics. Hot spots or hot topics are defined by both high density and high centrality values (upper-right quadrant), while basic topics are defined by high centrality but low density values (lower-right quadrant). Peripheral topics are defined by both low centrality and low density values (lower-left quadrant), while niche topics are defined by low centrality and high density values (upper-left quadrant) [158,167].

To focus on agricultural or farming exposome research, a bibliometric profile of the “farming exposome” was constructed, which restricts the exposome concept to environmental exposures specific to farming populations [152,168]. This bibliometric farming exposome picture examined co-occurrences between keywords related to potential risk factors and specific health events (eg, cancers and reproductive disorders).

The bibliometric analysis was conducted and reported according to the preliminary guideline for reporting bibliometric reviews of the biomedical literature (BIBLIO; Table S4 in Multimedia Appendix 1) [169]. All analyses were performed using R software (version 4.3.2; R Foundation for Statistical Computing) for Windows 10 (Microsoft Corporation). The bibliometric analysis was performed using the bibliometrix R package (version 4.1.4) [170].


Overview

After excluding 4485 irrelevant records, 296 publications were analyzed (Figure 1). The majority were articles (293/296, 98.9%), with a small number of reviews (2/296, 0.7%) and editorial materials (1/296, 0.3%; Table 3). Only one-third of the publications (107/296, 36.1%) were open access (Table 3 and Figure S1 in Multimedia Appendix 1).

Figure 1. PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) flowchart depicting the literature search and the evaluation process for finding relevant records. The search, conducted on April 15, 2024, in PubMed and Web of Science, had no date restrictions. AHD: administrative health database.
Table 3. Main characteristics of the included publications (N=296).
DescriptionResults
Timespan1975 to 2024
Publication type, n (%)

Article293 (99)

Review2 (0.7)

Editorial material1 (0.3)
Open-access publications, n (%)107 (36.1)
Document age (y), mean (SD)14.2 (11.8)
Annual growth rate (%)5.2
Publication citations

Total, na9379

Average citations per publication31.7

Average citations per year per publications2.02
References, n8814
Journals

Total, n118

Average number of publications per journal1.86

Average number of citations per journal79.5
Authors

Total, n1225

Single-author publications, n4

Author appearances, n1882

Average number of coauthors per publication6.36

Average number of publications per author0.24

International coauthorships (%)24.3

Author’s keywords, n576
Author’s country

Total, n34

Average number of publications per country2.86

Average number of citations per country436.0
Author’s institution

Total, n338

Average number of publications per institution3.11

Average number of citations per institution101.3
Author’s funding body

Total, n181

Average number of publications per funding body2.48

Average number of citations per funding body77.7

aTotal, n indicates that the respective parameter has been cited n number of times, as in 296 publications have been cited 9379 times.

The average publication age was 14.2 (SD 11.8) years, ranging from the oldest in 1975 [171] to the most recent in April 2024 [152]. From 1975 onward, there has been a steady increase in publications using AHDs to address health issues in farming populations, with an AGR of 5.2%. Notably, almost one-third of these articles (91/296, 30.7%) were published in the last 5 years, highlighting the rising interest in AHD-based public health research in this population (Figure S2 in Multimedia Appendix 1). Collectively, the publications received 9379 citations, averaging 31.7 citations per publication. Figure S3 in Multimedia Appendix 1 presents the historical direct citation network. The body of work involved 1225 authors from 338 institutions, with 1882 author appearances and an average of 6 authors per paper (Table 3). Four (1.4%) out of the 296 publications were single-author publications. On average, each paper cited 30 references.

Studies were led by authors from 34 countries, predominantly high-income nations, with 24.3% (72/296) of studies involving multicountry collaborations (Figure S4 in Multimedia Appendix 1). US-based authors contributed the most publications (91/296, 30.7%), followed by authors based in France (71/296, 24%) and Finland (35/296, 11.8%). US authors also had the most citations (3495/9379, 37.2%), with France and Finland ranking second and third, respectively.

Of 296 publications, the 25 (8.4%) most cited ones, appearing in 17 different journals, received between 83 (83/9379, 0.9%) and 485 (485/9379, 5.2%) citations (Table S5 in Multimedia Appendix 1) [150,172-196]. Of these 25 publications, 10 (40%) were published before 2000, another 10 (40%) between 2000 and 2010, and 5 (20%) after 2010. Most of these studies focused on cancer risk (16/25, 64%), while others investigated neurodegenerative disorders (5/25, 20%); respiratory conditions (2/25, 8%); and multiple health outcomes, such as sleep disorders, mental health disorders, and musculoskeletal disorders (2/25, 8%).

Table S6 in Multimedia Appendix 1 provides details on the most productive countries, prolific authors, active journals, institutions, and funding bodies.

Study Characteristics

Table 4 provides an overview of the included publications. Longitudinal study designs were the most common, including retrospective cohorts (129/296, 43.6%) and prospective cohorts (56/296, 18.9%). Case-control studies (62/296, 20.9%), cross-sectional studies (39/296, 13.2%), and ecological studies (17/296, 5.7%) were less common (Multimedia Appendix 2). A few studies (10/296, 3.4%) used multiple study designs [188,194,197-204].

The median follow-up period was 9.5 (IQR 5-17) years. On average, there was a 7-year gap (90% CI 3-14) between the most recent data used and the year of publication, with considerable variation depending on the publication year (Figure 2). The oldest data were from 1801 [205], and the most recent data were from 2022 [206]. Notably, one-third of the data used (98/296, 33.1%) were from before 2000, while nearly three-quarters (214/296, 72.3%) were from before 2015 (Figure S2 in Multimedia Appendix 1). Of 296 studies, only 10 (3.4%) used data from the last 5 years (from 2020), while 80 (27%) used data from the last 10 years (from 2015).

Studies were conducted in all continents, but most participants were from Europe (249/296, 84.1%), followed by North America (85/296, 28.7%), Asia (24/296, 8.1%), Oceania (17/296, 5.7%), Africa (4/296, 1.4%), and Central and South America (4/296, 1.4%). France (70/296, 23.6%) and the United States (67/296, 22.6%) were the most represented countries, followed by Finland (36/296, 12.2%), Sweden (32/296, 10.8%), Denmark (28/296, 9.5%), and Norway (26/296, 8.8%; Figure 3 and Figure S5 in Multimedia Appendix 1). Most studies had a regional or local scope (177/296, 59.8%), in particular, traditional epidemiological studies, such as Agriculture and Cancer (AGRICAN) [207] and Agricultural Health Study (AHS) [172], which used AHDs to either identify potential individuals for inclusion or enrich their cohorts.

Table 4. Characteristics of the included studies (1975 to 2024; N=296).
CharacteristicValues
Research goal, n (%)

Study the association between farming and a health event156 (52.7)

Study the association between individual characteristics and a health event131 (44.3)

Other research goals9 (3)
Study design, n (%)

Retrospective cohort129 (43.6)

Case-control study62 (20.9)

Prospective cohort56 (18.9)

Cross-sectional study39 (13.2)

Ecological study17 (5.7)

Multiple designs10 (3.4)

Review2 (0.7)

Perspective1 (0.3)
Geographic scope, n (%)

Nationwide117 (39.5)

Regional or local176 (59.5)
Temporal scope (y)

Follow-up period, median (IQR)9.50 (5-17)

Follow-up period, mean (SD)12.8 (14.0)

Gap between the latest data used and publication year, median (IQR)7.21 (5-9)

Gap between the latest data used and publication year, mean (SD)7.21 (4.67)
Population, n (%)

Adult265 (89.5)

Adult and child19 (6.4)

Child8 (2.7)

Not reported1 (0.3)
Sex, n (%)

Female130 (43.9)

Male169 (57.1)

Female and male188 (63.5)

Not specified108 (36.5)
Participants, n (%)

>1,000,00050 (16.9)

100,001 to 1,000,00053 (17.9)

10,001 to 100,00065 (22)

1001 to 10,00067 (22.6)

101 to 100047 (15.9)

10 to 1008 (2.7)

Not reported3 (1)
AHDatype, n (%)

Disease register158 (53.4)

Electronic health or medical record124 (41.9)

Insurance claim106 (35.8)

Population register95 (32.1)

Hospital discharge databases41 (13.9)
AHD use, n (%)

Obtain information on sociodemographics272 (91.9)

Obtain information on a health event269 (90.9)

Identify a farmer147 (49.7)

Identify an individual140 (47.3)

Obtain information on occupations117 (39.5)

Exposure assessment57 (19.3)

Obtain information on a farming activity43 (14.5)

Other uses14 (4.7)

aAHD: administrative health database.

Figure 2. Number of years between the most recent data used and publication for all included articles (1975-2024). Points refer to the average number of years or gap between the most recent data used and publication (x-axis) for each publication year (y-axis). Error bars refer to the 90% CI of the number of years between the most recent data used and publication.
Figure 3. World map of the number of publications per country of the farming population studied between 1975 and 2024.

Most studies included 1001 to 10,000 participants (67/296, 22.6%), followed by studies with 10,001 to 100,000 participants (65/296, 22%) and 100,001 to 1 million participants (53/296, 17.9%; Table 4). Larger studies (>1 million participants) accounted for 16.9% (50/296) of the included publications. Smaller studies, with 100 to 1000 participants, were less common (47/296, 15.9%), and very few (8/296, 2.7%) had <100 participants. Most studies included adult participants (284/296, 95.9%). Of 296 studies, 169 (57.1%) examined male individuals, 130 (43.9%) examined female individuals, and 188 (63.5%) examined both sexes, but 108 (36.5%) did not specify the participants’ sex.

More than half of the studies (156/296, 52.7%) aimed to explore the relationship between farming activities (eg, dairy farming) and health events, while 131 studies (44.3%) focused on individual characteristics, such as occupation, age, sex, and socioeconomic status (Figure S6 in Multimedia Appendix 1). Among those studies examining individual characteristics, farming was often considered broadly and compared to other occupations (95/131, 72.5%). Conversely, in studies investigating health outcomes specifically related to farming activities, agriculture was treated as a broad category in only 27.6% (43/156) of the cases. Most studies (277/296, 93.6%) used the general population or other nonfarming groups as the reference category without differentiating farmers by job role (eg, farm managers vs farm workers). Descriptive statistics and multivariable regression were the most commonly used methods. Notably, only 2 studies (2/296, 0.7%) incorporated artificial intelligence (AI) in their analysis [208,209].

Few studies investigated health outcomes in farmers’ family members or nonfarmers exposed to farming. Of 296 studies, only 3 (1%) focused on health events in farmers’ partners [177,210,211], 5 (1.7%) on farmers’ children [179,212-215], and 6 (2%) on nonfarmers exposed to farming-related risks [209,216-221]. There were 11 (3.7%) studies that explored health risks in migrant workers.

Some publications reported findings from the same cohorts (Figure S7 in Multimedia Appendix 1). The 10 most prolific cohorts included France-based AGRICAN (18/296, 6.1%) [207], the US-based AHS (17/296, 5.7%) [172], Nordic Occupational Cancer Study (NOCCA; 12/296, 4.1%) from Nordic countries (Finland, Denmark, Norway, Sweden, and Iceland) [189], France-based Tracking and Monitoring Occupational Risks in Agriculture (TRACTOR; 7/296, 2.4%) [222], and Cancer in the Norwegian Agricultural Population (7/296, 2.4%) [182] cohorts. Other notable cohorts included the US-based United Farm Workers of America (6/296, 2.0%) [223], France-based BALISTIC (5/296, 1.7%) [224], the international (29 countries) consortium agricultural cohort (AGRICOH; 4/296, 1.4%) [150,225], AIRBAg (4/296, 1.4%) from France [226], and the US-based National Agricultural Workers Survey (3/296, 1%) [227]. Among these top 10 cohorts, only NOCCA, United Farm Workers of America, and TRACTOR exclusively used AHDs.

AHD Use

There was high heterogeneity in the coding systems used and the granularity of the information available regarding health events (outcomes), population, and exposure determinants, depending on the AHD and study considered. Regardless of the publication reviewed, AHDs and other datasets were never reported as adhering to the findable, accessible, interoperable, and reusable (FAIR) data principles [228-230]. In addition, none of them could be considered as FAIR data because, with a few exceptions [222], most AHDs were not precisely described, and data availability statements were rare. Furthermore, mainly due to privacy concerns, AHDs were not available for open and free access.

The most commonly used AHDs were disease registers, used in more than half of the studies (158/296, 53.4%), followed by electronic health or medical records (124/296, 41.9%), insurance claims (106/296, 35.8%), population registers (95/296, 32.1%), and hospital discharge databases (41/296, 13.9%; Table 4). Among disease registers, cancer (120/158, 75.9) and mortality registers (75/158, 47.5%) were the most frequently used (Figure S8 in Multimedia Appendix 1 and Multimedia Appendix 2). Nearly one-third of the studies (91/296, 30.7%) relied on a single AHD, with disease registers being the most common (38/91, 42%), followed by insurance claims (29/91, 32%), electronic health or medical records (18/91, 20%), population registers (5/91, 5%), and hospital discharge databases (1/91, 1%). Other types of digital data were used less frequently, including pesticide registration records (13/296, 4.4%), job-exposure matrices (JEMs; 12/296, 4.1%), crop-exposure matrices (11/296, 3.7%), pesticide use records (8/296, 2.7%), climate data (7/296, 2.4%), and air quality data (2/296, 0.7%). While contextual data were sometimes used (9/296, 3.0%), person-generated data, smart agriculture data, and omics were never used.

The AHDs and other digital data were primarily used to obtain sociodemographic information (272/296, 91.9%) and health event data (269/296, 90.9%). They were also used to identify farmers (147/296, 49.7%) or individuals (140/296, 47.3%), gather occupational information (117/296, 39.5%), assess exposure (57/296, 19.3%), obtain data on farming activities (43/296, 14.5%), or track climate conditions (7/296, 2.4%).

Nearly two-thirds of the studies (181/296, 61.1%) relied exclusively on digital data (AHDs or other), while more than one-third (112/296, 37.8%) incorporated self-reported information/active data (requiring active participant involvement) as part of epidemiological cohorts. A total of 111 (37.5%) out of 296 studies used participant-completed questionnaires (paper or electronic) to gather sociodemographic data and confounding factors (98/296, 33.1%), assess exposure (96/296, 32.4%), or collect health information (83/296, 28%). Some information was obtained through interviews (44/296, 14.9%) or clinical examinations (32/296, 10.8%). Biological monitoring (24/296, 8.1%) and airborne monitoring (2/296, 0.7%) were sometimes used, whereas no study reported dermal monitoring (Figure S9 in Multimedia Appendix 1).

Among all the AHDs used, the Mutualité Sociale Agricole (MSA) is a singularity. To the best of our knowledge, it is the only AHD specifically dedicated to the entire farming population of a country. Indeed, MSA is the French national insurance scheme that covers the entire farming workforce (5% of the overall French population) [115,128]. MSA was used in 60 studies (60/296, 20.3%). These studies were often part of cohorts with multiple publications, such as AGRICAN (18/60, 30%), TRACTOR (7/60, 12%), BALISTIC (5/60, 8%), AIRBAg (4/60, 7%), Aging Multidisciplinary Investigation (2/60, 3%) [151], BM3R (2/60, 3%) [231], FERMA (risk factors of the rural environment and allergic and respiratory disease; 1/60, 2%) [232], and Phytoner (1/60, 2%) [233]. Of these, TRACTOR was the only cohort using exclusively MSA data [222].

Farming Exposure

A variety of exposure proxies were used to assess farming-related exposure. The most common proxy was a job title, which generally referred to whether the individual was a farmer (184/296, 62.2%). Other proxies included specific farming activities, such as dairy or crop farming (111/296, 37.5%), general pesticide exposure (yes or no; 62/296, 20.9%), and exposure to specific pesticide compounds (eg, glyphosate or paraquat) or pesticide classes (eg, insecticides; 51/296, 17.2%; Figure S10 in Multimedia Appendix 1 and Multimedia Appendix 2). The number of farming activities studied ranged from just 1 [226] to 78 [222], with an average of 8 farming activities per study. Similarly, the number of pesticide compounds assessed ranged from 1 [234] to 943 [235], with an average of 42 pesticides per study. Only 1 study investigated the mixture effect of exposure to multiple pesticide combinations on human health [236]. Investigations into other chemical exposures were also limited, with only 2 papers each addressing silica exposure [237,238] and air pollution [194,217] (2/296, 0.7%). Notably, no studies examined exposure to per- and polyfluoroalkyl substances or micro- and nanoplastics. Research on the broader farming exposome was rare (5/296, 1.7%) and typically used farming activities as proxies [152].

Of 296 studies, few explored exposure to physical agents, with 5 studies (1.7%) focusing on radiation [187,218,239-241]. No studies investigated the effects of climate change on farmers’ health. Exposure to biological agents was rarely studied as well, with just 3 (1%) out of 296 papers addressing mycotoxins [241-243]. Finally, only 3 studies (1%) examined psychological factors related to farming exposure [244-246].

Health Events

The most frequently studied health events were cancer (142/296, 48%), followed by mortality (44/296, 14.9%), injuries (38/296, 12.8%), workplace accidents (32/296, 10.8%), respiratory disorders (30/296, 10.1%), neurodegenerative diseases (28/296, 9.5%), and mental health issues (26/296, 8.8%; Figure S11 in Multimedia Appendix 1 and Multimedia Appendix 2). Less frequently studied conditions included cardiovascular diseases (16/296, 5.4%), autoimmune disorders (11/296, 3.7%), musculoskeletal disorders (11/296, 3.7%), reproductive disorders (3/296, 1.0%), sleep disorders (1/296, 0.3%), and frailty (1/296, 0.3%). Notably, no studies explored the farming microbiome.

Among cancers, lung cancer was the most commonly investigated cancer (43/142, 30.3%), followed by prostate cancer (38/142, 26.8%), leukemia (37/142, 26.1%), colorectal cancer (35/142, 34.6%), multiple myeloma (35/142, 34.6%), non-Hodgkin lymphoma (35/142, 34.6%), bladder cancer (31/142, 21.8%), and brain cancer (31/142, 21.8%; Figure S12 in Multimedia Appendix 1). Respiratory disorders were primarily focused on asthma (15/30, 50%) and COPD (chronic obstructive pulmonary disease; 14/30, 47%). Parkinson disease was the most studied neurodegenerative condition (16/28, 57%), followed by multiple sclerosis (6/28, 21%). Fewer publications examined Alzheimer disease (2/28, 7%) and amyotrophic lateral sclerosis (2/28, 7%; Figure S13 in Multimedia Appendix 1). In the mental health field, suicide (12/26, 46%) and depression (8/26, 31%) were the most investigated issues (Figure S14 in Multimedia Appendix 1).

Keyword Analysis

Overview

Following an initial extraction of 1259 authors’ keywords, manual harmonization was performed. Duplicate keywords were removed through singular or plural standardization (130/1259, 10.3%) and synonym unification and grouping of cancer-related terms (553/1259, 43.9%), yielding a final set of 576 (45.8%) harmonized keywords, which were all used in subsequent analyses to prevent selection bias.

On average, each publication included 8.90 keywords (90% CI 0-17), although 35 (11.8%) out of 296 publications lacked any keywords, in line with the journal guidelines. Keyword analysis confirmed prior findings regarding farming exposure and health outcomes. It also provided deeper insights into emerging research hot spots, directions, and gaps.

Of the total 576 keywords, 301 (52.3%) appeared only once, while 68 (11.8%) were mentioned at least 10 times. More frequently used keywords included 39 that appeared at least 20 times (39/576, 6.8%) and 11 that featured in at least 50 publications (11/576, 1.9%). The 50 most frequently used keywords were mentioned in at least 17 (5.7%) out of 296 publications, while the top 10 appeared in at least 51 publications (17.2%; Table S7 in Multimedia Appendix 1). The most frequently cited keyword was “cancer” (150/296, 50.7%), followed by “mortality” (96/296, 32.4%), “pesticide” (88/296, 29.7%), “occupation” (82/296, 27.7%), “farmer” (77/296, 26.0%), “agriculture” (74/296, 25%), “exposure” (57/296, 19.3%), and “epidemiology” (57/296, 19.3%).

In terms of overall citations, “cancer” (5766/9379, 61.5%), “pesticide” (3569/9379, 38.1%), and “mortality” (3097/9379, 33%) were the most cited keywords. During the past decade, the frequency of the top 5 keywords has drastically increased (Figure S15 in Multimedia Appendix 1). Notably, keywords such as “cancer,” “mortality,” “occupation,” “pesticide,” “agriculture,” and “farmer” have been consistently present in publications spanning at least 30 years (not necessarily consecutively; Figure S16 in Multimedia Appendix 1). In the last decade, emerging keywords, such as “big data,” “administrative health database,” “dust,” and “BMI,” have gained prominence (Figure S17 in Multimedia Appendix 1).

Keyword Co-Occurrence

A keyword co-occurrence network illustrating the frequency of keyword co-mentions in publications was constructed, thereby revealing relationships and conceptual connections (Figure 4). In this network, nodes or vertices represent keywords, with their sizes indicating frequency, while edges denote co-occurrences. The network’s density and arrangement reveal topic interconnectivity, with larger vertices representing more frequently mentioned keywords. The network visualization helps identify clusters of related topics and highlights core research areas.

Using a community detection algorithm (spin-glass model with simulated annealing), 4 distinct clusters or communities of keywords were identified. Each cluster groups keywords that are often mentioned together, with stronger internal associations and weaker connections to keywords in other clusters.

The most frequently used keywords for each cluster were “cancer” (red cluster), “pesticide” (purple cluster), “mortality” (green cluster), and “exposure” (blue cluster). The red cluster highlights associations between various types of cancer, reflecting the fact that studies investigating cancer risks often examine multiple types of cancer. The green cluster links “mortality” with terms such as “mental health,” “injury,” and “animal farming,” explained by the association between workplace accidents, mental health issues (eg, suicide), animal farming, and mortality. In the purple cluster, “pesticide” connects with “occupational exposure” and “farming activity,” emphasizing that pesticide exposure is primarily studied in occupational settings across different types of farming. The blue cluster connects “exposure” to terms such as “neurodegenerative disease,” “respiratory disorder,” “cardiovascular disorder,” “risk factor,” “air pollution,” “age,” and “diet,” indicating the study of various risk factors in relation to several health events. These clusters highlight current research hot spots that focus on 4 main interconnected themes: the associations between risk factors, pesticide exposure, farming activities, and a range of diseases.

Figure 4. Keyword co-occurrence network of the 296 articles published between 1975 and 2024. Each vertex or node represents a keyword, while edges represent the co-occurrence between keywords. Two keywords are connected when they co-occur in the same publication, and the size of each vertex indicates the frequency of a keyword: larger vertices represent more frequently mentioned keywords. Keywords with the same color (cluster) represent a research area. AAW: workplace accident; COPD: chronic obstructive pulmonary disease; CVD: cardiovascular disorder; MSD: musculoskeletal disorder; SES: socioeconomic status.
Thematic Mapping: Research Hot Spots

Figure 5 presents a thematic map that illustrates current research directions. Thematic mapping visualizes the relationship between research themes or topics, enabling the identification of directions, emerging areas, and gaps in the literature. The result is a strategic diagram that shows how themes relate to each other and their relevance within a specific field. The graph is divided into 4 quadrants, categorizing topics based on their relevance (x-axis, Callon centrality) and maturity (y-axis, Callon density) within the broader research landscape. Each circle represents a theme or topic (ie, a cluster of equivalent keywords), with the circle size corresponding to the frequency of the keywords associated with that theme.

The upper-right quadrant represents “hot topics,” which are both highly relevant and mature in the research landscape. Four key hot topics drive AHD-based public health research in farming populations. These include 1 topic focused on cancer research; another on respiratory disorders; and a third encompassing neurodegenerative diseases, workplace accidents, injuries, and mental health issues. The final hot topic involves large-scale studies in France and Europe using big data and insurance claims.

The lower-right quadrant contains “basic topics,” which are relevant but not yet mature in the research landscape. Only 1 such theme emerged: research related to pesticide exposure, mortality, and farming.

In the upper-left quadrant, “niche themes” refer to mature research topics that have not yet achieved full relevance. Three niche themes were identified: the first involves studies examining aging and research conducted in Norway; the second focuses on reproductive disorders and parental exposure, a theme poised to potentially evolve into a hot topic; and the final niche theme covers genetics and metabolism.

Finally, the lower-left quadrant contains “peripheral topics,” which represent either emerging or declining themes with low relevance and maturity. Four peripheral topics were observed, of which 2 (50%) were primarily centered on research on ocular disorders, 1 (25%) on the use of electronic health or medical records, and 1 (25%) on studies conducted in India.

This thematic map helps highlight both well-established and emerging areas of research, as well as gaps that may be ripe for future investigation.

Figure 5. Thematic mapping: research hot spots based on keywords from the 296 articles published between 1975 and 2024. The graph shows how themes relate to each other and their relevance within a specific field. This graph is divided into 4 quadrants, categorizing topics based on their relevance (x-axis and Callon centrality) and maturity (y-axis and Callon density) within the broader research landscape. Each circle represents a theme or topic (ie, a cluster of equivalent keywords), with the circle size corresponding to the frequency of the keywords associated with that theme. AAW: accident at work; COPD: chronic obstructive pulmonary disease; EHR: electronic health record; EMR: electronic medical record.
Bibliometric Farming Exposome

To identify research directions and gaps in the farming exposome literature, a bibliometric keyword co-occurrence analysis was conducted to explore the farming exposome by examining the co-occurrence between keywords associated with potential risk factors and specific health outcomes. This analysis was restricted to exposome-related and health event–related keywords. Of 576 keywords, 130 (22.6%) were related to the exposome, among which 93 (16.1%) were related to the specific external exposome (eg, pesticide), 19 (3.3%) to the general external exposome (eg, climate), and 18 (3.1%) to the internal exposome (eg, oxidative stress). Furthermore, there were 70 (12.2%) health event–related keywords (eg, brain cancer).

The results of this analysis are provided in Tables 5 and 6 and Multimedia Appendix 3, with each cell representing the percentage of occurrences of an exposome-related keyword (eg, air pollution) in all publications mentioning a specific health event–related keyword (eg, Alzheimer disease). For example, a value of 33.3 indicates that an exposome-related keyword appeared in 33.3% of all publications mentioning a specified health event–related keyword. To facilitate interpretation and ease the reading of Tables 5 and 6, exposome-related keywords were categorized into 19 groups (eg, chemical agent) and health event–related keywords into 20 groups.

Table 5. Co-occurrence between keywords related to internal exposomes and health event categories among the articles published between 1975 and 2024. Each cell refers to the number of times (%) a keyword related to an exposome category (eg, chemical agent) was mentioned among all publications in which a keyword related to a health event category (eg, cancer) appeared (N=296). Please note that the absolute value for each row is provided in parentheses with the row header and remains the same for all the parameters in that row.
Health event, nInternal exposome (%)

AgeSexBMIBPaHeredityEthnicityHormoneMenopauseMetabolismOSbInflammation

Cardiovascular disease (n=17)5.8811.85.885.8805.8800000
Work-related disease (n=5)020000000000
Autoimmune disease (IBDc, RAd, vasculitis, and NRe; n=9)0011.100000000
Cancer (n=150)3.6227.52.1700.731.453.620.732.90.730
Dental health (n=2)00000000000
Ocular disorder (n=6)16.70000000000
Frailty (n=2)5005000000000
Anemia (n=1)10010000000010000
Infectious disease (malaria, Lyme disease, tuberculosis, toxoplasmosis, and NR; n=14)07.140007.1400000
Injury (including workplace accident and disability; n=40)4.268.512.13002.1302.13000
Chronic kidney disease (n=3)00000000000
Mental health disorder (depression, suicide, and NR; n=25)9.524.764.7600000000
Metabolic disorder (diabetes, dysthyroidism, and NR; n=9)11.1011.111.1011.100000
Mortality (n=75)4.1717.7001.0401.042.08000
Musculoskeletal disorder (arthritis, low-back pain, and NR; n=14)7.1414.37.1400000000
Neurodegenerative disease (ADf, ALSg, MNDh, MSi, PDj, and NR; n=33)6.060003.036.060012.16.060
Sensory impairment (n=1)00000000000
Reproductive disorder (birth defects, infertility, spontaneous abortion, and NR; n=24)028.614.3007.147.1407.1400
Respiratory disorder (allergy, asthma, COPDk, pneumonia, sarcoidosis, and NR; n=39)5.8811.88.822.9405.88005.8802.94
Skin disorder (dermatitis and NR; n=2)05000000000 0

aBP: blood pressure.

bOS: oxidative stress.

cIBD: inflammatory bowel disease.

dRA: rheumatoid arthritis.

eNR: not reported.

fAD: Alzheimer disease.

gALS: amyotrophic lateral sclerosis.

hMND: motor neuron disease

iMS: multiple sclerosis.

jPD: Parkinson disease.

kCOPD: chronic obstructive pulmonary disease.

Distinct keyword exposome profiles were developed for each health event–related keyword (Figures S18-S43 in Multimedia Appendix 1), as illustrated in Figure 6 for mental health disorders. Most exposome-related keywords associated with keywords related to mental health disorders pertained to the type of occupations as well as chemical, lifestyle, socioeconomic, and psychological factors. Cancer-related keywords were associated mostly with keywords related to the internal (sex) and specific external exposome (chemical agents, lifestyle, and type of occupations). Autoimmune disease–related keywords co-occurred mostly with external exposome–related keywords (chemical agents, lifestyle, type of occupations, and socioeconomic factors). Neurodegenerative disease–related keywords were associated mostly with keywords related to the specific external exposome (lifestyle, chemical agents, and type of occupations). Reproductive disorders co-occurred mostly with internal (sex and BMI) and specific external exposome–related keywords (chemical agents and type of occupations). Keywords related to both musculoskeletal disorder and injury were associated with keywords from all 3 exposome components, in particular sex, type of occupations, lifestyle, biomechanical factors, chemical agents, and psychological factors. Infectious disease–related keywords co-occurred with specific external exposome–related keywords (biological agents and type of occupations), while respiratory disorder–related keywords were associated mostly with internal (sex) and specific external exposome–related keywords (lifestyle, chemical and biological agents, and type of occupations). Cardiovascular disorder–related keywords were associated with keywords from all 3 exposome components, in particular the sex, type of occupations, lifestyle, chemical agents, and socioeconomic and psychological factors.

Table 6. Co-occurrence between keywords related to specific external and general external exposomes and health event categories among the articles published between 1975 and 2024. Each cell refers to the number of times (%) a keyword related to an exposome category (eg, chemical agent) was mentioned among all publications in which a keyword related to a health event category (eg, cancer) appeared (N=296). Please note that the absolute value for each row is provided in parentheses with the row header and remains the same for all the parameters in that row.
Health event, nSpecific external exposome (%)General external exposome (%)

LifestyleCAaBAbBFcOccupationPAdSFePFf

Cardiovascular disease (n=17)11.829.4011.823.5029.417.6
Work-related disease (n=5)0202020400200
Autoimmune disease (IBDg, RAh, vasculitis, and NRi; n=9)22.244.40055.6022.20
Cancer (n=150)17.451.47.251.4558.75.88.70
Dental health (n=2)5000000500
Ocular disorder (n=6)16.716.70000016.7
Frailty (n=2)100500000050
Anemia (n=1)00000000
Infectious disease (malaria, Lyme disease, tuberculosis, toxoplasmosis, and NR; n=14)21.421.4507.1428.6014.37.14
Injury (including workplace accident and disability; n=40)12.814.92.1314.923.42.134.266.38
Chronic kidney disease (n=3)010066.70033.300
Mental health disorder (depression, suicide, and NR; n=25)28.623.804.7633.34.7623.828.6
Metabolic disorder (diabetes, dysthyroidism, and NR; n=9)33.344.4011.144.411.1022.2
Mortality (n=75)17.755.28.334.1754.25.2117.72.08
Musculoskeletal disorder (arthritis, low-back pain, and NR; n=14)14.321.47.1414.342.97.14021.4
Neurodegenerative disease (ADj, ALSk, MNDl, MSm, PDn, and NR; n=33)21.248.56.06036.409.093.03
Sensory impairment (n=1)0000100000
Reproductive disorder (birth defects, infertility, spontaneous abortion, and NR; n=24)7.1435.77.14064.307.140
Respiratory disorder (allergy, asthma, COPDo, pneumonia, sarcoidosis, and NR; n=39)47.135.311.82.9435.308.825.88
Skin disorder (dermatitis and NR; n=2)50500050000

aCA: chemical agent.

bBA: biological agent.

cBF: biomechanical factor.

dPA: physical agent.

eSF: socioeconomic factor.

fPF: psychological factor.

gIBD: inflammatory bowel disease.

hRA: rheumatoid arthritis.

iNR: not reported.

jAD: Alzheimer disease.

kALS: amyotrophic lateral sclerosis.

lMND: motor neuron disease

mMS: multiple sclerosis.

nPD: Parkinson disease.

oCOPD: chronic obstructive pulmonary disease.

Figure 6. Chord diagram of keyword co-occurrence between potential risk factors and mental health disorder keywords from the 296 articles published between 1975 and 2024. Disease-related keywords are displayed on the top half of the chord diagram, while exposome-related keywords are displayed on the bottom half. Each chord or link indicates that an exposome-related keyword was mentioned with a disease-related keyword (co-occurrence) at least once in the same publication. The chord color differs from one exposome-related keyword to another. SES: socioeconomic status.

Principal Findings

This review provides the first comprehensive and objective synthesis of research on the use of AHDs to address health issues in farming populations. It identifies major contributors, key publications, and existing research gaps while also suggesting future directions for leveraging AHDs to study health issues in farming populations. Overall, findings indicate that only a small part of the exposome and a limited range of health events have been examined within farming populations through the reuse of AHDs.

Current Directions in AHD Use for Public Health Research in Farming Populations

Research using AHDs in farming populations has been predominantly conducted in developed countries [150,225], with the United States [172,223,227]; France [207,222,224]; Canada [145,186,195]; and Scandinavian nations [182,189], including Denmark, Finland, Norway, and Sweden, leading the field. This dominance is linked to considerable funding from these regions and international collaborations. Scandinavian countries are particularly advanced in AHD use, offering databases that are highly complete, accessible, and well-integrated into public health research. AHDs from Denmark, Sweden, Canada, and France also provide comprehensive data on a patient’s digital trajectory within their respective health systems [93,98,104,108,113,115,128,247]. France stands out further, with an AHD dedicated specifically to the entire farming population (MSA). This may explain the frequency of large-scale and long-term studies from these countries, some of which included >100,000 participants. However, many studies still had a regional focus, partly due to the use of AHDs by traditional epidemiological studies such as AGRICAN [207] and AHS [172], which rely on limited resources [47-50,53,54,57]. These studies often used AHDs to identify farming populations for inclusion or to supplement cohort data. The international AGRICOH consortium, initiated by the US National Cancer Institute and the International Agency for Research on Cancer, includes 11 (38%) of the 29 cohorts identified in this review [150,225]. However, several cohorts in AGRICOH were not identified, potentially due to lack of publications, language barriers, or limited use of AHDs. There were many publications associated with these well-established cohorts, for which many of the most prolific authors were working [172,182,189,207,222-227].

The most frequently used AHDs in farming population health research were disease registers, followed by electronic health or medical records and insurance claims. More than two-thirds of the studies used disease registers, in particular, cancer and mortality registers. This is not surprising because disease registers are created for clinical and research purposes with a continuous, exhaustive, and optimal digital collection of individual data regarding ≥1 health event in a geographically defined population [53,105]. The coding systems and the granularity of information related to health outcomes, populations, and exposure determinants varied widely across studies. Most studies (291/296, 98.3%) used AHDs to collect sociodemographic and health event information [222].

There was no consensus on the best methods or proxies to assess farming exposure. A variety of exposure proxies and determinants were used across studies, with indirect methods being the most common. Many studies (237/296, 80.1%) dichotomized proxy, for example, classifying individuals as farmers or nonfarmers or as pesticide-exposed versus nonexposed. In nearly two-thirds of the included studies, job title (ie, “being a farmer”) served as the primary exposure proxy. About one-third of studies took into account specific farming activities (eg, dairy farming and crop farming) to reflect the diversity of farming practices. This approach is a valuable proxy for agricultural exposure, offering a broader representation of the farming exposome, which involves multiple stressors beyond just pesticides [147,152,188,201,248,249]. Farming activity information was often derived from digitalized data, such as agricultural censuses or self-reported data from mandatory insurance enrollments [152]. Many studies (111/296, 37.5%) combined AHDs with self-reported data (eg, questionnaires) [172,207], which allowed for more comprehensive data collection but tended to restrict the scope to regional studies due to resource constraints. These studies typically yielded high-quality data, with more potential confounders considered compared to studies relying solely on AHDs. Most studies (68/111, 61.3%) using self-reported data focused on single exposures, mainly pesticides, with only 1 study addressing multiexposure to pesticides [236]. Biological monitoring and airborne chemical sampling were rarely conducted, likely due to practical and financial constraints and the short half-lives of most pesticides [250]. Dermal chemical monitoring has not been reported, even though it is the main exposure route for pesticides [251]. The high number of studies investigating exposure to pesticides may be explained by the fact that AHS focuses on pesticide applicators and their spouses [172] and because many pesticides have adverse health effects on humans, such as neurotoxicity or endocrine disruption [252-256]. Beyond pesticides, farmers face exposure to other chemicals [257], such as air pollution; micro- and nanoplastics [258-263]; biological agents (eg, endotoxins and zoonoses) [264-266]; physical agents (eg, UV radiation, noise, and vibrations) [187,267]; biomechanical factors (eg, repetitive movements, heavy load, and working posture) [198,268,269]; and psychosocial stressors [270-274], which have been less studied than pesticides. Despite these multiple exposures, the broader farming exposome remains understudied.

In addition to AHDs, some studies integrated other secondary data, such as climate data [187,275], air quality data [194,217], JEMs [212,276,277], or crop-exposure matrices [278]. JEMs provide exposure level estimates for various chemicals and stressors based on job categories [250,279]. Although JEMs can provide valuable exposure data, they often lack the specificity of individual-level data, making it difficult to account for task-specific risks, temporal variations, and the inclusion of specific worker subgroups such as female individuals [250,279-282]. The lack of a universal standard for JEMs further complicates their application, which may explain why many studies still rely on self-reported data for more accurate exposure assessment despite the risk of recall bias [250,279-282].

The health outcomes studied were predominantly cancer [145,150,151,171-175,177-179,182,183,185,189,190,196,207,210-212,223], mortality [173,186,194,195,202,205], workplace injuries [198,208], respiratory disorders (eg, asthma and COPD) [151,180,181,213,224,226], neurodegenerative diseases (eg, Parkinson disease) [111,151,176,184,187,188,193,197,201, 248,249,283], and mental health issues [151,244,273], which represent focal points within the research field. This is not surprising because many well-established cohorts centered on cancer research, in particular AGRICAN [207], Cancer in the Norwegian Agricultural Population [182], and NOCCA [189]. In addition, arsenic and inorganic arsenic compounds are classified as carcinogenic to humans by the International Agency for Research on Cancer [252,253,284], while malathion, glyphosate, diazinon, dichlorodiphenyltrichloroethane, and occupational exposures in spraying and application of nonarsenical insecticides are classified as probably carcinogenic to humans (group 2A), and several other pesticides are ranked as possibly carcinogenic to humans (group 2B), such as tetrachlorvinphos and parathion. Regarding mortality, it is often cancer and suicide mortalities that are investigated [186,195,202,244]. Furthermore, several pesticides are neurotoxic [252,255], but existing studies focused mainly on Parkinson disease and multiple sclerosis, with a paucity of data on Alzheimer disease and other neurodegenerative diseases [147,283]. In contrast, certain areas, such as cardiovascular diseases [151,194,203,227,285,286], autoimmune disorders [168,219,237], musculoskeletal disorders [192,204,245,287,288], reproductive disorders [242,289,290], sleep disorders [191], aging-related conditions [151,291], hearing impairment [267,292], and the microbiome [8,293-298], remain underexplored, despite their potential relevance to farming populations.

Challenges of Reusing AHD for Public Health Research in Farming Populations

Each AHD presents unique advantages and limitations. For example, large sample sizes and a large number of available health events are strengths, while generalizability and the absence of key confounders are challenges [64,93,95,105,115]. Access to AHDs is frequently restricted by a variety of factors, including governance and technical barriers, such as language, data structure, interoperability, and coding systems. Additional challenges stem from the type of AHD (eg, insurance claims or cancer registers), inadequate documentation (eg, absence of a data dictionary), limited accessibility due to costs or conditions, and jurisdictional and legal constraints [30,62,64,81,113,115]. Identifying the optimal AHD for a given research question is also complex, especially when considering the heterogeneity in coding systems and country-specific data structures. In countries such as Scandinavia, Canada, and France, individual identifiers facilitate data linkage across multiple AHDs, enhancing research opportunities [93,104,113,115,128,247]. However, many AHDs are not ready for research and require significant processing, cleaning, and understanding before they can be analyzed [93,105,113,222,299]. Another major challenge is the long lag between data access, analysis, and research publication. On average, studies used data that were 7 years old at the time of publication, largely due to delays in data access, administrative approvals, and the need to prepare complex datasets for analysis [223,300]. For instance, the TRACTOR project took 2 years to clean and prepare its dataset for research use [222]. These delays are compounded by the time required to conduct statistical analyses and prepare manuscripts for publication, as well as review and publication times (delay from submission to acceptance and from acceptance to publication). Another limitation of AHDs is the lack of detailed exposure data. AHDs rarely include exposure information due to their administrative focus, requiring researchers to supplement with additional data sources, such as JEMs or self-reported data. When exposure information is recorded in AHDs, it is often too generalized, typically only reflecting broad job classifications, such as farming, without specifying detailed activities or stressors. There are some exceptions, such as MSA data, which capture a wider range of specific farming activities (eg, dairy farming) [222]. However, exposure to specific stressors (eg, chemical compounds) is largely absent from AHDs.

The reference populations used in farming studies vary, which precludes direct comparisons and limits the generalizability of the findings. For example, AGRICAN used the general population as a reference [207], while TRACTOR used a farming population [152,168]. Furthermore, studies differ in their focus on specific farming populations, such as the entire agricultural workforce [207], farm managers [152], or pesticide applicators [172], which may lead to distinct exposure profiles that influence health outcomes because these farming populations have different socioeconomic status, experiences, and behaviors. Hence, to avoid or lessen bias, some studies focused on specific farming populations [152,172]. Moreover, the scope of farming populations included in studies is often limited, omitting subgroups, such as farm families, nearby residents, or consumers exposed to agricultural products, which limits the broader application of the findings. In addition, farming practices can vary significantly between countries and studies, and there is no international standardized classification for farming activities. In some cases, farming categories are derived from legal or administrative sources, as seen in the MSA data [152,168,283]. This lack of standardization limits the comparability and generalizability of findings across studies. In addition, the generalizability of the findings to other countries when using AHDs may be limited because of the differences in health care systems [93].

There are several well-known limitations of AHDs that complicate the investigation of health outcomes [301]. Health events captured in AHDs are typically limited to those requiring medical attention, which may not reflect the true incidence of diseases. In addition, the level of detail varies across diseases, even within the same AHD [92]. Although diagnostic accuracy is generally higher in disease registers, these are often geographically limited and cover only a subset of health conditions. For example, in France, cancer registers only cover 23 (24%) out of 96 administrative regions [155]. In addition, certain conditions, such as depression, are not covered by any registers. Identifying health outcomes in AHDs often requires complex algorithms that combine data from multiple sources, such as drug reimbursements, disease declarations, or medical procedures [104,105,128,152,302-304]. In addition, inconsistencies in case definitions and algorithms across studies and countries hinder the ability to compare and pool risk estimates [104,302,304]. Some AHDs also lack critical clinical information, such as laboratory results and genetic data [115,128,305], and the recorded diagnosis or treatment date may not correspond to the actual onset of the disease. Furthermore, diagnosis codes are not always indicative of a confirmed diagnosis.

Emerging Opportunities and Research Needs

While AHDs are well-used in certain countries, there are underexplored opportunities in regions such as the United Kingdom, where AHDs exist but are underused for research [113]. For low- and middle-income countries, the development and access to AHDs remain limited, and international support is needed to expand this research infrastructure. As already reported by Habib et al [306], there is a notable lack of sex-specific data, even though occupational exposures and health outcomes can vary significantly between sexes due to genetic, physiological, psychological, and behavioral factors [307-311]. Future research should address these disparities to provide a more comprehensive understanding of health risks in farming communities. Although there are inherent delays in using AHDs due to the time required for data generation, consolidation, and access, we advocate for the continued publication of studies, even those using older data. Historical data remain vital for better understanding long-term health patterns, particularly for diseases such as cancer, where tumor initiation can span decades [312,313]. Editors should encourage the publication of studies using older datasets, especially when addressing long-term health outcomes (eg, cancer and neurodegenerative diseases) or when recent data are not available [312].

None of the analyzed AHDs fully adhere to the FAIR principles, possibly because most were developed before the establishment of these principles in 2016 [228-230]. Moreover, the assessment of FAIR compliance of AHDs relied solely on information presented in publications, which may not provide a comprehensive evaluation. Nevertheless, there is a critical need to advocate for the integration of FAIR principles within AHDs to enhance public health research [228-230]. Currently, the landscape is favorable for data reuse, particularly with initiatives such as the forthcoming European Health Data Space [314-316]. Data reuse extends beyond mere access; it encompasses data discovery, a fundamental aspect of FAIR principles that involves recognizing the existence of databases [228-230]. To facilitate this, the creation of data catalogs is essential [228]. Numerous data repositories, such as Re3Data [317], Zenodo [318], CANUE [319], Figshare [320], “Epidémiologie – France” [321], data.gouv [322], Dataverse [320], or Data Europa [320], already exist. In addition, specialized multidisciplinary open-access and peer-reviewed journals such as Scientific Data and Data in Brief publish datasets [318]. A dataset search can also be conducted using the Google (Google LLC) platform [323]. However, the documentation and access conditions for datasets can highly vary across inventories, complicating the selection process for researchers. The absence of indicators or scores for data reusability further hampers efforts to identify the most suitable datasets for specific research questions [45,69]. To our knowledge, no comprehensive catalog of AHDs currently exists to date. A web-based inventory of AHDs, modeled after existing resources, such as OccupationalCohorts.net [324], OccupationalExposureTools.net [325], and Toxicological and Exposure Database Inventory [10], could greatly enhance research endeavors. The motivation for analyzing AHDs often stems from the data they contain. Consequently, as data availability increases, researchers will be better positioned to formulate research questions and engage in a parallel process of “datagraphy” or “datagraphic search” akin to traditional bibliographic research. The objective of datagraphy would be to determine which datasets are best suited for addressing specific research questions, highlighting the need for accessible catalogs to support this goal.

There is also an opportunity to integrate other secondary data, such as person-generated data (eg, mobile health, social media, digital footprints, and wearable sensors); contextual data (eg, climate and air quality); and smart agriculture data [2,83,84,326-328]. These datasets, largely untapped in farming population research, could provide new insights into health outcomes and environmental exposures [101].

Nationwide studies using big data were a hot spot. AI, such as machine learning, is particularly useful for analyzing big data and holds substantial promise for future research [329-331], particularly for identifying predictors of health outcomes in farming populations [332]. To date, AI has been underused, with only 2 studies using it, 1 identifying occupational injuries in agriculture [208] and 1 reviewing the development of chronic kidney disease risk prediction models [209]. Incorporating AI, along with cohort enrichment and interdisciplinary expert interpretation, could open new avenues for research.

Many studies continue to examine agriculture as a broad category, highlighting the need for more detailed investigations into specific farming activities and tasks [147,152,188,201,248,249]. Our analysis reveals major research gaps in understanding environmental and occupational exposures among farming populations, particularly with regard to emerging concerns such as per- and polyfluoroalkyl substances, biological agents, micro- and nanoplastics, and the impact of climate change. Climate change is a critical issue for agriculture, as it may drive shifts in pests, diseases, and farming practices [274,333-339]. Parental exposure appears to be a theme that will soon become a hot topic. Furthermore, research is needed to explore the farming exposome, particularly focusing on the “mixture effects” of multiple simultaneous exposures [340,341]. Omics data, which have not been used in farming population studies to date, represent a promising avenue for future research because genetics and metabolism were found to be a niche theme. Omics data refer to the large-scale datasets generated from various omics technologies that analyze biological molecules (eg, genomics, transcriptomics, proteomics, and metabolomics), which provide comprehensive insights into different biological layers and processes [11,342,343].

To enhance the characterization of farming exposome research using keyword analysis, there is a pressing need for standardized keyword reporting. We advocate for the development of a standardized approach to reporting keywords in scientific journals, including defining a minimum set of information (eg, study type, health outcome, population studied, data sources, and positive, negative, or null associations) and adopting a list of standardized terms. Although challenging, this approach would improve literature searches, make data more comparable and FAIR [228-230], and lead to more efficient, frugal (less time and energy spent to identify relevant information), and accurate synthesis of the scientific literature, such as in reviews and bibliometric analyses.

The prominence of topics such as cancer, neurodegenerative diseases, mortality, injuries, and mental health issues underscores the need for targeted prevention strategies. The thematic map analysis indicates that reproductive disorders (eg, birth defects, endometriosis, and infertility) are on the verge of becoming a central research focus. Emerging and understudied health conditions, including ocular disorders, autoimmune diseases (eg, inflammatory bowel disease and rheumatoid arthritis), sleep disorders (eg, sleep apnea), cardiovascular diseases, and musculoskeletal conditions (eg, low-back pain), warrant increased attention and further research. Aging-related health issues, such as frailty, also represent promising avenues for future research, particularly given the growing aging population and associated health care challenges [24].

Limitations

The findings of this review should be considered in view of their limitations. Because of time and resource constraints, a single screening approach was used, with only 1 author (PP) conducting the review and bibliometric analysis. While single screening is an efficient use of time and resources, there is a higher risk of missing relevant studies than when using dual screening [344,345]. However, when completed by an experienced reviewer familiar with the research topic, the proportion of missed studies is limited and estimated to be around 3% [344]. Therefore, we cannot exclude the possibility that some studies may have been missed. Nevertheless, we are confident that none of these methodological limitations would change the overall conclusions of this work. Our restriction on articles published in English and French may have inadvertently excluded potentially relevant publications. We cannot exclude the possibility that publications using AHDs for addressing health issues in farming populations may have been missed if there was no mention of AHD in the publications’ titles and abstracts. However, it is important to mention that our search strategy was similar to recent reviews that specifically examined the use of AHDs for population-based research [93,96,103,104]. We further broadened our search by including synonyms to improve the comprehensiveness of our literature search. Some details and specificities on the AHDs and other digital data used may be limited because only information reported in each study was used. Shortcomings inherent to bibliometric analysis cannot be excluded. Some authors may have duplicate names, and namesakes could exist. This limitation could not be prevented as a unique author identifier (eg, Open Researcher and Contributor ID number) was not available. Self-citation could not be identified.

While our keyword analysis helped map the farming exposome in AHD-based public health research, this profile is incomplete and potentially biased. Because our review focused on AHD-based studies, we likely missed relevant epidemiological studies not using AHDs, leaving gaps in our understanding of the complete farming exposome across public health. In addition, the variability in keyword reporting practices across journals introduced bias into our keyword analysis. Some journals limit the number of keywords, and the lack of standardized keyword ontologies adds further variability. To mitigate this bias, we manually harmonized the keywords (eg, the use of 1 unique term for a given entity). While this approach is time-consuming, it allows for a more accurate analysis. For instance, if this approach was not performed, the same entity could be designated by various terms that would have been considered separate entities or terms, potentially resulting in underestimates (eg, in the number of publications). Despite these challenges, the findings from our scoping review were consistent with the keyword analysis.

Notwithstanding the aforementioned limitations, most of which are inherent to all scoping reviews and bibliometric analyses [93,96,103,104,158-160], we are confident that our findings can provide a comprehensive picture of what has been published until now (the current state of research and general directions) regarding the use of AHDs for addressing health issues in farming populations. This study may lay the groundwork for researchers to quickly identify research priorities and emerging research directions investigating health issues in farming populations using AHDs.

Conclusions

Technological advancements have greatly increased the volume of research data available, positioning AHDs as critical resources for population-based public health studies [41]. Our review underscores the broad public health implications of AHDs, providing actionable insights for researchers, physicians, and policy makers (Textbox 3). Addressing the identified research gaps is crucial to comprehensively understanding health risks in farming populations.

The insights derived from AHDs can inform meaningful recommendations for policy makers and guide future research directions, ultimately aiding health services and health policy development. Our findings underscore the necessity of comprehensive, interdisciplinary approaches to better understand and mitigate the health risks encountered by farming populations. Such efforts will improve data comparability and research quality while also supporting the formulation of targeted prevention strategies. This, in turn, will enhance health outcomes for farming populations and promote the sustainability of agriculture in an increasingly dynamic environment. The findings from this review offer insights that are not only relevant to farming populations but also potentially generalizable to other populations.

Textbox 3. Take-home messages.

Farming population

Research focusing on low- and middle-income countries, as well as on underrepresented subgroups within farming communities (eg, women, children, and contingent workers), remains insufficiently developed. These areas warrant further investigation to ensure more comprehensive insights.

Administrative health database (AHD) use

The use of AHDs in public health research among farming populations is expanding, offering major potential to enhance epidemiological studies and inform public health decisions. Promoting AHD-based research alongside the integration of other secondary data and artificial intelligence approaches represents a promising direction for future exploration. There is also a need to promote findable, accessible, interoperable, and reusable principles. Creating an AHD catalog or inventory could be a solution that would allow researchers to conduct a “datagraphic search” akin to traditional bibliographic research.

Farming exposure

Published studies on farming-related exposures often rely on broad proxies, such as job titles, neglecting to capture the nuances of specific agricultural tasks. While pesticide exposure remains a predominant research focus, emerging concerns, such as per- and polyfluoroalkyl substances, biological agents, micro or nanoplastics, and the effects of climate change, require urgent attention. The farming exposome remains underexplored despite its potential to uncover important associations between risk factors and a diverse range of health outcomes.

Health outcomes

Cancer, respiratory diseases, neurodegenerative disorders, and mental health issues are among the most frequently studied health outcomes in farming populations. However, significant gaps exist in understanding other critical conditions, such as cardiovascular diseases, reproductive disorders, ocular conditions, autoimmune diseases, musculoskeletal disorders, age-related health issues, and microbiome impacts. Addressing these overlooked areas is essential for a more complete understanding of the health risks faced by farming communities.

Acknowledgments

This work was partly funded by MIAI@Grenoble Alpes (Multidisciplinary Institute in Artificial Intelligence at Grenoble Alpes; grant ANR-19-P3IA-0003, 2019) and the French National Research Agency within the framework of the Investissements d’avenir program (grants ANR‐10‐AIRT‐05, 2012; ANR‐15‐IDEX‐02, 2016). The funding sources had no role in the study design and conduct of the study; collection, management, analysis, and interpretation of data; writing of the report; preparation, review, or approval of the paper; or decision to submit the paper for publication. The authors were not precluded from accessing data in the study, and all authors took final responsibility for the decision to submit for publication.

Data Availability

The datasets generated and analyzed during this study are available from the corresponding author upon reasonable request and also in the multimedia appendices.

Authors' Contributions

PP was involved in the conceptualization, methodology, software selection, validation, formal analysis, investigation, data curation, writing the original draft and reviewing and editing, visualization, project administration, and funding acquisition of the study. NV was involved in the conceptualization, reviewing and editing, resource collection, and funding acquisition of the study.

All authors meet all 4 criteria for authorship in the International Committee of Medical Journal Editors recommendations. All authors affirm that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any potential discrepancies from the study as originally planned (and, if relevant, registered) have been explained.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Supplementary materials.

DOCX File , 8778 KB

Multimedia Appendix 2

Publications included and analyzed.

XLSX File (Microsoft Excel File), 80 KB

Multimedia Appendix 3

Co-occurrence between exposome-related keywords and health event-related keywords.

XLSX File (Microsoft Excel File), 39 KB

Multimedia Appendix 4

PRISMA-ScR checklist.

PDF File (Adobe PDF File), 186 KB

Multimedia Appendix 5

BIBLIO checklist for reporting the bibliometric reviews of the biomedical literature.

PDF File (Adobe PDF File), 270 KB

  1. Azari R, Borisch B. What is public health? A scoping review. Arch Public Health. May 10, 2023;81(1):86. [FREE Full text] [CrossRef] [Medline]
  2. Velmovitsky PE, Kirolos M, Alencar P, Leatherdale S, Cowan D, Morita PP. Leveraging mHealth technologies for public health. JMIR Public Health Surveill. Sep 12, 2024;10:e49719. [FREE Full text] [CrossRef] [Medline]
  3. Barouki R, Lévi Y. Rapport 24-02. L’exposome, une contribution majeure pour la prévention. Bull Acad Natl Med. May 2024;208(5):523-535. [CrossRef]
  4. Barouki R, Audouze K, Becker C, Blaha L, Coumoul X, Karakitsios S, et al. The exposome and toxicology: a win-win collaboration. Toxicol Sci. Feb 28, 2022;186(1):1-11. [FREE Full text] [CrossRef] [Medline]
  5. Bennett B, Workman T, Smith MN, Griffith WC, Thompson B, Faustman EM. Characterizing the neurodevelopmental pesticide exposome in a children's agricultural cohort. Int J Environ Res Public Health. Feb 25, 2020;17(5):1479. [FREE Full text] [CrossRef] [Medline]
  6. Goodrich JA, Wang H, Jia Q, Stratakis N, Zhao Y, Maitre L, et al. Integrating Multi-Omics with environmental data for precision health: a novel analytic framework and case study on prenatal mercury induced childhood fatty liver disease. Environ Int. Aug 2024;190:108930. [FREE Full text] [CrossRef] [Medline]
  7. Haddad N, Andrianou XD, Makris KC. A scoping review on the characteristics of human exposome studies. Curr Pollution Rep. Nov 28, 2019;5(4):378-393. [CrossRef]
  8. Merra G, Gualtieri P, La Placa G, Frank G, Della Morte D, De Lorenzo A, et al. The relationship between exposome and microbiome. Microorganisms. Jul 08, 2024;12(7):1386. [FREE Full text] [CrossRef] [Medline]
  9. Pandics T, Major D, Fazekas-Pongor V, Szarvas Z, Peterfi A, Mukli P, et al. Exposome and unhealthy aging: environmental drivers from air pollution to occupational exposures. Geroscience. Dec 09, 2023;45(6):3381-3408. [FREE Full text] [CrossRef] [Medline]
  10. Petit P. Toxicological and exposure database inventory: a review. Int J Hyg Environ Health. Sep 2022;246:114055. [CrossRef] [Medline]
  11. Price EJ, Vitale CM, Miller GW, David A, Barouki R, Audouze K, et al. Merging the exposome into an integrated framework for "omics" sciences. iScience. Mar 18, 2022;25(3):103976. [FREE Full text] [CrossRef] [Medline]
  12. Santos S, Maitre L, Warembourg C, Agier L, Richiardi L, Basagaña X, et al. Applying the exposome concept in birth cohort research: a review of statistical approaches. Eur J Epidemiol. Mar 27, 2020;35(3):193-204. [FREE Full text] [CrossRef] [Medline]
  13. Siroux V, Agier L, Slama R. The exposome concept: a challenge and a potential driver for environmental health research. Eur Respir Rev. Jun 31, 2016;25(140):124-129. [FREE Full text] [CrossRef] [Medline]
  14. Tagliaferro S, Maio S, Pirona F, Stanisci I, Sarno G, Silvi P, EarlyFOOD, HEALS EXHES, et al. Assessing external exposome by implementing an environmental data management system using open data. Sci Rep. Jul 26, 2024;14(1):17142. [FREE Full text] [CrossRef] [Medline]
  15. Vineis P, Barouki R. The exposome as the science of social-to-biological transitions. Environ Int. Jul 2022;165:107312. [FREE Full text] [CrossRef] [Medline]
  16. Wei X, Huang Z, Jiang L, Li Y, Zhang X, Leng Y, et al. Charting the landscape of the environmental exposome. Imeta. Dec 02, 2022;1(4):e50. [CrossRef] [Medline]
  17. Wild CP. The exposome: from concept to utility. Int J Epidemiol. Feb 2012;41(1):24-32. [CrossRef] [Medline]
  18. Zhang P, Carlsten C, Chaleckis R, Hanhineva K, Huang M, Isobe T, et al. Defining the scope of exposome studies and research needs from a multidisciplinary perspective. Environ Sci Technol Lett. Oct 12, 2021;8(10):839-852. [FREE Full text] [CrossRef] [Medline]
  19. Ibanez A, Melloni L, Świeboda P, Hynes W, Ikiz B, Ayadi R, et al. Neuroecological links of the exposome and one health. Neuron. Jun 19, 2024;112(12):1905-1910. [FREE Full text] [CrossRef] [Medline]
  20. Liang KH, Colombijn JM, Verhaar MC, Ghannoum M, Timmermans EJ, Vernooij RW. The general external exposome and the development or progression of chronic kidney disease: a systematic review and meta-analyses. Environ Pollut. Oct 01, 2024;358:124509. [FREE Full text] [CrossRef] [Medline]
  21. Canfell OJ, Littlewood R, Burton-Jones A, Sullivan C. Digital health and precision prevention: shifting from disease-centred care to consumer-centred health. Aust Health Rev. Jun 10, 2022;46(3):279-283. [CrossRef] [Medline]
  22. Mess F, Blaschke S, Schick TS, Friedrich J. Precision prevention in worksite health-a scoping review on research trends and gaps. PLoS One. Jun 10, 2024;19(6):e0304951. [FREE Full text] [CrossRef] [Medline]
  23. Wang Y, Zhu M, Ma H, Shen H. Polygenic risk scores: the future of cancer risk prediction, screening, and precision prevention. Med Rev (2021). Dec 2021;1(2):129-149. [FREE Full text] [CrossRef] [Medline]
  24. Kalia V, Belsky DW, Baccarelli AA, Miller GW. An exposomic framework to uncover environmental drivers of aging. Exposome. 2022;2(1):osac002. [FREE Full text] [CrossRef] [Medline]
  25. Chung MK, House JS, Akhtari FS, Makris KC, Langston MA, Islam KT, et al. Members of the Exposomics Consortium. Decoding the exposome: data science methodologies and implications in exposome-wide association studies (ExWASs). Exposome. 2024;4(1):osae001. [FREE Full text] [CrossRef] [Medline]
  26. Andreu-Perez J, Poon CC, Merrifield RD, Wong ST, Yang GZ. Big data for health. IEEE J Biomed Health Inform. Jul 2015;19(4):1193-1208. [FREE Full text] [CrossRef] [Medline]
  27. Li C, Huang M. Environmental sustainability in the age of big data: opportunities and challenges for business and industry. Environ Sci Pollut Res Int. Dec 03, 2023;30(56):119001-119015. [CrossRef] [Medline]
  28. Pastorino R, De Vito C, Migliara G, Glocker K, Binenbaum I, Ricciardi W, et al. Benefits and challenges of Big Data in healthcare: an overview of the European initiatives. Eur J Public Health. Oct 01, 2019;29(Supplement_3):23-27. [FREE Full text] [CrossRef] [Medline]
  29. Flahault A, Geissbuhler A, Guessous I, Guérin P, Bolon I, Salathé M, et al. Precision global health in the digital age. Swiss Med Wkly. Apr 07, 2017;147(1314):w14423. [FREE Full text] [CrossRef] [Medline]
  30. Manaouil C, Chamot S, Petit P. Le médecin confronté à l’IA (Intelligence artificielle): Éthique et responsabilité. Méd Droit. Jun 2024;2024(186):50-66. [CrossRef]
  31. Samuel G, Lucassen AM. The environmental sustainability of data-driven health research: a scoping review. Digit Health. Jul 05, 2022;8:20552076221111297. [FREE Full text] [CrossRef] [Medline]
  32. Adam NR, Wieder R, Ghosh D. Data science, learning, and applications to biomedical and health sciences. Ann N Y Acad Sci. Jan 25, 2017;1387(1):5-11. [CrossRef] [Medline]
  33. Iyamu I, Xu AX, Gómez-Ramírez O, Ablona A, Chang H, Mckee G, et al. Defining digital public health and the role of digitization, digitalization, and digital transformation: scoping review. JMIR Public Health Surveill. Nov 26, 2021;7(11):e30399. [FREE Full text] [CrossRef] [Medline]
  34. Mooney SJ, Pejaver V. Big data in public health: terminology, machine learning, and privacy. Annu Rev Public Health. Apr 01, 2018;39:95-112. [FREE Full text] [CrossRef] [Medline]
  35. Wesson P, Hswen Y, Valdes G, Stojanovski K, Handley MA. Risks and opportunities to ensure equity in the application of big data research in public health. Annu Rev Public Health. Apr 05, 2022;43(1):59-78. [FREE Full text] [CrossRef] [Medline]
  36. Iyamu I, Gómez-Ramírez O, Xu AX, Chang H, Watt S, Mckee G, et al. Challenges in the development of digital public health interventions and mapped solutions: findings from a scoping review. Digit Health. 2022;8:20552076221102255. [CrossRef] [Medline]
  37. Iyamu I, McKee G, Haag D, Gilbert M. Defining the role of digital public health in the evolving digital health landscape: policy and practice implications in Canada. Health Promot Chronic Dis Prev Can. Feb 2024;44(2):66-69. [FREE Full text] [CrossRef] [Medline]
  38. Khan S, Kandukuri DK, Subramaniyan EU, MohanaSundaram A. Harnessing the untapped potential of digital twin technology in digital public health interventions. Explor Digit Health Technol. Nov 1, 2023;5(4):11-16. [CrossRef]
  39. Maaß L, Pan CC, Freye M. Mapping digital public health interventions among existing digital technologies and internet-based interventions to maintain and improve population health in practice: protocol for a scoping review. JMIR Res Protoc. Mar 31, 2022;11(3):e33404. [FREE Full text] [CrossRef] [Medline]
  40. Odone A, Buttigieg S, Ricciardi W, Azzopardi-Muscat N, Staines A. Public health digitalization in Europe. Eur J Public Health. Oct 01, 2019;29(Supplement_3):28-35. [FREE Full text] [CrossRef] [Medline]
  41. Wienert J, Jahnel T, Maaß L. What are digital public health interventions? First steps toward a definition and an intervention classification framework. J Med Internet Res. Jun 28, 2022;24(6):e31921. [FREE Full text] [CrossRef] [Medline]
  42. Burgun A, Bernal-Delgado E, Kuchinke W, van Staa T, Cunningham J, Lettieri E, et al. Health data for public health: towards new ways of combining data sources to support research efforts in Europe. Yearb Med Inform. Aug 19, 2017;26(01):235-240. [CrossRef]
  43. Canfell OJ, Davidson K, Sullivan C, Eakin E, Burton-Jones A. Data sources for precision public health of obesity: a scoping review, evidence map and use case in Queensland, Australia. BMC Public Health. Mar 24, 2022;22(1):584. [FREE Full text] [CrossRef] [Medline]
  44. Canfell OJ, Davidson K, Woods L, Sullivan C, Cocoros NM, Klompas M, et al. Precision public health for non-communicable diseases: an emerging strategic roadmap and multinational use cases. Front. Public Health. Apr 8, 2022;10:854525. [FREE Full text] [CrossRef] [Medline]
  45. Sielemann K, Hafner A, Pucker B. The reuse of public datasets in the life sciences: potential risks and rewards. PeerJ. 2020;8:e9954. [FREE Full text] [CrossRef] [Medline]
  46. Deal JA, Betz J, Lin FR, Reed NS. Interpreting results from epidemiologic studies. Semin Hear. Feb 15, 2021;42(1):3-9. [FREE Full text] [CrossRef] [Medline]
  47. DiPietro NA. Methods in epidemiology: observational study designs. Pharmacotherapy. Oct 06, 2010;30(10):973-984. [CrossRef] [Medline]
  48. Cancer epidemiology: principles and methods. International Agency for Research on Cancer. 1999. URL: https:/​/publications.​iarc.fr/​_publications/​media/​download/​4106/​a72983d2524d630f6ad0539f747e4780447f1ae3.​pdf [accessed 2024-04-29]
  49. Munnangi S, Boktor SW. Epidemiology of study design. StatPearls. 2014. URL: https://www.ncbi.nlm.nih.gov/books/NBK470342/ [accessed 2024-04-29]
  50. Röhrig B, du Prel JB, Wachtlin D, Blettner M. Types of study in medical research: part 3 of a series on evaluation of scientific publications. Dtsch Arztebl Int. Apr 2009;106(15):262-268. [FREE Full text] [CrossRef] [Medline]
  51. Himali JJ, Baril AA, Cavuoto MG, Yiallourou S, Wiedner CD, Himali D, et al. Association between slow-wave sleep loss and incident dementia. JAMA Neurol. Dec 01, 2023;80(12):1326-1333. [CrossRef] [Medline]
  52. Mahmood SS, Levy D, Vasan RS, Wang TJ. The Framingham Heart Study and the epidemiology of cardiovascular disease: a historical perspective. Lancet. Mar 15, 2014;383(9921):999-1008. [FREE Full text] [CrossRef] [Medline]
  53. Salines G, De LC. Les cohortes : intérêt, role et position de l'InVS. In: Saint-Maurice (Fra) : Institut de veille sanitaire, janvier 2010, 20 p. Princeton, NJ. Princeton University Press; 2018.
  54. Zins M, Goldberg M. L’apport des bases de données d’origine administrative aux cohortes épidémiologiques : l’exemple de la cohorte Constances. Statistique et Société. 2014;2(2):43-48. [FREE Full text]
  55. Chen W, Wang X, Chen J, You C, Ma L, Zhang W, et al. Household air pollution, adherence to a healthy lifestyle, and risk of cardiometabolic multimorbidity: results from the China health and retirement longitudinal study. Sci Total Environ. Jan 10, 2023;855:158896. [CrossRef] [Medline]
  56. Zhao Y, Hu Y, Smith JP, Strauss J, Yang G. Cohort profile: the China Health and Retirement Longitudinal Study (CHARLS). Int J Epidemiol. Feb 12, 2014;43(1):61-68. [FREE Full text] [CrossRef] [Medline]
  57. Kesse-Guyot E, Assmann K, Andreeva V, Castetbon K, Méjean C, Touvier M, et al. Metadata correction of: lessons learned from methodological validation research in e-epidemiology. JMIR Public Health Surveill. Oct 31, 2016;2(2):e160. [FREE Full text] [CrossRef] [Medline]
  58. Quantin C. Bases de données médicoadministratives et épidémiologie : intérêt, limites. Revue d'Épidémiologie et de Santé Publique. Sep 2008;56(5):S254. [CrossRef]
  59. Babik KR, Shockey TM, Moore LL, Wurzelbacher SJ. Standardizing industrial hygiene data collection forms used by workers' compensation insurers. J Occup Environ Hyg. Sep 04, 2018;15(9):676-685. [FREE Full text] [CrossRef] [Medline]
  60. Bosward R, Braunack-Mayer A, Frost E, Carter S. Mapping precision public health definitions, terminology and applications: a scoping review protocol. BMJ Open. Feb 23, 2022;12(2):e058069. [FREE Full text] [CrossRef] [Medline]
  61. Canfell OJ, Kodiyattu Z, Eakin E, Burton-Jones A, Wong I, Macaulay C, et al. Real-world data for precision public health of noncommunicable diseases: a scoping review. BMC Public Health. Nov 24, 2022;22(1):2166. [FREE Full text] [CrossRef] [Medline]
  62. Chao K, Sarker MN, Ali I, Firdaus RB, Azman A, Shaed MM. Big data-driven public health policy making: potential for the healthcare industry. Heliyon. Sep 2023;9(9):e19681. [FREE Full text] [CrossRef] [Medline]
  63. Dolley S. Big Data's role in precision public health. Front Public Health. Mar 7, 2018;6:68. [FREE Full text] [CrossRef] [Medline]
  64. Doyle CM, Lix LM, Hemmelgarn BR, Paterson JM, Renoux C. Data variability across Canadian administrative health databases: differences in content, coding, and completeness. Pharmacoepidemiol Drug Saf. Jan 10, 2020;29 Suppl 1(S1):68-77. [CrossRef] [Medline]
  65. Gallini A, Moisan F, Maura G, Carcaillon-Bentata L, Leray E, Haesebaert J, et al. [Identification of neurodegenerative diseases in administrative databases in France: a systematic review of the literature]. Rev Epidemiol Sante Publique. Oct 2017;65 Suppl 4:S183-S197. [CrossRef] [Medline]
  66. Khoury MJ, Iademarco MF, Riley WT. Precision public health for the era of precision medicine. Am J Prev Med. Mar 2016;50(3):398-401. [FREE Full text] [CrossRef] [Medline]
  67. McCullough LE, Maliniak ML, Amin AB, Baker JM, Baliashvili D, Barberio J, et al. Epidemiology beyond its limits. Sci Adv. Jun 10, 2022;8(23):eabn3328. [FREE Full text] [CrossRef] [Medline]
  68. Näher AF, Vorisek CN, Klopfenstein SA, Lehne M, Thun S, Alsalamah S, et al. Secondary data for global health digitalisation. Lancet Digit Health. Feb 2023;5(2):e93-101. [FREE Full text] [CrossRef] [Medline]
  69. Koesten L, Vougiouklis P, Simperl E, Groth P. Dataset reuse: toward translating principles to practice. Patterns (N Y). Nov 13, 2020;1(8):100136. [FREE Full text] [CrossRef] [Medline]
  70. Hu H, Galea S, Rosella L, Henry D. Big data and population health: focusing on the health impacts of the social, physical, and economic environment. Epidemiology. Nov 2017;28(6):759-762. [CrossRef] [Medline]
  71. Shaw RJ, Harron KL, Pescarini JM, Pinto Junior EP, Allik M, Siroky AN, et al. Biases arising from linked administrative data for epidemiological research: a conceptual framework from registration to analyses. Eur J Epidemiol. Dec 05, 2022;37(12):1215-1224. [FREE Full text] [CrossRef] [Medline]
  72. Subramanian M, Wojtusciszyn A, Favre L, Boughorbel S, Shan J, Letaief KB, et al. Precision medicine in the era of artificial intelligence: implications in chronic disease management. J Transl Med. Dec 09, 2020;18(1):472. [FREE Full text] [CrossRef] [Medline]
  73. Velmovitsky PE, Bevilacqua T, Alencar P, Cowan D, Morita PP. Convergence of precision medicine and public health into precision public health: toward a big data perspective. Front Public Health. Apr 6, 2021;9:561873. [FREE Full text] [CrossRef] [Medline]
  74. Zhang H, Hu H, Diller M, Hogan WR, Prosperi M, Guo Y, et al. Semantic standards of external exposome data. Environ Res. Jun 2021;197:111185. [FREE Full text] [CrossRef] [Medline]
  75. Daniels K, Bonnechère B. Harnessing digital health interventions to bridge the gap in prevention for older adults. Front Public Health. Jan 8, 2023;11:1281923. [FREE Full text] [CrossRef] [Medline]
  76. Liu F, Panagiotakos D. Real-world data: a brief review of the methods, applications, challenges and opportunities. BMC Med Res Methodol. Nov 05, 2022;22(1):287. [FREE Full text] [CrossRef] [Medline]
  77. Maaß L, Freye M, Pan CC, Dassow HH, Niess J, Jahnel T. The definitions of health apps and medical apps from the perspective of public health and law: qualitative analysis of an interdisciplinary literature overview. JMIR Mhealth Uhealth. Oct 31, 2022;10(10):e37980. [FREE Full text] [CrossRef] [Medline]
  78. Mooney SJ, Garber MD. Sampling and sampling frames in big data epidemiology. Curr Epidemiol Rep. Mar 2, 2019;6(1):14-22. [FREE Full text] [CrossRef] [Medline]
  79. Pearson AL, Tribby C, Brown CD, Yang JA, Pfeiffer K, Jankowska MM. Systematic review of best practices for GPS data usage, processing, and linkage in health, exposure science and environmental context research. BMJ Open. Feb 01, 2024;14(2):e077036. [FREE Full text] [CrossRef] [Medline]
  80. Prosperi M, Min JS, Bian J, Modave F. Big data hurdles in precision medicine and precision public health. BMC Med Inform Decis Mak. Dec 29, 2018;18(1):139. [FREE Full text] [CrossRef] [Medline]
  81. Safran C. Update on data reuse in health care. Yearb Med Inform. Aug 2017;26(1):24-27. [FREE Full text] [CrossRef] [Medline]
  82. Severo M, Feredj A, Romele A. Soft data and public policy: can social media offer alternatives to official statistics in urban policymaking? Policy Internet. Jul 19, 2016;8(3):354-372. [CrossRef]
  83. Shakeri Hossein Abad Z, Butler GP, Thompson W, Lee J. Physical activity, sedentary behavior, and sleep on Twitter: multicountry and fully labeled public data set for digital public health surveillance research. JMIR Public Health Surveill. Feb 14, 2022;8(2):e32355. [FREE Full text] [CrossRef] [Medline]
  84. Singh T, Roberts K, Cohen T, Cobb N, Wang J, Fujimoto K, et al. Social media as a research tool (SMaaRT) for risky behavior analytics: methodological review. JMIR Public Health Surveill. Nov 30, 2020;6(4):e21660. [FREE Full text] [CrossRef] [Medline]
  85. Zins M, Cuggia M, Goldberg M. Les données de santé en France - Abondantes mais complexes. Med Sci (Paris). Feb 16, 2021;37(2):179-184. [FREE Full text] [CrossRef] [Medline]
  86. Motahari-Nezhad H, Fgaier M, Mahdi Abid M, Péntek M, Gulácsi L, Zrubka Z. Digital biomarker-based studies: scoping review of systematic reviews. JMIR Mhealth Uhealth. Oct 24, 2022;10(10):e35722. [FREE Full text] [CrossRef] [Medline]
  87. Ayilara OF, Platt RW, Dahl M, Coulombe J, Ginestet PG, Chateau D, et al. Generating synthetic data from administrative health records for drug safety and effectiveness studies. Int J Popul Data Sci. Nov 27, 2023;8(1):2176. [FREE Full text] [CrossRef] [Medline]
  88. Fadel M, Petot J, Gourraud PA, Descatha A. Flexibility of a large blindly synthetized avatar database for occupational research: example from the CONSTANCES cohort for stroke and knee pain. PLoS One. Jul 31, 2024;19(7):e0308063. [FREE Full text] [CrossRef] [Medline]
  89. Giuffrè M, Shung DL. Harnessing the power of synthetic data in healthcare: innovation, application, and privacy. NPJ Digit Med. Oct 09, 2023;6(1):186. [FREE Full text] [CrossRef] [Medline]
  90. Kamel Boulos MN, Zhang P. Digital twins: from personalised medicine to precision public health. J Pers Med. Jul 29, 2021;11(8):745. [FREE Full text] [CrossRef] [Medline]
  91. Vallevik VB, Babic A, Marshall SE, Elvatun S, Brøgger HB, Alagaratnam S, et al. Can I trust my fake data - a comprehensive quality assessment framework for synthetic tabular data in healthcare. Int J Med Inform. May 2024;185:105413. [FREE Full text] [CrossRef] [Medline]
  92. Chamot S, Manaouil C, Fantoni S, Frimat P, Al-Salameh A, Desailloud R, et al. Système national des données de santé (SNDS) : perspectives en santé au travail et environnementale. Arch Mal Prof Environ. Nov 2023;84(6):101875. [CrossRef]
  93. Cohen S, Gilutz H, Marelli AJ, Iserin L, Benis A, Bonnet D, et al. Administrative health databases for addressing emerging issues in adults with CHD: a systematic review. Cardiol Young. Jun 29, 2018;28(6):844-853. [CrossRef] [Medline]
  94. Fraser HS, Mugisha M, Bacher I, Ngenzi JL, Seebregts C, Umubyeyi A, et al. Factors influencing data quality in electronic health record systems in 50 health facilities in Rwanda and the role of clinical alerts: cross-sectional observational study. JMIR Public Health Surveill. Jul 03, 2024;10:e49127. [FREE Full text] [CrossRef] [Medline]
  95. Gavrielov-Yusim N, Friger M. Use of administrative medical databases in population-based research. J Epidemiol Community Health. Mar 2014;68(3):283-287. [CrossRef] [Medline]
  96. Hinds A, Lix LM, Smith M, Quan H, Sanmartin C. Quality of administrative health databases in Canada: a scoping review. Can J Public Health. Jun 27, 2016;107(1):e56-e61. [FREE Full text] [CrossRef] [Medline]
  97. Howell CR, Zhang L, Clay OJ, Dutton G, Horton T, Mugavero MJ, et al. Social determinants of health phenotypes and cardiometabolic condition prevalence among patients in a large academic health system: latent class analysis. JMIR Public Health Surveill. Aug 07, 2024;10:e53371. [FREE Full text] [CrossRef] [Medline]
  98. Jones KH, Heys S, Tingay KS, Jackson P, Dibben C. The good, the bad, the clunky: improving the use of administrative data for research. Int J Popul Data Sci. Dec 04, 2018;4(1):587. [FREE Full text] [CrossRef] [Medline]
  99. Namulanda G, Qualters J, Vaidyanathan A, Roberts E, Richardson M, Fraser A, et al. Electronic health record case studies to advance environmental public health tracking. J Biomed Inform. Mar 2018;79:98-104. [FREE Full text] [CrossRef] [Medline]
  100. Nicholls SG, Quach P, von Elm E, Guttmann A, Moher D, Petersen I, et al. The REporting of studies conducted using observational routinely-collected health data (RECORD) statement: methods for arriving at consensus and developing reporting guidelines. PLoS One. 2015;10(5):e0125620. [FREE Full text] [CrossRef] [Medline]
  101. Silverwood R, Rajah N, Calderwood L, De Stavola B, Harron K, Ploubidis G. Examining the quality and population representativeness of linked survey and administrative data: guidance and illustration using linked 1958 National Child Development Study and Hospital Episode Statistics data. Int J Popul Data Sci. Jan 09, 2024;9(1):2137. [FREE Full text] [CrossRef] [Medline]
  102. Zuiderwijk A, Türk BO, Brazier F. Identifying the most important facilitators of open research data sharing and reuse in Epidemiology: a mixed-methods study. PLoS One. Feb 8, 2024;19(2):e0297969. [FREE Full text] [CrossRef] [Medline]
  103. Hobensack M, Song J, Scharp D, Bowles KH, Topaz M. Machine learning applied to electronic health record data in home healthcare: a scoping review. Int J Med Inform. Feb 2023;170:104978. [FREE Full text] [CrossRef] [Medline]
  104. Tassi MF, le Meur N, Stéfic K, Grammatico-Guillon L. Performance of French medico-administrative databases in epidemiology of infectious diseases: a scoping review. Front Public Health. May 12, 2023;11:1161550. [FREE Full text] [CrossRef] [Medline]
  105. Yoshida S, Takeuchi M, Tanaka-Mizuno S, Mizuno K, Nakashima M, Fukasawa T, et al. Clinical epidemiology and pharmacoepidemiology studies with real-world databases. Proc Jpn Acad Ser B Phys Biol Sci. 2022;98(10):517-528. [FREE Full text] [CrossRef] [Medline]
  106. Howren A, Aviña-Zubieta JA, Puyat JH, Esdaile JM, Da Costa D, De Vera MA. Defining depression and anxiety in individuals with rheumatic diseases using administrative health databases: a systematic review. Arthritis Care Res (Hoboken). Feb 09, 2020;72(2):243-255. [CrossRef] [Medline]
  107. Kim MK, Rouphael C, McMichael J, Welch N, Dasarathy S. Challenges in and opportunities for electronic health record-based data analysis and interpretation. Gut Liver. Mar 15, 2024;18(2):201-208. [FREE Full text] [CrossRef] [Medline]
  108. Lee S, Xu Y, D Apos Souza AG, Martin EA, Doktorchik C, Zhang Z, et al. Unlocking the potential of electronic health records for health research. Int J Popul Data Sci. Jan 30, 2020;5(1):1123. [FREE Full text] [CrossRef] [Medline]
  109. Pletscher F, Mändli Lerch K, Glinz D. Willingness to share anonymised routinely collected clinical health data in Switzerland: a cross-sectional survey. Swiss Med Wkly. Jun 20, 2022;152(2324):w30182. [FREE Full text] [CrossRef] [Medline]
  110. Yuan H, Ali MS, Brouwer ES, Girman CJ, Guo JJ, Lund JL, et al. ISPE Comparative Effectiveness Research Special Interest Group. Real-world evidence: what it is and what it can tell us according to the International Society for Pharmacoepidemiology (ISPE) Comparative Effectiveness Research (CER) Special Interest Group (SIG). Clin Pharmacol Ther. Aug 07, 2018;104(2):239-241. [CrossRef] [Medline]
  111. Andrew A, Zhou J, Gui J, Harrison A, Shi X, Li M, et al. Pesticides applied to crops and amyotrophic lateral sclerosis risk in the U.S. Neurotoxicology. Dec 2021;87:128-135. [FREE Full text] [CrossRef] [Medline]
  112. Chazard E, Ficheur G, Caron A, Lamer A, Labreuche J, Cuggia M, et al. Secondary use of healthcare structured data: the challenge of domain-knowledge based extraction of features. Stud Health Technol Inform. 2018;255:15-19. [Medline]
  113. Grath-Lone LM, Jay MA, Blackburn R, Gordon E, Zylbersztejn A, Wiljaars L, et al. What makes administrative data "research-ready"? A systematic review and thematic analysis of published literature. Int J Popul Data Sci. 2022;7(1):1718. [FREE Full text] [CrossRef] [Medline]
  114. Hashimoto RE, Brodt ED, Skelly AC, Dettori JR. Administrative database studies: goldmine or goose chase? Evid Based Spine Care J. Oct 24, 2014;5(2):74-76. [FREE Full text] [CrossRef] [Medline]
  115. Bezin J, Duong M, Lassalle R, Droz C, Pariente A, Blin P, et al. The national healthcare system claims databases in France, SNIIRAM and EGB: powerful tools for pharmacoepidemiology. Pharmacoepidemiol Drug Saf. Aug 24, 2017;26(8):954-962. [CrossRef] [Medline]
  116. Al-Sahab B, Leviton A, Loddenkemper T, Paneth N, Zhang B. Biases in electronic health records data for generating real-world evidence: an overview. J Healthc Inform Res. Mar 14, 2024;8(1):121-139. [CrossRef] [Medline]
  117. Bastarache L, Brown JS, Cimino JJ, Dorr DA, Embi PJ, Payne PR, et al. Developing real-world evidence from real-world data: transforming raw data into analytical datasets. Learn Health Syst. Jan 14, 2022;6(1):e10293. [FREE Full text] [CrossRef] [Medline]
  118. Fang J, Kapral MK, Richards J, Robertson A, Stamplecoski M, Silver FL. The Registry of Canadian Stroke Network : an evolving methodology. Acta Neurol Taiwan. Jun 2011;20(2):77-84. [Medline]
  119. Fillinger S, de la Garza L, Peltzer A, Kohlbacher O, Nahnsen S. Challenges of big data integration in the life sciences. Anal Bioanal Chem. Oct 28, 2019;411(26):6791-6800. [CrossRef] [Medline]
  120. Fonteneau L, Le Meur N, Cohen-Akenine A, Pessel C, Brouard C, Delon F, et al. [The use of administrative health databases in infectious disease epidemiology and public health]. Rev Epidemiol Sante Publique. Oct 2017;65 Suppl 4:S174-S182. [FREE Full text] [CrossRef] [Medline]
  121. Silenou BC, Avalos M, Helmer C, Berr C, Pariente A, Jacqmin-Gadda H. Health administrative data enrichment using cohort information: comparative evaluation of methods by simulation and application to real data. PLoS One. Jan 31, 2019;14(1):e0211118. [FREE Full text] [CrossRef] [Medline]
  122. Wang SV, Schneeweiss S. Data checks before registering study protocols for health care database analyses. JAMA. May 07, 2024;331(17):1445-1446. [CrossRef] [Medline]
  123. Yamana H, Yamasaki K, Jo T, Yatsuhashi H, Yasunaga H. A descriptive analysis of acute viral hepatitis using a database with electronic medical records and claims data. Ann Clin Epidemiol. 2023;5(4):107-112. [FREE Full text] [CrossRef] [Medline]
  124. Gianfrancesco MA, Goldstein ND. A narrative review on the validity of electronic health record-based research in epidemiology. BMC Med Res Methodol. Oct 27, 2021;21(1):234. [FREE Full text] [CrossRef] [Medline]
  125. Denisov NS, Kamenskikh EM, Fedorova OS. Trends in population-based studies: molecular and digital epidemiology (review). Sovrem Tekhnologii Med. Aug 29, 2022;14(4):60-70. [FREE Full text] [CrossRef] [Medline]
  126. Hamm NC, Hamad AF, Wall-Wieler E, Roos LL, Plana-Ripoll O, Lix LM. Multigenerational health research using population-based linked databases: an international review. Int J Popul Data Sci. Oct 07, 2021;6(1):1686. [FREE Full text] [CrossRef] [Medline]
  127. Poulain R, Tarek MF, Beheshti R. Improving fairness in AI models on electronic health records: the case for federated learning methods. FAccT 23 (2023). Jun 2023;2023:1599-1608. [FREE Full text] [CrossRef] [Medline]
  128. Tuppin P, Rudant J, Constantinou P, Gastaldi-Ménager C, Rachas A, de Roquefeuil L, et al. Value of a national administrative database to guide public decisions: from the système national d'information interrégimes de l'Assurance Maladie (SNIIRAM) to the système national des données de santé (SNDS) in France. Rev Epidemiol Sante Publique. Oct 2017;65 Suppl 4:S149-S167. [CrossRef] [Medline]
  129. Casey JA, Schwartz BS, Stewart WF, Adler NE. Using electronic health records for population health research: a review of methods and applications. Annu Rev Public Health. 2016;37:61-81. [FREE Full text] [CrossRef] [Medline]
  130. Chan JM, Carroll MW, Smyth M, Hamilton Z, Evans D, McGrail K, et al. Comparing health administrative and clinical registry data: trends in incidence and prevalence of pediatric inflammatory bowel disease in British Columbia. Clin Epidemiol. 2021;13:81-90. [FREE Full text] [CrossRef] [Medline]
  131. Chowdhury T, Hemmelgarn B. Evidence-based decision-making 6: utilization of administrative databases for health services research. In: Parfrey PS, Barrett BJ, editors. Clinical Epidemiology: Practice and Methods. Cham, Switzerland. Springer; 2015:469-484.
  132. Chishtie J, Sapiro N, Wiebe N, Rabatach L, Lorenzetti D, Leung AA, et al. Use of epic electronic health record system for health care research: scoping review. J Med Internet Res. Dec 15, 2023;25:e51003. [FREE Full text] [CrossRef] [Medline]
  133. Jutte DP, Roos LL, Brownell MD. Administrative record linkage as a tool for public health research. Annu Rev Public Health. 2011;32:91-108. [CrossRef] [Medline]
  134. Mazzali C, Duca P. Use of administrative data in healthcare research. Intern Emerg Med. Jun 25, 2015;10(4):517-524. [CrossRef] [Medline]
  135. McPheeters ML, Sathe NA, Jerome RN, Carnahan RM. Methods for systematic reviews of administrative database studies capturing health outcomes of interest. Vaccine. Dec 30, 2013;31 Suppl 10:K2-K6. [CrossRef] [Medline]
  136. Sato S, Yasunaga H. A review of studies using Japanese nationwide administrative claims databases. Ann Clin Epidemiol. 2023;5(2):58-64. [FREE Full text] [CrossRef] [Medline]
  137. Mues KE, Liede A, Liu J, Wetmore J, Zaha R, Bradbury BD, et al. Use of the Medicare database in epidemiologic and health services research: a valuable source of real-world evidence on the older and disabled populations in the US. Clin Epidemiol. 2017;9:267-277. [FREE Full text] [CrossRef] [Medline]
  138. Bernstein CN. Large registry epidemiology in IBD. Inflamm Bowel Dis. Nov 2017;23(11):1941-1949. [CrossRef] [Medline]
  139. Bloom PP, Tapper EB. The use of administrative data to investigate the population burden of hepatic encephalopathy. J Clin Med. Nov 10, 2020;9(11):3620. [FREE Full text] [CrossRef] [Medline]
  140. Hudson M, Tascilar K, Suissa S. Comparative effectiveness research with administrative health data in rheumatoid arthritis. Nat Rev Rheumatol. Jun 15, 2016;12(6):358-366. [CrossRef] [Medline]
  141. Mitchell E, Waring T, Ahern E, O'Donovan D, O'Reilly D, Bradley DT. Predictors and consequences of homelessness in whole-population observational studies that used administrative data: a systematic review. BMC Public Health. Aug 24, 2023;23(1):1610. [FREE Full text] [CrossRef] [Medline]
  142. Ulrich EH, So G, Zappitelli M, Chanchlani R. A review on the application and limitations of administrative health care data for the study of acute kidney injury epidemiology and outcomes in children. Front Pediatr. Oct 27, 2021;9:742888. [FREE Full text] [CrossRef] [Medline]
  143. Curl CL, Spivak M, Phinney R, Montrose L. Synthetic pesticides and health in vulnerable populations: agricultural workers. Curr Environ Health Rep. Mar 20, 2020;7(1):13-29. [FREE Full text] [CrossRef] [Medline]
  144. Demos K, Sazakli E, Jelastopulu E, Charokopos N, Ellul J, Leotsinidis M. Does farming have an effect on health status? A comparison study in west Greece. Int J Environ Res Public Health. Feb 26, 2013;10(3):776-792. [FREE Full text] [CrossRef] [Medline]
  145. Kachuri L, Harris MA, MacLeod JS, Tjepkema M, Peters PA, Demers PA. Cancer risks in a population-based study of 70,570 agricultural workers: results from the Canadian census health and Environment cohort (CanCHEC). BMC Cancer. May 19, 2017;17(1):343. [FREE Full text] [CrossRef] [Medline]
  146. Ramos AK, Girdžiūtė L, Starič J, Rautianinen RH. Identifying "vulnerable agricultural populations" at risk for occupational injuries and illnesses: a European perspective. J Agromedicine. Jul 27, 2021;26(3):340-345. [CrossRef] [Medline]
  147. Sturm ET, Castro C, Mendez-Colmenares A, Duffy J, Burzynska AZ, Stallones L, et al. Risk factors for brain health in agricultural work: a systematic review. Int J Environ Res Public Health. Mar 13, 2022;19(6):3373. [FREE Full text] [CrossRef] [Medline]
  148. Guindo Y, Parent ME, Richard H, Luce D, Barul C. Expert-based assessment of chemical and physical exposures, and organizational factors, in past agricultural jobs. Environ Res. Dec 15, 2024;263(Pt 3):120238. [FREE Full text] [CrossRef] [Medline]
  149. Komarek AM, De Pinto A, Smith VH. A review of types of risks in agriculture: what we know and what we need to know. Agric Syst. Feb 2020;178:102738. [CrossRef]
  150. Leon ME, Schinasi LH, Lebailly P, Beane Freeman LE, Nordby KC, Ferro G, et al. Pesticide use and risk of non-Hodgkin lymphoid malignancies in agricultural cohorts from France, Norway and the USA: a pooled analysis from the AGRICOH consortium. Int J Epidemiol. Oct 01, 2019;48(5):1519-1535. [FREE Full text] [CrossRef] [Medline]
  151. Pérès K, Matharan F, Allard M, Amieva H, Baldi I, Barberger-Gateau P, et al. Health and aging in elderly farmers: the AMI cohort. BMC Public Health. Jul 27, 2012;12(1):558. [FREE Full text] [CrossRef] [Medline]
  152. Petit P, Chamot S, Al-Salameh A, Cancé C, Desailloud R, Bonneterre V. Farming activity and risk of treated thyroid disorders: insights from the TRACTOR project, a nationwide cohort study. Environ Res. May 15, 2024;249:118458. [FREE Full text] [CrossRef] [Medline]
  153. Tricco AC, Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 02, 2018;169(7):467-473. [FREE Full text] [CrossRef] [Medline]
  154. The Joanna Briggs Institute Reviewers’ Manual 2015 methodology for JBI scoping reviews. The Joanna Briggs Institute. 2015. URL: https://reben.com.br/revista/wp-content/uploads/2020/10/Scoping.pdf [accessed 2024-04-03]
  155. Beydon M, Pinto S, De Rycke Y, Fautrel B, Mariette X, Seror R, et al. Risk of cancer for patients with rheumatoid arthritis versus general population: a national claims database cohort study. Lancet Reg Health Eur. Dec 2023;35:100768. [FREE Full text] [CrossRef] [Medline]
  156. Moysan M. Des archives à la base de données : valeur(s) des données en épidémiologie. Hist Méd Santé. Dec 15, 2022;(22):151-164. [FREE Full text] [CrossRef]
  157. Syed R, Eden R, Makasi T, Chukwudi I, Mamudu A, Kamalpour M, et al. Digital health data quality issues: systematic review. J Med Internet Res. Mar 31, 2023;25:e42615. [FREE Full text] [CrossRef] [Medline]
  158. Cobo MJ, López-Herrera AG, Herrera-Viedma E, Herrera F. An approach for detecting, quantifying, and visualizing the evolution of a research field: a practical application to the Fuzzy Sets Theory field. J Informetr. Jan 2011;5(1):146-166. [CrossRef]
  159. Donthu N, Kumar S, Mukherjee D, Pandey N, Lim WM. How to conduct a bibliometric analysis: an overview and guidelines. J Bus Res. Sep 2021;133:285-296. [CrossRef]
  160. Moral-Muñoz JA, Herrera-Viedma E, Santisteban-Espejo A, Cobo MJ. Software tools for conducting bibliometric analysis in science: an up-to-date review. Prof Inf J. Jan 19, 2020;29(1). [FREE Full text] [CrossRef]
  161. Bienert IR, Oliveira RC, Andrade PB, Caramori CA. Bibliometric indexes, databases and impact factors in cardiology. Rev Bras Cir Cardiovasc. 2015;30(2):254-259. [FREE Full text] [CrossRef] [Medline]
  162. Bihari A, Tripathi S, Deepak A. A review on h-index and its alternative indices. J Inf Sci. May 31, 2021;49(3):624-665. [CrossRef]
  163. Shao Y, Chien TW, Jang FL. The use of radar plots with the Yk-index to identify which authors contributed the most to the journal of Medicine in 2020 and 2021: a bibliometric analysis. Medicine (Baltimore). Nov 11, 2022;101(45):e31033. [FREE Full text] [CrossRef] [Medline]
  164. Kumar S, Kumar S. Trends of collaborative research in journals of oilseeds research (India), 1993-2004. Indian J Agric Library Inf Services. 2008;24:90.
  165. Wu T, Duan Y, Zhang T, Tian W, Liu H, Deng Y. Research trends in the application of artificial intelligence in oncology: a bibliometric and network visualization study. Front Biosci (Landmark Ed). Aug 31, 2022;27(9):254. [FREE Full text] [CrossRef] [Medline]
  166. Wang B, Zhao P. Worldwide research productivity in the field of back pain: a bibliometric analysis. Medicine (Baltimore). Oct 2018;97(40):e11566. [FREE Full text] [CrossRef] [Medline]
  167. Aria M, Cuccurullo C, D’Aniello L, Misuraca M, Spano M. Thematic analysis as a new culturomic tool: the social media coverage on COVID-19 pandemic in Italy. Sustainability. Mar 20, 2022;14(6):3643. [CrossRef]
  168. Petit P, Leroyer A, Chamot S, Fumery M, Bonneterre V. Farming activities and risk of inflammatory bowel disease: a French nationwide population-based cohort study. J Crohns Colitis. Sep 03, 2024;18(9):1415-1429. [CrossRef] [Medline]
  169. Montazeri A, Mohammadi S, M Hesari P, Ghaemi M, Riazi H, Sheikhi-Mobarakeh Z. Preliminary guideline for reporting bibliometric reviews of the biomedical literature (BIBLIO): a minimum requirements. Syst Rev. Dec 15, 2023;12(1):239. [FREE Full text] [CrossRef] [Medline]
  170. Aria M, Cuccurullo C. bibliometrix : an R-tool for comprehensive science mapping analysis. J Informetr. Nov 2017;11(4):959-975. [CrossRef]
  171. Spitzer WO, Hill GB, Chambers LW, Helliwell BE, Murphy HB. The occupation of fishing as a risk factor in cancer of the lip. N Engl J Med. Aug 28, 1975;293(9):419-424. [CrossRef]
  172. Alavanja MC, Samanic C, Dosemeci M, Lubin J, Tarone R, Lynch CF, et al. Use of agricultural pesticides and prostate cancer risk in the Agricultural Health Study cohort. Am J Epidemiol. May 01, 2003;157(9):800-814. [CrossRef] [Medline]
  173. Alavanja MC, Dosemeci M, Samanic C, Lubin J, Lynch CF, Knott C, et al. Pesticides and lung cancer risk in the agricultural health study cohort. Am J Epidemiol. Nov 01, 2004;160(9):876-885. [CrossRef] [Medline]
  174. Andersen A, Barlow L, Engeland A, Kjaerheim K, Lynge E, Pukkala E. Work-related cancer in the Nordic countries. Scand J Work Environ Health. 1999;25 Suppl 2:1-116. [FREE Full text] [Medline]
  175. Andreotti G, Koutros S, Hofmann JN, Sandler DP, Lubin JH, Lynch CF, et al. Glyphosate use and cancer incidence in the agricultural health study. J Natl Cancer Inst. May 01, 2018;110(5):509-516. [FREE Full text] [CrossRef] [Medline]
  176. Elbaz A, Clavel J, Rathouz PJ, Moisan F, Galanaud J, Delemotte B, et al. Professional exposure to pesticides and Parkinson disease. Ann Neurol. Oct 2009;66(4):494-504. [CrossRef] [Medline]
  177. Engel LS, Hill DA, Hoppin JA, Lubin JH, Lynch CF, Pierce J, et al. Pesticide use and breast cancer risk among farmers' wives in the agricultural health study. Am J Epidemiol. Jan 15, 2005;161(2):121-135. [CrossRef] [Medline]
  178. Eriksson M, Karlsson M. Occupational and other environmental factors and multiple myeloma: a population based case-control study. Br J Ind Med. Feb 01, 1992;49(2):95-103. [FREE Full text] [CrossRef] [Medline]
  179. Hemminki K, Saloniemi I, Salonen T, Partanen T, Vainio H. Childhood cancer and parental occupation in Finland. J Epidemiol Community Health. Mar 01, 1981;35(1):11-15. [FREE Full text] [CrossRef] [Medline]
  180. Karjalainen A, Kurppa K, Martikainen R, Klaukka T, Karjalainen J. Work is related to a substantial portion of adult-onset asthma incidence in the Finnish population. Am J Respir Crit Care Med. Aug 15, 2001;164(4):565-568. [CrossRef] [Medline]
  181. Karjalainen A, Martikainen R, Klaukka T, Saarinen K, Uitti J. Risk of asthma among Finnish patients with occupational rhinitis. Chest. Jan 2003;123(1):283-288. [CrossRef] [Medline]
  182. Kristensen P, Andersen A, Irgens LM, Laake P, Bye AS. Incidence and risk factors of cancer among men and women in Norwegian agriculture. Scand J Work Environ Health. Feb 1996;22(1):14-26. [FREE Full text] [CrossRef] [Medline]
  183. Lee WJ, Sandler DP, Blair A, Samanic C, Cross AJ, Alavanja MC. Pesticide use and colorectal cancer risk in the Agricultural Health Study. Int J Cancer. Jul 15, 2007;121(2):339-346. [FREE Full text] [CrossRef] [Medline]
  184. Levecque C, Elbaz A, Clavel J, Richard F, Vidal JS, Amouyel P, et al. Association between Parkinson's disease and polymorphisms in the nNOS and iNOS genes in a community-based case-control study. Hum Mol Genet. Jan 01, 2003;12(1):79-86. [CrossRef] [Medline]
  185. Mills PK, Yang R. Prostate cancer risk in California farm workers. J Occup Environ Med. Mar 2003;45(3):249-258. [CrossRef] [Medline]
  186. Morrison H, Savitz D, Semenciw R, Hulka B, Mao Y, Morison D, et al. Farming and prostate cancer mortality. Am J Epidemiol. Feb 01, 1993;137(3):270-280. [CrossRef] [Medline]
  187. Orton SM, Wald L, Confavreux C, Vukusic S, Krohn J, Ramagopalan S, et al. Association of UV radiation with multiple sclerosis prevalence and sex ratio in France. Neurology. Feb 2011;76(5):425-431. [CrossRef]
  188. Pouchieu C, Piel C, Carles C, Gruber A, Helmer C, Tual S, et al. Pesticide use in agriculture and Parkinson's disease in the AGRICAN cohort study. Int J Epidemiol. Feb 01, 2018;47(1):299-310. [CrossRef] [Medline]
  189. Pukkala E, Martinsen JI, Lynge E, Gunnarsdottir HK, Sparén P, Tryggvadottir L, et al. Occupation and cancer - follow-up of 15 million people in five Nordic countries. Acta Oncol. Jul 21, 2009;48(5):646-790. [CrossRef] [Medline]
  190. Reif J, Pearce N, Fraser J. Cancer risks in New Zealand farmers. Int J Epidemiol. Dec 1989;18(4):768-774. [CrossRef] [Medline]
  191. Stokes L, Stark A, Marshall E, Narang A. Neurotoxicity among pesticide applicators exposed to organophosphates. Occup Environ Med. Oct 01, 1995;52(10):648-653. [FREE Full text] [CrossRef] [Medline]
  192. Vingård E, Alfredsson L, Goldie I, Hogstedt C. Occupation and osteoarthrosis of the hip and knee: a register-based cohort study. Int J Epidemiol. Dec 1991;20(4):1025-1031. [CrossRef] [Medline]
  193. Vukusic S, Van Bockstael V, Gosselin S, Confavreux C. Regional variations in the prevalence of multiple sclerosis in French farmers. J Neurol Neurosurg Psychiatry. Jul 18, 2007;78(7):707-709. [FREE Full text] [CrossRef] [Medline]
  194. Weichenthal S, Villeneuve PJ, Burnett RT, van Donkelaar A, Martin RV, Jones RR, et al. Long-term exposure to fine particulate matter: association with nonaccidental and cardiovascular mortality in the agricultural health study cohort. Environ Health Perspect. Jun 2014;122(6):609-615. [FREE Full text] [CrossRef] [Medline]
  195. Wigle DT, Semenciw RM, Wilkins K, Riedel D, Ritter L, Morrison HI, et al. Mortality study of Canadian male farm operators: non-Hodgkin's lymphoma mortality and agricultural practices in Saskatchewan. J Natl Cancer Inst. Apr 04, 1990;82(7):575-582. [CrossRef] [Medline]
  196. Wiklund K, Dich J, Holm LE, Eklund G. Risk of cancer in pesticide applicators in Swedish agriculture. Br J Ind Med. Nov 01, 1989;46(11):809-814. [FREE Full text] [CrossRef] [Medline]
  197. Horwitz H, Ahlgren B, Nærum E. Effect of occupation on risk of developing MS: an insurance cohort study. BMJ Open. Jun 20, 2013;3(6):e002894. [FREE Full text] [CrossRef] [Medline]
  198. Karttunen JP, Rautiainen RH, Quendler E. Gender division of farm work and occupational injuries. J Agric Saf Health. Jul 16, 2019;25(3):117-127. [CrossRef] [Medline]
  199. Keller JE, Howe HL. Risk factors for lung cancer among nonsmoking Illinois residents. Environ Res. Jan 1993;60(1):1-11. [CrossRef] [Medline]
  200. Marant Micallef C, Shield KD, Vignat J, Baldi I, Charbotel B, Fervers B, et al. Cancers in France in 2015 attributable to occupational exposures. Int J Hyg Environ Health. Jan 2019;222(1):22-29. [CrossRef] [Medline]
  201. Moisan F, Spinosi J, Dupupet JL, Delabre L, Mazurie J, Goldberg M, et al. The relation between type of farming and prevalence of Parkinson's disease among agricultural workers in five French districts. Mov Disord. Feb 01, 2011;26(2):271-279. [FREE Full text] [CrossRef] [Medline]
  202. Toch-Marquardt M, Menvielle G, Eikemo TA, Kulhánová I, Kulik MC, Bopp M, et al. Euro-GBD-SE consortium. Occupational class inequalities in all-cause and cause-specific mortality among middle-aged men in 14 European populations during the early 2000s. PLoS One. Sep 30, 2014;9(9):e108072. [FREE Full text] [CrossRef] [Medline]
  203. Tüchsen F, Andersen O, Costa G, Filakti H, Marmot MG. Occupation and ischemic heart disease in the European Community: a comparative study of occupations at potential high risk. Am J Ind Med. Oct 1996;30(4):407-414. [CrossRef] [Medline]
  204. Wang CC, Lin GL, Lin YJ, Chen WL, Wu WT. Occupational health surveillance and detection of emerging occupational diseases among Taiwan farmers, through analysis of national-based farmers' and medico-administrative databases. Am J Ind Med. Jan 17, 2023;66(1):85-93. [CrossRef] [Medline]
  205. Langholz PL, Sommerseth HL. Old-age mortality and social class in northern Norway in the first half of the twentieth century. Scand Econ Hist Rev. Sep 13, 2023;72(2):153-170. [CrossRef]
  206. Das AV, Bagga B, Joseph J. Clinical profile and demographic distribution of keratitis: an electronic medical record-driven data analytics from an eye care network in India. Ocul Immunol Inflamm. Jul 21, 2024;32(5):722-726. [CrossRef] [Medline]
  207. Lemarchand C, Tual S, Levêque-Morlais N, Perrier S, Belot A, Velten M, et al. AGRICAN group. Cancer incidence in the AGRICAN cohort study (2005-2011). Cancer Epidemiol. Aug 2017;49:175-185. [FREE Full text] [CrossRef] [Medline]
  208. Scott E, Hirabayashi L, Levenstein A, Krupa N, Jenkins P. The development of a machine learning algorithm to identify occupational injuries in agriculture using pre-hospital care reports. Health Inf Sci Syst. Dec 29, 2021;9(1):31. [FREE Full text] [CrossRef] [Medline]
  209. Wilke RA, Qamar M, Lupu RA, Gu S, Zhao J. Chronic kidney disease in agricultural communities. Am J Med. Oct 2019;132(10):e727-e732. [FREE Full text] [CrossRef] [Medline]
  210. Engel LS, Werder E, Satagopan J, Blair A, Hoppin JA, Koutros S, et al. Insecticide use and breast cancer risk among farmers' wives in the agricultural health study. Environ Health Perspect. Sep 06, 2017;125(9):097002. [FREE Full text] [CrossRef] [Medline]
  211. Louis LM, Lerro CC, Friesen MC, Andreotti G, Koutros S, Sandler DP, et al. A prospective study of cancer risk among agricultural health study farm spouses associated with personal use of organochlorine insecticides. Environ Health. Sep 06, 2017;16(1):95. [FREE Full text] [CrossRef] [Medline]
  212. Elbaek Pedersen J, Hansen J. Risk of breast cancer in daughters of agricultural workers in Denmark. Environ Res. Jan 01, 2024;240(Pt 1):117374. [CrossRef] [Medline]
  213. Kasten-Arias C, Hodes T, Marino M, Kaufmann J, Lucas JA, Estela Vasquez Guzman C, et al. Healthcare utilization for asthma exacerbation among children of migrant and seasonal farmworkers. Prev Med Rep. Feb 2024;38:102598. [FREE Full text] [CrossRef] [Medline]
  214. Mengist Dessie A, Fenta Feleke S, Getaye Workie S, Getinet Abebe T, Mossu Chanie Y, Kassa Yalew A. Prevalence of skin disease and its associated factors among primary schoolchildren: a cross-sectional study from a Northern Ethiopian town. Clin Cosmet Investig Dermatol. 2022;15:791-801. [FREE Full text] [CrossRef] [Medline]
  215. Mills PK, Zahm SH. Organophosphate pesticide residues in urine of farmworkers and their children in Fresno County, California. Am J Ind Med. 2001;40(5):571-577. [CrossRef]
  216. Andrew N, Beare R, Ravipati T, Parker E, Snowdon D, Naude K, et al. Developing a linked electronic health record derived data platform to support research into healthy ageing. Int J Popul Data Sci. Jun 12, 2023;8(1):2129. [FREE Full text] [CrossRef] [Medline]
  217. Contiero P, Borgini A, Bertoldi M, Abita A, Cuffari G, Tomao P, et al. An epidemiological study to investigate links between atmospheric pollution from farming and SARS-CoV-2 mortality. Int J Environ Res Public Health. Apr 12, 2022;19(8):4637. [FREE Full text] [CrossRef] [Medline]
  218. Lotterman A, Baliatsas C, de Rooij MM, Huss A, Jacobs J, Dückers M, et al. Increased risk of pneumonia amongst residents living near goat farms in different livestock-dense regions in the Netherlands. PLoS One. Jul 5, 2023;18(7):e0286972. [FREE Full text] [CrossRef] [Medline]
  219. Nerich V, Monnet E, Weill A, Vallier N, Vanbockstael V, Auleley G, et al. Fine-scale geographic variations of inflammatory bowel disease in France: correlation with socioeconomic and house equipment variables. Inflamm Bowel Dis. May 2010;16(5):813-821. [CrossRef] [Medline]
  220. Nilles JD, Lim D, Boyer MP, Wilson BD, Betar RA, Showalter HA, et al. The occurrence of bone and joint cancers and their association with rural living and radon exposure in Iowa. Environ Geochem Health. Mar 05, 2023;45(3):925-940. [FREE Full text] [CrossRef] [Medline]
  221. Yzermans CJ, Moleman YP, Spreeuwenberg P, Nielen MM, Dückers ML, Smit LA, et al. Risk of pneumonia in the vicinity of goat farms: a comparative assessment of temporal variation based on longitudinal health data. Pneumonia (Nathan). Sep 05, 2023;15(1):13. [FREE Full text] [CrossRef] [Medline]
  222. Petit P, Bosson-Rieutort D, Maugard C, Gondard E, Ozenfant D, Joubert N, et al. The TRACTOR project: TRACking and MoniToring occupational risks in agriculture using French insurance health data (MSA). Ann Work Expo Health. Mar 15, 2022;66(3):402-411. [CrossRef] [Medline]
  223. Mills PK, Kwong S. Cancer incidence in the United Farmworkers of America (UFW), 1987–1997. Am J Ind Med. 2001;40(5):596-603. [CrossRef]
  224. Guillien A, Laurent L, Soumagne T, Puyraveau M, Laplante J, Andujar P, et al. Anxiety and depression among dairy farmers: the impact of COPD. Int J Chron Obstruct Pulmon Dis. 2018;13:1-9. [FREE Full text] [CrossRef] [Medline]
  225. A consortium of agricultural cohort studies. AGRICOH. URL: https://agricoh.iarc.fr/ [accessed 2024-05-17]
  226. Jouneau S, Pinault M, Gouyet T, Brinchault G, Guillot S, Viel J, et al. GERPRA. [AIRBAg study: preliminary results after one year of screening for COPD in dairy farmers]. Rev Mal Respir. Mar 2015;32(3):240-248. [CrossRef] [Medline]
  227. Soto S, Yoder AM, Nuño T, Aceves B, Sepulveda R, Rosales CB. Health conditions among farmworkers in the southwest: an analysis of the National Agricultural Workers survey. Front Public Health. Nov 3, 2022;10:962085. [FREE Full text] [CrossRef] [Medline]
  228. Contaxis N, Clark J, Dellureficio A, Gonzales S, Mannheimer S, Oxley PR, et al. Ten simple rules for improving research data discovery. PLoS Comput Biol. Feb 10, 2022;18(2):e1009768. [FREE Full text] [CrossRef] [Medline]
  229. Vesteghem C, Brøndum RF, Sønderkær M, Sommer M, Schmitz A, Bødker JS, et al. Implementing the FAIR Data Principles in precision oncology: review of supporting initiatives. Brief Bioinform. May 21, 2020;21(3):936-945. [FREE Full text] [CrossRef] [Medline]
  230. Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. Mar 15, 2016;3(1):160018. [FREE Full text] [CrossRef] [Medline]
  231. Roux P, Guillien A, Soumagne T, Ritter O, Laplante J, Travers C, et al. Smoking habits in French farmers: a cross-sectional study. BMC Public Health. Feb 04, 2017;17(1):166. [FREE Full text] [CrossRef] [Medline]
  232. Audi C, Baïz N, Maesano CN, Ramousse O, Reboulleau D, Magnan A, et al. Serum cytokine levels related to exposure to volatile organic compounds and PM2.5 in dwellings and workplaces in French farmers - a mechanism to explain nonsmoking COPD. Int J Chron Obstruct Pulmon Dis. 2017;12:1363-1374. [FREE Full text] [CrossRef] [Medline]
  233. Baldi I, Gruber A, Rondeau V, Lebailly P, Brochard P, Fabrigoule C. Neurobehavioral effects of long-term exposure to pesticides: results from the 4-year follow-up of the PHYTONER study. Occup Environ Med. Feb 22, 2011;68(2):108-115. [CrossRef] [Medline]
  234. Christensen CH, Platz EA, Andreotti G, Blair A, Hoppin JA, Koutros S, et al. Coumaphos exposure and incident cancer among male participants in the Agricultural Health Study (AHS). Environ Health Perspect. Jan 2010;118(1):92-96. [FREE Full text] [CrossRef] [Medline]
  235. Achard P, Maugard C, Cancé C, Spinosi J, Ozenfant D, Maître A, et al. Medico-administrative data combined with agricultural practices data to retrospectively estimate pesticide use by agricultural workers. J Expo Sci Environ Epidemiol. Jul 04, 2020;30(4):743-755. [CrossRef] [Medline]
  236. Hohenadel K, Harris SA, McLaughlin JR, Spinelli JJ, Pahwa P, Dosman JA, et al. Exposure to multiple pesticides and risk of non-Hodgkin lymphoma in men from six Canadian provinces. Int J Environ Res Public Health. Jun 21, 2011;8(6):2320-2330. [FREE Full text] [CrossRef] [Medline]
  237. Chung EY, Risi D, Holt JL, Lonergan M, Kotwal S, Yong K, et al. Retrospective study on the epidemiology of antineutrophil cytoplasmic autoantibodies-associated vasculitis in two Australian health districts. Intern Med J. Apr 13, 2022;52(4):605-613. [CrossRef] [Medline]
  238. Li X, Sundquist J, Sundquist K. Socioeconomic and occupational risk factors for rheumatoid arthritis: a nationwide study based on hospitalizations in Sweden. J Rheumatol. Jun 2008;35(6):986-991. [Medline]
  239. Apalla Z, Lallas A, Sotiriou E, Lazaridou E, Vakirlis E, Trakatelli M, et al. Farmers develop more aggressive histologic subtypes of basal cell carcinoma. Experience from a tertiary hospital in Northern Greece. J Eur Acad Dermatol Venereol. Apr 17, 2016;30 Suppl 3(S3):17-20. [CrossRef] [Medline]
  240. Lope V, Pollán M, Gustavsson P, Plato N, Pérez-Gómez B, Aragonés N, et al. Occupation and thyroid cancer risk in Sweden. J Occup Environ Med. Sep 2005;47(9):948-957. [CrossRef] [Medline]
  241. Nordby KC, Andersen A, Kristensen P. Incidence of lip cancer in the male Norwegian agricultural population. Cancer Causes Control. Aug 2004;15(6):619-626. [CrossRef]
  242. Kristensen P, Andersen A, Irgens LM. Hormone-dependent cancer and adverse reproductive outcomes in farmers' families--effects of climatic conditions favoring fungal growth in grain. Scand J Work Environ Health. Aug 2000;26(4):331-337. [FREE Full text] [CrossRef] [Medline]
  243. Døssing M, Petersen KT, Vyberg M, Olsen JH. Liver cancer among employees in Denmark. Am J Ind Med. Sep 1997;32(3):248-254. [CrossRef] [Medline]
  244. Klingelschmidt J, Chastang JF, Khireddine-Medouni I, Chérié-Challine L, Niedhammer I. [Occupational factors associated with suicide among French employees from the special agricultural social security scheme (MSA) working between 2007 and 2013]. Rev Epidemiol Sante Publique. Feb 2020;68(1):1-8. [CrossRef] [Medline]
  245. Manninen P, Heliövaara M, Riihimäki H, Mäkelä P. Does psychological distress predict disability? Int J Epidemiol. Oct 1997;26(5):1063-1070. [CrossRef] [Medline]
  246. Zöller B, Li X, Sundquist J, Sundquist K. Socioeconomic and occupational risk factors for venous thromboembolism in Sweden: a nationwide epidemiological study. Thromb Res. May 2012;129(5):577-582. [CrossRef] [Medline]
  247. Lucyk K, Lu M, Sajobi T, Quan H. Administrative health data in Canada: lessons from history. BMC Med Inform Decis Mak. Aug 19, 2015;15(1):69. [FREE Full text] [CrossRef] [Medline]
  248. Kab S, Spinosi J, Chaperon L, Dugravot A, Singh-Manoux A, Moisan F, et al. Agricultural activities and the incidence of Parkinson's disease in the general French population. Eur J Epidemiol. Mar 9, 2017;32(3):203-216. [CrossRef] [Medline]
  249. Perrin L, Spinosi J, Chaperon L, Kab S, Moisan F, Ebaz A. Pesticides expenditures by farming type and incidence of Parkinson disease in farmers: a French nationwide study. Environ Res. Jun 2021;197:111161. [CrossRef] [Medline]
  250. Carles C, Bouvier G, Lebailly P, Baldi I. Use of job-exposure matrices to estimate occupational exposure to pesticides: a review. J Expo Sci Environ Epidemiol. Mar 2017;27(2):125-140. [CrossRef] [Medline]
  251. Tudi M, Li H, Li H, Wang L, Lyu J, Yang L, et al. Exposure routes and health risks associated with pesticide application. Toxics. Jun 19, 2022;10(6):335. [FREE Full text] [CrossRef] [Medline]
  252. Ahmad MF, Ahmad FA, Alsayegh AA, Zeyaullah M, AlShahrani AM, Muzammil K, et al. Pesticides impacts on human health and the environment with their mechanisms of action and possible countermeasures. Heliyon. Apr 15, 2024;10(7):e29128. [FREE Full text] [CrossRef] [Medline]
  253. Cavalier H, Trasande L, Porta M. Exposures to pesticides and risk of cancer: evaluation of recent epidemiological evidence in humans and paths forward. Int J Cancer. Mar 01, 2023;152(5):879-912. [FREE Full text] [CrossRef] [Medline]
  254. Cresto N, Forner-Piquer I, Baig A, Chatterjee M, Perroy J, Goracci J, et al. Pesticides at brain borders: impact on the blood-brain barrier, neuroinflammation, and neurological risk trajectories. Chemosphere. May 2023;324:138251. [CrossRef] [Medline]
  255. Richardson JR, Fitsanakis V, Westerink RH, Kanthasamy AG. Neurotoxicity of pesticides. Acta Neuropathol. Sep 13, 2019;138(3):343-362. [FREE Full text] [CrossRef] [Medline]
  256. Sun J, Fang R, Wang H, Xu D, Yang J, Huang X, et al. A review of environmental metabolism disrupting chemicals and effect biomarkers associating disease risks: where exposomics meets metabolomics. Environ Int. Jan 2022;158:106941. [FREE Full text] [CrossRef] [Medline]
  257. Blair A, Freeman LB. Epidemiologic studies in agricultural populations: observations and future directions. J Agromedicine. May 07, 2009;14(2):125-131. [FREE Full text] [CrossRef] [Medline]
  258. Carvalho JG, Augusto HC, Ferraz R, Delerue-Matos C, Fernandes VC. Micro(nano)plastic and related chemicals: emerging contaminants in environment, food and health impacts. Toxics. Oct 20, 2024;12(10):762. [FREE Full text] [CrossRef] [Medline]
  259. Clarke K, Manrique A, Sabo-Attwood T, Coker ES. A narrative review of occupational air pollution and respiratory health in farmworkers. Int J Environ Res Public Health. Apr 13, 2021;18(8):4097. [FREE Full text] [CrossRef] [Medline]
  260. Domingo NG, Balasubramanian S, Thakrar SK, Clark MA, Adams PJ, Marshall JD, et al. Air quality-related health damages of food. Proc Natl Acad Sci U S A. May 18, 2021;118(20):e2013637118. [FREE Full text] [CrossRef] [Medline]
  261. Hao Y, Sun H, Zeng X, Dong G, Kronzucker HJ, Min J, et al. Smallholder vegetable farming produces more soil microplastics pollution than large-scale farming. Environ Pollut. Jan 15, 2023;317:120805. [CrossRef] [Medline]
  262. Ng EL, Huerta Lwanga E, Eldridge SM, Johnston P, Hu H, Geissen V, et al. An overview of microplastic and nanoplastic pollution in agroecosystems. Sci Total Environ. Jun 15, 2018;627:1377-1388. [CrossRef] [Medline]
  263. van Kersen W, Bossers A, de Steenhuijsen Piters WA, de Rooij MM, Bonten M, Fluit AC, et al. Air pollution from livestock farms and the oropharyngeal microbiome of COPD patients and controls. Environ Int. Nov 2022;169:107497. [FREE Full text] [CrossRef] [Medline]
  264. Ganter M. Zoonotic risks from small ruminants. Vet Microbiol. Dec 14, 2015;181(1-2):53-65. [CrossRef] [Medline]
  265. Golomazou E, Mamedova S, Eslahi AV, Karanis P. Cryptosporidium and agriculture: a review. Sci Total Environ. Mar 15, 2024;916:170057. [CrossRef] [Medline]
  266. McDaniel CJ, Cardwell DM, Moeller Jr RB, Gray GC. Humans and cattle: a review of bovine zoonoses. Vector Borne Zoonotic Dis. Jan 2014;14(1):1-19. [FREE Full text] [CrossRef] [Medline]
  267. Arora K, Xu L, Bhagianadh D. Dementia and cognitive decline in older adulthood: are agricultural workers at greater risk? J Gerontol B Psychol Sci Soc Sci. Sep 13, 2021;76(8):1629-1643. [FREE Full text] [CrossRef] [Medline]
  268. Douphrate DI, Rosecrance JC, Reynolds SJ, Stallones L, Gilkey DP. Tractor-related injuries: an analysis of workers' compensation data. J Agromedicine. May 07, 2009;14(2):198-205. [CrossRef] [Medline]
  269. Dos Anjos Magri C, Garófallo Garcia R, Binotto E, Duarte da Silva Lima N, de Alencar Nääs I, Sgavioli S, et al. Occupational risk factors in health of broiler-farm workers: a systematic review. Arch Environ Occup Health. Oct 15, 2021;76(8):482-493. [CrossRef] [Medline]
  270. Barbosa Junior M, Sokulski CC, Salvador R, Pinheiro E, de Francisco AC, Trojan F. What kills the agricultural worker? A systematic review on suicide. Rural Remote Health. Aug 2021;21(3):6067. [FREE Full text] [CrossRef] [Medline]
  271. Garrett-Wright D, Malin C, Jones MS. Mental health in farming communities. J Psychosoc Nurs Ment Health Serv. Oct 2023;61(10):39-43. [CrossRef] [Medline]
  272. Lunner Kolstrup C, Kallioniemi M, Lundqvist P, Kymäläinen HR, Stallones L, Brumby S. International perspectives on psychosocial working conditions, mental health, and stress of dairy farm operators. J Agromedicine. Jul 03, 2013;18(3):244-255. [CrossRef] [Medline]
  273. Petit P, Gandon G, Dubuc M, Vuillerme N, Bonneterre V. Agricultural activities and risk of treatment for depressive disorders among the entire French agricultural workforce: the TRACTOR project, a nationwide retrospective cohort study. Lancet Reg Health Eur. Aug 2023;31:100674. [FREE Full text] [CrossRef] [Medline]
  274. Daghagh Yazd S, Wheeler SA, Zuo A. Key risk factors affecting farmers' mental health: a systematic review. Int J Environ Res Public Health. Dec 02, 2019;16(23):4849. [FREE Full text] [CrossRef] [Medline]
  275. Nordby K, Andersen A, Irgens LM, Kristensen P. Indicators of mancozeb exposure in relation to thyroid cancer and neural tube defects in farmers' families. Scand J Work Environ Health. Apr 2005;31(2):89-96. [FREE Full text] [CrossRef] [Medline]
  276. Kauppinen T, Uuksulainen S, Saalo A, Mäkinen I, Pukkala E. Use of the Finnish Information System on Occupational Exposure (FINJEM) in epidemiologic, surveillance, and other applications. Ann Occup Hyg. Apr 2014;58(3):380-396. [CrossRef] [Medline]
  277. Shim YK, Mlynarek SP, van Wijngaarden E. Parental exposure to pesticides and childhood brain cancer: U.S. Atlantic coast childhood brain cancer study. Environ Health Perspect. Jun 2009;117(6):1002-1006. [FREE Full text] [CrossRef] [Medline]
  278. Baldi I, Carles C, Blanc-Lapierre A, Fabbro-Peray P, Druet-Cabanac M, Boutet-Robinet E, et al. PESTIMAT Group. A French crop-exposure matrix for use in epidemiological studies on pesticides: PESTIMAT. J Expo Sci Environ Epidemiol. Jan 23, 2017;27(1):56-63. [CrossRef] [Medline]
  279. Oltramare C, Mediouni Z, Shoman Y, Hopf NB, Graczyk H, Berthet A. Determinants of pesticide exposure in occupational studies: a meta-analysis. Toxics. Jul 18, 2023;11(7):623. [FREE Full text] [CrossRef] [Medline]
  280. Burstyn I, Lavoué J, Van Tongeren M. Aggregation of exposure level and probability into a single metric in job-exposure matrices creates bias. Ann Occup Hyg. Nov 2012;56(9):1038-1050. [CrossRef] [Medline]
  281. Fadel M, Evanoff BA, Andersen JH, d'Errico A, Dale AM, Leclerc A, et al. Not just a research method: if used with caution, can job-exposure matrices be a useful tool in the practice of occupational medicine and public health? Scand J Work Environ Health. Sep 01, 2020;46(5):552-553. [FREE Full text] [CrossRef] [Medline]
  282. Peters S, Vienneau D, Sampri A, Turner MC, Castaño-Vinyals G, Bugge M, et al. Occupational exposure assessment tools in Europe: a comprehensive inventory overview. Ann Work Expo Health. Jun 06, 2022;66(5):671-686. [FREE Full text] [CrossRef] [Medline]
  283. Petit P, Gondard E, Gandon G, Moreaud O, Sauvée M, Bonneterre V. Agricultural activities and risk of Alzheimer's disease: the TRACTOR project, a nationwide retrospective cohort study. Eur J Epidemiol. Mar 10, 2024;39(3):271-287. [FREE Full text] [CrossRef] [Medline]
  284. List of classifications. International Agency for Research on Cancer. URL: https://monographs.iarc.who.int/list-of-classifications [accessed 2024-10-30]
  285. Desai B, Sahni S, Jordan H, Sahni R, Reinbeau R, Nguyen A, et al. Risk of cardiovascular disease in male farmers over the age of 45: a review of literature. Cureus. May 2022;14(5):e24642. [FREE Full text] [CrossRef] [Medline]
  286. Kim H, Jung W, Jung S, Cho S, Jung I, Song H, et al. Is farming a risk occupation for cardio-cerebrovascular diseases? A scoping review on cardio-cerebrovascular disease risk in farmers. J Prev Med Public Health. Aug 23, 2024;57(6):521-529. [FREE Full text] [CrossRef] [Medline]
  287. Hulkkonen S, Shiri R, Auvinen J, Miettunen J, Karppinen J, Ryhänen J. Risk factors of hospitalization for carpal tunnel syndrome among the general working population. Scand J Work Environ Health. Jan 01, 2020;46(1):43-49. [FREE Full text] [CrossRef] [Medline]
  288. Kim J, Youn K, Park J. Risk factors for musculoskeletal disorders in Korean farmers: survey on occupational diseases in 2020 and 2022. Healthcare (Basel). Oct 11, 2024;12(20):2026. [FREE Full text] [CrossRef] [Medline]
  289. Hemminki K, Niemi ML, Saloniemi I, Vainio H, Hemminki E. Spontaneous abortions by occupation and social class in Finland. Int J Epidemiol. Jun 1980;9(2):149-153. [CrossRef] [Medline]
  290. Strand K, Wergeland E, Bjerkedal T. Fertility patterns according to occupational grouping in Norway, 1989. Scand J Soc Med. Mar 01, 1996;24(1):50-54. [CrossRef] [Medline]
  291. Liu Y, Huang L, Hu F, Zhang X. Investigating frailty, polypharmacy, malnutrition, chronic conditions, and quality of life in older adults: large population-based study. JMIR Public Health Surveill. Oct 11, 2024;10:e50617. [FREE Full text] [CrossRef] [Medline]
  292. Lie A, Skogstad M, Johannessen HA, Tynes T, Mehlum IS, Nordby K, et al. Occupational noise exposure and hearing: a systematic review. Int Arch Occup Environ Health. Apr 7, 2016;89(3):351-372. [FREE Full text] [CrossRef] [Medline]
  293. Kraemer JG, Aebi S, Oppliger A, Hilty M. The Indoor-Air Microbiota of pig farms drives the composition of the pig farmers' nasal microbiota in a season-dependent and farm-specific manner. Appl Environ Microbiol. May 01, 2019;85(9):e03038. [FREE Full text] [CrossRef] [Medline]
  294. Mucci N, Tommasi E, Chiarelli A, Lulli LG, Traversini V, Galea RP, et al. WORKbiota: a systematic review about the effects of occupational exposure on microbiota and workers' health. Int J Environ Res Public Health. Jan 18, 2022;19(3):1043. [FREE Full text] [CrossRef] [Medline]
  295. Shukla SK, Ye Z, Sandberg S, Reyes I, Fritsche TR, Keifer M. The nasal microbiota of dairy farmers is more complex than oral microbiota, reflects occupational exposure, and provides competition for staphylococci. PLoS One. Aug 29, 2017;12(8):e0183898. [FREE Full text] [CrossRef] [Medline]
  296. Sun J, Liao XP, D'Souza AW, Boolchandani M, Li S, Cheng K, et al. Environmental remodeling of human gut microbiota and antibiotic resistome in livestock farms. Nat Commun. Mar 18, 2020;11(1):1427. [FREE Full text] [CrossRef] [Medline]
  297. Vilela C, Araújo B, Soares-Guedes C, Caridade-Silva R, Martins-Macedo J, Teixeira C, et al. From the gut to the brain: is microbiota a new paradigm in Parkinson’s disease treatment? Cells. Apr 30, 2024;13(9):770. [FREE Full text] [CrossRef] [Medline]
  298. Gois MF, Fernández-Pato A, Huss A, Gacesa R, Wijmenga C, Weersma RK, et al. Impact of occupational pesticide exposure on the human gut microbiome. Front Microbiol. Aug 10, 2023;14:1223120. [FREE Full text] [CrossRef] [Medline]
  299. Christen P, Schnell R. Thirty-three myths and misconceptions about population data: from data capture and processing to linkage. Int J Popul Data Sci. Jan 31, 2023;8(1):2115. [FREE Full text] [CrossRef] [Medline]
  300. Dattani N, Hardelid P, Davey J, Gilbert R. Accessing electronic administrative health data for research takes time. Arch Dis Child. May 28, 2013;98(5):391-392. [CrossRef] [Medline]
  301. Verheij RA, Curcin V, Delaney BC, McGilchrist MM. Possible sources of bias in primary care electronic health record data use and reuse. J Med Internet Res. May 29, 2018;20(5):e185. [FREE Full text] [CrossRef] [Medline]
  302. Bousquet P, Caillet P, Coeuret-Pellicer M, Goulard H, Kudjawu Y, Le Bihan C, et al. [Using cancer case identification algorithms in medico-administrative databases: literature review and first results from the REDSIAM Tumors group based on breast, colon, and lung cancer]. Rev Epidemiol Sante Publique. Oct 2017;65 Suppl 4:S236-S242. [CrossRef] [Medline]
  303. Goldberg M, Carton M, Doussin A, Fagot-Campagna A, Heyndrickx E, Lemaitre M, et al. [The REDSIAM network]. Rev Epidemiol Sante Publique. Oct 2017;65 Suppl 4:S144-S148. [CrossRef] [Medline]
  304. Lipscombe LL, Hwee J, Webster L, Shah BR, Booth GL, Tu K. Identifying diabetes cases from administrative data: a population-based validation study. BMC Health Serv Res. May 02, 2018;18(1):316. [FREE Full text] [CrossRef] [Medline]
  305. Caputo M, Pecere A, Sarro A, Mele C, Ucciero A, Pagano L, et al. Incidence and prevalence of hyperthyroidism: a population-based study in the Piedmont Region, Italy. Endocrine. Jul 13, 2020;69(1):107-112. [CrossRef] [Medline]
  306. Habib RR, Hojeij S, Elzein K. Gender in occupational health research of farmworkers: a systematic review. Am J Ind Med. Dec 16, 2014;57(12):1344-1367. [FREE Full text] [CrossRef] [Medline]
  307. Biswas A, Harbin S, Irvin E, Johnston H, Begum M, Tiong M, et al. Sex and gender differences in occupational hazard exposures: a scoping review of the recent literature. Curr Environ Health Rep. Dec 27, 2021;8(4):267-280. [FREE Full text] [CrossRef] [Medline]
  308. Biswas A, Tiong M, Irvin E, Zhai G, Sinkins M, Johnston H, et al. Gender and sex differences in occupation-specific infectious diseases: a systematic review. Occup Environ Med. Aug 29, 2024;81(8):425-432. [FREE Full text] [CrossRef] [Medline]
  309. Comfort N, Re DB. Sex-specific neurotoxic effects of organophosphate pesticides across the life course. Curr Environ Health Rep. Dec 23, 2017;4(4):392-404. [FREE Full text] [CrossRef] [Medline]
  310. Wang W, Jin J, He R, Gong H. Gender differences in pesticide use knowledge, risk awareness and practices in Chinese farmers. Sci Total Environ. Jul 15, 2017;590-591:22-28. [CrossRef] [Medline]
  311. Xu L, Xu X, Wu X, Kuang H, Xu C. Sex-dependent environmental health risk analysis of flupyradifurone. Environ Sci Technol. Feb 01, 2022;56(3):1841-1853. [CrossRef] [Medline]
  312. Ketchen Jr DJ, Roccapriore AY, Connelly BL. Using old data: when is it appropriate? J Manag. Jun 04, 2023;49(8):2541-2548. [CrossRef]
  313. Vallot C. Epigenomic tumor evolution under the spotlight: the promises of single-cell approaches. C R Biol. May 11, 2022;345(1):11-16. [FREE Full text] [CrossRef] [Medline]
  314. Horgan D, Hajduch M, Vrana M, Soderberg J, Hughes N, Omar MI, et al. European health data space-an opportunity now to grasp the future of data-driven healthcare. Healthcare (Basel). Aug 26, 2022;10(9):1629. [FREE Full text] [CrossRef] [Medline]
  315. Martani A, Geneviève LD, Wangmo T, Maurer J, Crameri K, Erard F, et al. Sensing the (digital) pulse. Future steps for improving the secondary use of data for research in Switzerland. Digit Health. Apr 20, 2023;9:20552076231169826. [FREE Full text] [CrossRef] [Medline]
  316. Raab R, Küderle A, Zakreuskaya A, Stern AD, Klucken J, Kaissis G, et al. Federated electronic health records for the European Health Data Space. Lancet Digit Health. Nov 2023;5(11):e840-e847. [FREE Full text] [CrossRef] [Medline]
  317. Pampel H, Weisweiler NL, Strecker D, Witt M, Vierkant P, Elger K, et al. re3data - indexing the global research data repository landscape since 2012. Sci Data. Aug 29, 2023;10(1):571. [FREE Full text] [CrossRef] [Medline]
  318. Suhr B, Dungl J, Stocker A. Search, reuse and sharing of research data in materials science and engineering-a qualitative interview study. PLoS One. Sep 15, 2020;15(9):e0239216. [FREE Full text] [CrossRef] [Medline]
  319. Noaeen M, Doiron C, Syer J, Brook J. Advancing population health through open environmental data platforms. Curr Top Behav Neurosci. 2024;68:297-323. [CrossRef] [Medline]
  320. Dul J, van Raaij E, Caputo A. Advancing scientific inquiry through data reuse: necessary condition analysis with archival data. Strateg Change. Nov 02, 2023;33(1):35-40. [CrossRef]
  321. Catalogue des bases de données individuelles en santé en France. INSERM. 2020. URL: https://epidemiologie-france.aviesan.fr/ [accessed 2024-10-30]
  322. Plateforme ouverte des données publiques françaises. République Française. URL: https://www.data.gouv.fr/fr/ [accessed 2024-10-30]
  323. Dataset search. Google. URL: https://datasetsearch.research.google.com/ [accessed 2024-10-30]
  324. Kogevinas M, Schlünssen V, Mehlum IS, Turner MC. The OMEGA-NET international inventory of occupational cohorts. Ann Work Expo Health. Jul 01, 2020;64(6):565-568. [FREE Full text] [CrossRef] [Medline]
  325. Peters S. Although a valuable method in occupational epidemiology, job-exposure -matrices are no magic fix. Scand J Work Environ Health. May 01, 2020;46(3):231-234. [FREE Full text] [CrossRef] [Medline]
  326. Nittas V, Puhan MA, von Wyl V. Toward a working definition of eCohort studies in health research: narrative literature review. JMIR Public Health Surveill. Jan 21, 2021;7(1):e24588. [FREE Full text] [CrossRef] [Medline]
  327. Singh AV, Bansod G, Mahajan M, Dietrich P, Singh SP, Rav K, et al. Digital transformation in toxicology: improving communication and efficiency in risk assessment. ACS Omega. Jun 20, 2023;8(24):21377-21390. [CrossRef] [Medline]
  328. Soussi A, Zero E, Sacile R, Trinchero D, Fossa M. Smart sensors and smart data for precision agriculture: a review. Sensors (Basel). Apr 21, 2024;24(8):2647. [FREE Full text] [CrossRef] [Medline]
  329. Bukhtiyarova O, Abderrazak A, Chiu Y, Sparano S, Simard M, Sirois C. Major areas of interest of artificial intelligence research applied to health care administrative data: a scoping review. Front Pharmacol. Jul 18, 2022;13:944516. [FREE Full text] [CrossRef] [Medline]
  330. Caruana A, Bandara M, Musial K, Catchpoole D, Kennedy PJ. Machine learning for administrative health records: a systematic review of techniques and applications. Artif Intell Med. Oct 2023;144:102642. [CrossRef] [Medline]
  331. Leal Neto O, Von Wyl V. Digital transformation of public health for noncommunicable diseases: narrative viewpoint of challenges and opportunities. JMIR Public Health Surveill. Jan 25, 2024;10:e49575. [FREE Full text] [CrossRef] [Medline]
  332. Wang J, Kharrat FG, Gariépy G, Gagné C, Pelletier J, Massamba VK, et al. Predicting the population risk of suicide using routinely collected health administrative data in Quebec, Canada: model-based synthetic estimation study. JMIR Public Health Surveill. Jun 28, 2024;10:e52773. [FREE Full text] [CrossRef] [Medline]
  333. Ben-Ari T, Boé J, Ciais P, Lecerf R, Van der Velde M, Makowski D. Causes and implications of the unforeseen 2016 extreme yield loss in the breadbasket of France. Nat Commun. Apr 24, 2018;9(1):1627. [FREE Full text] [CrossRef] [Medline]
  334. Bhawra J, Elsahli N, Patel J. Applying digital technology to understand human experiences of climate change impacts on food security and mental health: scoping review. JMIR Public Health Surveill. Jul 23, 2024;10:e54064. [FREE Full text] [CrossRef] [Medline]
  335. Comi M, Becot F, Bendixsen C. Automation, climate change, and the future of farm work: cross-disciplinary lessons for studying dynamic changes in agricultural health and safety. Int J Environ Res Public Health. Mar 08, 2023;20(6):4778. [FREE Full text] [CrossRef] [Medline]
  336. Ebi KL, Vanos J, Baldwin JW, Bell JE, Hondula DM, Errett NA, et al. Extreme weather and climate change: population health and health system implications. Annu Rev Public Health. Apr 01, 2021;42:293-315. [FREE Full text] [CrossRef] [Medline]
  337. El Khayat M, Halwani DA, Hneiny L, Alameddine I, Haidar MA, Habib RR. Impacts of climate change and heat stress on farmworkers' health: a scoping review. Front Public Health. Feb 8, 2022;10:782811. [FREE Full text] [CrossRef] [Medline]
  338. Shahzad A, Ullah S, Dar AA, Sardar MF, Mehmood T, Tufail MA, et al. Nexus on climate change: agriculture and possible solution to cope future climate change stresses. Environ Sci Pollut Res. Jan 29, 2021;28(12):14211-14232. [CrossRef]
  339. Gibb K, Beckman S, Vergara XP, Heinzerling A, Harrison R. Extreme heat and occupational health risks. Annu Rev Public Health. May 2024;45(1):315-335. [FREE Full text] [CrossRef] [Medline]
  340. Lefèvre-Arbogast S, Chaker J, Mercier F, Barouki R, Coumoul X, Miller GW, et al. Assessing the contribution of the chemical exposome to neurodegenerative disease. Nat Neurosci. May 29, 2024;27(5):812-821. [FREE Full text] [CrossRef] [Medline]
  341. Shan L, Heusinkveld HJ, Paul KC, Hughes S, Darweesh SK, Bloem BR, et al. Towards improved screening of toxins for Parkinson's risk. NPJ Parkinsons Dis. Dec 19, 2023;9(1):169. [FREE Full text] [CrossRef] [Medline]
  342. Abdelzaher H, Tawfik SM, Nour A, Abdelkader S, Elbalkiny ST, Abdelkader M, et al. Climate change, human health, and the exposome: utilizing OMIC technologies to navigate an era of uncertainty. Front Public Health. Sep 21, 2022;10:973000. [FREE Full text] [CrossRef] [Medline]
  343. DeBord DG, Carreón T, Lentz TJ, Middendorf PJ, Hoover MD, Schulte PA. Use of the "exposome" in the practice of epidemiology: a primer on -Omic technologies. Am J Epidemiol. Aug 15, 2016;184(4):302-314. [FREE Full text] [CrossRef] [Medline]
  344. Waffenschmidt S, Knelangen M, Sieben W, Bühn S, Pieper D. Single screening versus conventional double screening for study selection in systematic reviews: a methodological systematic review. BMC Med Res Methodol. Jun 28, 2019;19(1):132. [FREE Full text] [CrossRef] [Medline]
  345. Waffenschmidt S, Sieben W, Jakubeit T, Knelangen M, Overesch I, Bühn S, et al. Increasing the efficiency of study selection for systematic reviews using prioritization tools and a single-screening approach. Syst Rev. Sep 14, 2023;12(1):161. [FREE Full text] [CrossRef] [Medline]


AGR: annual growth rate
AGRICAN: Agriculture and Cancer
AGRICOH: agricultural cohort
AHD: administrative health database
AHS: Agricultural Health Study
AI: artificial intelligence
AMI: Aging Multidisciplinary Investigation
ANR: French National Research Agency (Agence Nationale de la Recherche in French)
AS: association strength
BIBLIO: preliminary guideline for reporting bibliometric reviews of the biomedical literature
COPD: chronic obstructive pulmonary disease
EHR: electronic health record
EMR: electronic medical record
FAIR: findable, accessible, interoperable, and reusable
FERMA: risk factors of the rural environment and the allergic and respiratory disease
GDP: gross domestic product
JEM: job-exposure matrix
MIAI: Multidisciplinary Institute in Artificial Intelligence
MSA: Mutualité Sociale Agricole
NOCCA: Nordic Occupational Cancer Study
PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews
TRACTOR: Tracking and Monitoring Occupational Risks in Agriculture


Edited by A Mavragani; submitted 05.06.24; peer-reviewed by G Wu, F Clerc; comments to author 30.09.24; revised version received 08.10.24; accepted 07.11.24; published 09.01.25.

Copyright

©Pascal Petit, Nicolas Vuillerme. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 09.01.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.