Published on in Vol 10 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/58076, first published .
Web-Based Respondent-Driven Sampling to Assess Biobehavioral Factors Among Men Who Have Sex With Men in Thailand: Cross-Sectional Study

Web-Based Respondent-Driven Sampling to Assess Biobehavioral Factors Among Men Who Have Sex With Men in Thailand: Cross-Sectional Study

Web-Based Respondent-Driven Sampling to Assess Biobehavioral Factors Among Men Who Have Sex With Men in Thailand: Cross-Sectional Study

1Division of Epidemiology, Department of Disease Control, Ministry of Public Health, , DDC 10 Building, Tiwanon Road, Mueang, Nonthaburi, , Thailand

2Division of Global HIV & TB, U.S. Centers for Disease Control and Prevention, , Nonthaburi, , Thailand

3Informatics Section, Business Services Office, U.S. Centers for Disease Control and Prevention, , Nonthaburi, , Thailand

4Division of Global HIV & TB, U.S. Centers for Disease Control and Prevention, , Atlanta, GA, , United States

*all authors contributed equally

Corresponding Author:

Watcharapol Srinor, BSPH


Background: Respondent-driven sampling (RDS) is the current standard for sampling key populations at risk for HIV infections but is usually limited to local implementation in single towns or cities. Web-based sampling eliminates this spatial constraint but often relies on self-selected convenience samples. We piloted a web-based RDS survey with biomarker collection among men who have sex with men (MSM) in Thailand.

Objective: This study aimed to evaluate and demonstrate the feasibility of implementing a web-based RDS survey as a routine surveillance system in Thailand. The goal was to enhance surveillance efforts targeting hard-to-reach populations in the country.

Methods: We developed a website to fully function like a conventional RDS survey office, including coupon verification, eligibility screening, consenting, interviewing (self-administered), peer recruitment training, coupon issuance, compensation, and recruitment tracking. All functions were automated; data managers monitored recruitment, data collection, and payment and could be contacted by recruits as needed. Eligible participants were male, older than 15 years, resided in Thailand, and had anal sex with a man in the past 6 months. Recruits who resided in Bangkok were additionally invited to physically attend a participating health clinic of their choice for an HIV-related blood draw. Data were weighted to account for the complex sampling design.

Results: The survey was implemented from February to June 2022; seeds (21 at start, 14 added later) were identified mostly through targeted web-based banner ads; coupon uptake was 45.1%. Of 2578 candidate recruits screened for eligibility, 2151 (83.4%) were eligible and 2142 (83.1%) enrolled. Almost all (2067/2578, 80.2%) completed the questionnaire; however, 318 survey records were removed from analysis as fraudulent enrollments. The final sample size was 1749, the maximum number of waves achieved was 191, and sampling covered all 6 geographic regions and 75 of 77 (97.4%) provinces; convergence was reached for several salient variables. The mean age was 20.5 (SD 4.0) years, and most (69.8%) had never tested for HIV before, with fear of stigma as the biggest reason (97.1%) for not having tested. Most (76.9%) had visited gay-focused physical venues several times a week. A condom was used in 97.6% of the last sex acts, 11.0% had purchased sex from other men (past 12 mo), 4.5% had sold sex to men (past 12 mo), and 95.3% had 3+ male sex partners (last 3 mo). No participant in Bangkok presented for a blood draw.

Conclusions: We successfully conducted a web-based RDS survey among MSM in Thailand, covering nearly the entire country, although, as in physical RDS surveys, sampling was dominated by younger MSM. The survey also failed to collect biomarkers in Bangkok. Public health interventions should aim at increasing testing and addressing (the perception of) stigma.

JMIR Public Health Surveill 2024;10:e58076

doi:10.2196/58076

Keywords



The global share of new HIV infections due to key populations and their partners is steadily rising and was estimated at 70% in 2021 [1]. Key populations, including men who have sex with men (MSM), transgender persons, sex workers, people who inject drugs, and people in prisons and other enclosed settings, face extraordinarily high HIV acquisition risks [1], warranting intense HIV control efforts tailored to these populations. To inform programming and policy making, accurate HIV estimates at the population level are needed, in addition to monitoring and evaluation of programs. However, the lack of sampling frames for key populations poses a substantial challenge for surveillance and estimation, and necessitates work-arounds to generate population-level estimates, such as HIV prevalence, uptake of services, and progress toward the Joint United Nations Programme on HIV and AIDS (UNAIDS) 95-95-95 targets. Currently, the most frequently used sampling and analytic design to facilitate such population-level estimates is respondent-driven sampling (RDS) [2,3]. RDS represents an advanced version of snowball sampling, with refinements in the way data are collected and analyzed. Relying on the recruitment among socially connected peers, RDS surveys are typically conducted in a single location and rely on staffed survey offices for data collection.

In 2021, Thailand had 48.59 million internet users, representing 69.5% of the population, an increase of 7.4% from 2020. Additionally, there were 55.00 million social media users, accounting for 78.7% of the total population [4]. According to a survey conducted among gay men in Thailand (n=277), 78.7% of respondents reported using social network apps such as Facebook and Jack’d to find partners [5]. The rise of the internet and the increasing access to and time spent on the internet also open new possibilities for HIV surveillance, such as population size estimation [6,7] and surveys [8]. Similar to most health-related web-based surveys, web-based data collection among key populations primarily relies on samples based on self-selection and or unknown sampling probabilities [9,10]. Successfully implemented web-based RDS surveys among various populations [11,12], including MSM [13], remain sparse perhaps because building the required IT infrastructure can appear daunting. Still, web-based surveys facilitating population estimates are attractive for 2 principal reasons: the promise of large-scale surveys covering geographies not normally included in biobehavioral surveys [14] due to budgetary or sampling (rural areas) reasons and lower costs compared with a “brick-and-mortar” survey office. However, new challenges arise with web-based data collection such as fraudsters or duplicate enrollments due to the compensation paid out to recruits, as well as the impracticality of biomarker collection. For HIV-focused population surveys, biomarkers are often considered essential metrics facilitating estimates such as HIV prevalence, HIV status awareness, or viral load suppression.

Thailand faces a key population-dominated HIV epidemic with an estimated adult HIV prevalence of 1.0% [15]. Most new infections (52%) appear to arise from male-male sex [16]. Thailand has a long history of HIV surveillance, including time location sampling surveys for MSM and benefits from a robust case surveillance system [17]. A consensus meeting reviewing data from 2014 to 2018 concluded that HIV prevalence and incidence among MSM in Bangkok declined [18] but nevertheless remained high, with UNAIDS estimating HIV prevalence among MSM in Thailand at 11.9% [19].

This study aimed to evaluate and demonstrate the feasibility of a web-based RDS survey that could be implemented as a routine surveillance system in the country. The goal was to enhance surveillance efforts to reach hard-to-access populations in Thailand, where there is a high rate of social media use. If this study proved that web-based surveys are feasible in Thailand, it could establish a standardized web-based surveillance model for the country. This would provide valuable data for effectively addressing the HIV/AIDS epidemic.

We piloted a web-based RDS survey with biomarker collection among MSM in Thailand. In this paper, we report on the survey’s design, implementation, and findings.


Survey Setting and Design

We conducted a web-based RDS survey among MSM in all of Thailand, dubbed the Kai Noi survey. The survey was implemented by staff from the Division of Epidemiology (Ministry of Public Health, Thailand) with funding support and technical assistance from the US Centers for Disease Control and Prevention (CDC). The RDS survey system’s digital architecture is explained in detail in “A Web-Based, Respondent-Driven Sampling Survey Among Men Who Have Sex With Men (Kai Noi): Description of Methods and Characteristics” [20]. In short, a website was constructed to fully function like a conventional RDS survey office, including coupon verification, eligibility screening, consenting, interviewing (self-administered), peer recruitment training, coupon issuance, compensation, and recruitment tracking. All these functions were automated; even so, a data manager monitored recruitment and data collection, answered anecdotal questions from recruits (via the web or by phone), and supervised the execution of payments. Additional survey monitoring included coupon uptake and sampling speed, as well as convergence and bottleneck graphs.

Eligible participants and seeds (ie, participants who started the peer referral-based sampling process) were born male, identified as male, were 15 years or older, had anal sex with another man in the last 6 months, had not previously participated in this survey, resided in Thailand, could read Thai, and presented a valid electronic coupon (eCoupon; except seeds). Seeds were solicited through banner ads or micromessaging in gay-friendly social media (pages) and were sought from each of Thailand’s 6 geographic regions to ensure geographic diversity and social distance. Eligible and consenting seeds started the recruitment process using eCoupons that could be distributed through various social media, such as chat applications, or SMS. eCoupons were labeled with a nonserial coupon ID and displayed a web address linking to the survey website [21].

Interview Data Collection

The self-administered questionnaire was short and covered basic demographics, personal network size (used to inform sampling weights), recruiter-recruit relationship, cyberspace use, HIV service uptake, and HIV-related risk behaviors. Preprogrammed data checks and skip patterns were embedded. An attention filter question was inserted in about the middle of the questionnaire (ie, a question informing the recruit that this question probes whether he pays attention and asking him to select a specific response value). Time points at which sentinel events occurred were recorded for documentation and monitoring.

Data Analysis

RDS Analyst [22] software was used to derive weighted estimates, adjusting for the complex sampling design using Gile’s SS estimator and a national MSM size estimate of 588,000 (AIDS Epidemic Model, Thailand data sheet, unpublished data, 2022). We conducted an analysis to determine the independence between recruiters and recruits by examining the tendency for people to form social ties with others who share similar traits, which is known as homophily. The analysis considered two scenarios: (1) when the outcome clusters by network or (2) when network members cluster in space and the outcome is spatially clustered. We also performed a convergence analysis to assess whether the sample size in this study was sufficient to achieve results in the desired characteristics, and to determine if increasing the sample size further would lead to no changes in the results. Additionally, we conducted a univariate analysis, stratified by age or other characteristics as needed.

Biomarkers

Following interview completion, participants in Bangkok only were also offered to attend 1 of 10 routine health clinics of their choice for a blood draw. A venous blood sample would be tested for HIV serology (using the prevailing HIV rapid test algorithm), HIV recency, hepatitis B and C, and syphilis. At the clinic, recruits were to present their phone number and undergo in-person consenting at the clinic.

Ethical Considerations

The survey protocol was approved by the Ethical Review Committee of the MOPH, and was approved by the CDC in Atlanta, United States. Informed consent was obtained electronically, separately for the interview and biomarker testing. A copy of the consent could be downloaded by the participant. Personal identifiers, including cell phone numbers and IP addresses, were collected and converted to unique anonymous codes. All records were labeled using the eCoupon ID number. Participants could call survey staff with questions. At the clinic, recruits were to be pre- and posttest counseled and, as warranted, initiated on HIV preexposure prophylaxis (PrEP) or treatment, as well as provided treatment or care for syphilis and viral hepatitis as per MOPH guidelines. Recruits were compensated for survey enrollment and interviews (US $8.80); peer recruitment (US $4.40) for each successfully recruited peer, up to a maximum of US $8.80; and biomarker collection at a clinic (US $14.70).

The research project was approved by the Ethics Committee for Research in Human Subjects, Department of Disease Control, Thailand (FWA number 00013622). The US CDC determined that it was research with CDC investigators not engaged (Project ID 0900f3eb81afe111).

Survey Costs

The costs for preparing and implementing the web-based RDS survey included meetings and consultations to design the survey and develop the protocol, creation of the Kai Noi website, graphic design, hardware equipment, training, testing, server rental fees, volunteer compensation, web-based advertising, and compensation paid to survey participants, resulting in total costs of US $36,046 or US $22 per participant (n=1643).


Sampling

The Kai Noi survey was implemented from February to June 2022. A total of 673 self-selected web-based candidate seeds volunteered to initiate the eligibility screening; of these, 390 (57.9%) were deemed ineligible and a further 248 (36.8%) did not complete the eligibility interview. The remaining 35 (5.2%) candidates were eligible and all consented to enroll. We initiated peer recruitment with 21 of the 35 candidate seeds and added the remaining 14 seeds during the sampling period. Of the 35 seeds, 13 (31.1%) successfully recruited at least 1 peer, and the remaining 22 did not successfully recruit any eligible peers.

Each seed and recruit was issued 3 eCoupons for peer recruitment. Of 6207 eCoupons issued, 3000 (45.1%) were redeemed. The mean time between eCoupon issuance and redemption was 9.7 (SD 32.7) hours. A further 1008 eCoupons were presented that had already been redeemed and were therefore deemed invalid; in addition, 243 eCoupon IDs logged in by candidate participants were deemed invalid as they never been had issued. Almost half (48%) of candidate recruits logged into the survey website between 5 PM and 11:59 PM (GMT +7).

A total of 2578 candidate recruits (including seeds) with valid eCoupons and unique phone numbers were admitted to screening for eligibility, of which 136 (5.3%) were deemed ineligible due to reasons related to sex or gender (36.1%), age (5.9%) or lack of recent same-sex behavior (58.0%); a further 291 (11.3%) did not complete the eligibility screening questionnaire. The remaining 2151 (83.4%) were deemed eligible. Of these, 2142 (83.1%) agreed to participate, and of these, 2067 (80.2%) completed the questionnaire. During the sampling period, survey staff noticed potentially fraudulent behavior, confirmed it, and subsequently identified 318 survey records as repeat enrollments and hence fraudulent. These records, along with their redeemed and unredeemed eCoupons were removed from the analysis, leaving 1749 valid survey records. The mean social network size (based on having been in contact with known and likely eligible peers within the last 7 days) among enrolled survey participants was 2.3 (IQR 1-3). We excluded 103 records stemming from participants who did not pass the main interview’s attention filter question, leaving a final sample size of 1643 for the main interview data analysis. This sample restriction did not apply to the eligibility screening data (sex, age, residence, and same-sex sexual behavior) which was administered separately and prior to the main interview.

Table 1 displays seed-related sampling characteristics. For the sample of 1749 recruits, the maximum number of waves achieved was 191; with 1 seed accounting for 1549 (88.6%) recruits sampled. Recruits were sampled from all 6 regions (Table 2) and from 75 of 77 (97.4%) provinces (data not shown).

Figure 1 displays the distribution of recruitment by seed (n=1749); Figure 2 shows the distribution of survey enrollments across Thailand.

Table 1. Description of recruitment by seed.
CharacteristicAge (years)DegreeProvince (seed)Max number of wavesNumber of recruits (including seeds)Percentage of sample (%)
Seed number
13110Bangkok18734.2
4208Payao4181.0
53620Bangkok130.2
7222Ratchaburi350.3
12255Bangkok140.2
13305Nongbualamphu15482.7
143910Chiang Mai250.3
15315Chiang Rai3120.7
17195Khon Khaen191154988.6
18316Ang thong120.1
19472Phetchaburi120.1
28283Ratchaburi130.2
313014Nonthaburi130.2
Total (excluding nonproductive seeds)a172798.7
Number of nonproductive seeds221.3
Grand total1749100

aNot applicable.

Table 2. Demographic characteristics of men who have sex with men sampling. Sample size for age data is 1749, as age data were collected in the eligibility interview, separate from the main interview, with the embedded attention filter question that led to the exclusion of 103 records from the main interview, leading to a final sample size of 1643 for data analysis.
CharacteristicsUnweighted, n (%)Weighted (%)
PointL95% CIaU95% CIb
Sampling by region (n=1749)
 North336 (19.2)20.117.422.9
 Central653 (37.3)37.133.540.8
 Northeast378 (21.6)22.219.325.0
 West220 (12.6)12.710.415.0
 East74 (4.2)4.13.05.1
 South88 (5.0)3.82.45.2
Age distribution (years), (n=1749)
 15‐19807 (46.1)59.351.267.4
 20‐24552 (31.6)25.921.430.4
 25‐29315 (18.0)11.58.015.0
 30+75 (14.3)3.31.455.3
Highest education (n=1643)
 No schooling14 (0.9)0.50.01.7
 Secondary school39 (2.4)2.70.84.5
 High school1138 (69.3)83.076.489.6
 Diploma98 (6.0)3.82.35.2
 Bachelor233 (14.2)6.12.99.3
 Master and higher121 (7.4)3.91.36.6
Work (n=1643)
 Student927 (56.4)70.658.382.8
 Employee320 (19.5)16.99.124.6
 Own business99 (6.0)2.71.34.1
 Laborer97 (5.9)3.71.46.0
 Office worker67 (4.1)2.00.93.2
 Unemployed60 (3.7)1.30.03.4
 Merchant35 (2.1)1.00.41.7
 Government employee18 (1.1)0.70.21.1
 Other20 (1.2)0.90.41.4
Reasons to use the internet (multiple choice format)
 To find sex1451 (88.3)89.385.992.6
 To sell or buy sex87 (5.6)1.91.42.4
 To learn about HIV or use HIV services35 (2.1)0.60.30.9
 To pay or receive money259 (15.8)16.112.020.1
 To play games1006 (61.2)68.161.374.9
Social app use (multiple choice format)
 BlueD1103 (53.2)77.664.690.5
 Grindr75 (3.6)3.01.24.8
 Hornet92 (4.4)2.61.43.9
 Jack’d71 (3.4)2.10.63.5
 Gay Romeo286 (13.8)9.03.614.4
 Tinder445 (21.5)16.711.621.8
 Facebook779 (47.4)39.532.346.6
 Line555 (33.8)20.814.627.0
 Instagram417 (25.4)17.013.021.1
 Twitter494 (30.1)18.514.722.2

aL95%CI: lower 95% CI.

bU95%CI: upper 95% CI.

Figure 1. Recruitment tree by seed.
Figure 2. Distribution of enrollments by residence (N=1749) across 77 provinces in Thailand, displayed on a map with color-scale gradient based on the number of participants in each province.

Homophily and Convergence

Homophily

Table 3 displays homophily by select characteristics. We observed homophily for all 4 traits examined, with substantial homophily for age, region of residence, and timing of last HIV test. Recruiters were almost twice as likely to recruit peers sharing their age (group), region of residence, or HIV status as expected for random recruitment. There was little homophily for condom use.

Table 3. Homophily by select characteristics.
TraitCategoriesHomophilyP value
Age (years)15‐19, 20‐24, 25‐34, and 35+1.97<.001
ResidenceRegions 1 through 61.86<.001
Timing of last HIV testLast 12 months, >12 months ago, never1.98<.001
Condom use at last sexYes, no1.08<.001
Convergence

Figures 3-6 display convergence graphs for select characteristics. Convergence for age and region was reached after approximately 1000 participants, and that for (regional) residence and condom use was seemingly reached earlier. The interpretability of bottleneck graphs was greatly diminished as 1 seed’s recruitment tree accounted for 89% of the total sample, hence no bottleneck graphs are displayed here.

Figure 3. Convergence graph for age.
Figure 4. Convergence graph for HIV testing.
Figure 5. Convergence graph for condom use at last sex.
Figure 6. Convergence graph for region.

MSM Characteristics

Demographics

Table 2 displays the demographic characteristics of MSM sampling covering all 6 regions. The mean age was 20.5 (SD 4.0, median 20, IQR 19‐24, range 15‐56) years. Most had a high school education or higher and were currently in school. The majority (70.6%) used the internet to find sex or play games; BlueD was the most used social media app (77.6%).

HIV-Related Behaviors and Service Uptake

Most (76.9%) MSM visited gay-focused venues several times a week, 11.0% had purchased sex from men in the past 12 months, and 4.5% had sold sex to men in the past 12 months (Table 4). Almost all (95.3%) had more than 2 sex partners in the last 3 months, and almost all (97.6%) used a condom at their last sex act. Few (4.0%) had sex with a woman in the last 12 months or had injected drugs in the last 12 months (0.3%). Two-thirds (68.9%) had never tested for HIV, almost always out of fear of being stigmatized (97.1%). Current PrEP use was high (89.4%).

Table 4. Risk behaviors and service uptake. Only 1 participant stated that his last test result was HIV-positive (data not shown).
Risk behavior (n=1643)Unweighted, n (%)Weighted (%)
PointL95% CIaU95% CIb
Visits to gay-friendly venues (eg, saunas, clubs, bath houses) several times a week1095 (66.6)76.970.583.3
Bought sex from other men (last 12 mo)376 (22.9)11.07.214.8
Sold sex to other men (last 12 mo)211 (12.8)4.52.76.2
Two or more male sex partners (last 3 mo)1555 (95.6)95.393.197.5
Condom use at last same sex act1554 (94.6)97.695.899.4
Sex with women (last 12 mo)108 (6.6)4.01.86.2
Injecting drug use (last 12 mo)8 (0.5)0.30.00.6
Service uptake
 Timing of last HIV test
  In the last 12 months675 (41.1)28.711.745.7
  Longer than 12 months ago54 (3.3)1.50.42.6
  Never tested for HIV914 (55.6)69.852.187.5
 Reason for never having tested for HIV (n=914)
  Don’t feel at risk40 (4.4)1.60.05.7
  Fear of positive result14 (1.5)0.70.11.3
  No money to get tested12 (1.3)0.10.00.3
  No time to get tested16 (1.8)0.40.01.4
  Fear of stigma832 (91.0)97.191.9100.0
 Preexposure prophylaxis (PrEP; restricted to those who reported previous HIV negative test result, n=728)
  Have heard about PrEP676 (92.9)95.688.2100.0
  Currently using PrEP612 (90.5)89.480.198.8
  Among those who are not currently using PrEP (n=64)
   Used it in last 6 months4 (6.3)2.61.23.9
   Used it >6 months ago7 (10.9)7.52.612.5
   Never used it53 (82.8)89.984.795.1
  Among those never used PrEP (main reason; n=53)
   Don’t know about it1 (1.9)0.30.00.7
   Don’t know where to get prep4 (7.5)5.41.79.1
   Embarrassed to ask for it3 (5.7)1.60.04.3
   Don’t feel at risk36 (67.9)81.264.997.6
   Afraid of side effects4 (7.5)4.90.011.5
   Don’t want others to know5 (9.4)6.50.016.9

aL95% CI: lower 95% CI.

bU95% CI: upper 95% CI.

Biomarkers

There were 144 participants who resided in Bangkok. All of them were invited to attend 1 of 10 clinics to provide a blood sample; 82% agreed online to do so. However, no recruit presented at any of the 10 clinics during the survey, and no biomarker data were collected.


We successfully conducted a web-based RDS survey among MSM in Thailand, adding to the still small body of literature on web-based probability sampling among key populations. The survey covered nearly the entire country, with no region or province dominating recruitment, and reached convergence for salient variables. However, our goal of collecting biomarkers was not achieved, and the fraudulent enrollments are a reminder that financial compensation for web-based data collection may quickly attract fake (duplicate) enrollments.

As seen with other conventional RDS surveys, sampling was dominated by younger, relatively well-educated MSM. A large proportion of MSM uses the internet to “find” (mostly noncommercial) sex, which suggests that the web may be a useful tool to engage MSM for safe sex and HIV prevention messages. Of potential importance are the estimates for social media apps as these estimates can inform the reach of public health messaging in cyberspace across various apps. The most frequently used app appears to be BlueD, followed by Facebook. The estimated HIV testing uptake was low with less than a third ever having tested for HIV, even considering the age distribution skewed toward younger ages. Still, restricting our analysis to just the 4 provinces overlapping with the 2020 MSM biobehavioral surveys (Bangkok, Chiang Mai, Chonburi, and Phuket) [23], the lifetime HIV testing uptake estimates appear similar (web-based RDS: 63.8%; biobehavioral surveys: 52.9%). “Fear of stigma” was by far the most frequent reason given for not having tested, an important finding for Thailand’s HIV testing services. The low rate of HIV testing is contrasted by high proportions of MSM reporting condom use (at last sex) and PrEP use.

This survey also confirms that many (young) MSM continue to use offline venues such as saunas, bars, and clubs several times a week. HIV risk behaviors were confirmed in several dimensions, including buying and selling sex, as well as having multiple sex partners (for most MSM, 3 to 5 partners in the preceding 3 months). A positive finding was the very high proportion of condomized sex acts (98%).

Several substantial challenges and limitations must be noted, including a very low proportion (n=1) of participants reporting an HIV-positive status, which is unlikely for a sample size of 1643 and suggests that some participants were reluctant to report an HIV-positive status. Convergence for the traits examined was only reached after hundreds of recruits had been sampled, perhaps due to the very large sampling area. Because a single seed accounted for a very large proportion of our sample, we could not assess bottlenecks; at the same time, this observation supports the assumption that Thai MSM form a single network. The self-reported degrees were somewhat lower than expected, raising the possibility of inaccuracies. Further, our survey was exposed to fraud, an unsurprising risk given that compensation in RDS surveys is common. Survey staff suspected and eventually confirmed large-scale fraud when a participant contacted the survey staff with questions related to compensation that appeared odd to our staff. Over 300 fraudulent enrollments were detected despite multiple security measures to prevent such fraud. From an analysis point of view, the fraud cases in our survey represented mostly a loss of funds (paid-out compensation) and, assuming we identified all fraudulent “participants,” did not do lasting damage to the dataset, as we simply removed fraudulent enrollments from the dataset for analysis. Further, because (unsurprisingly) none of the fraudulent “recruits” peer-referred valid recruits, our dataset did not experience breaks in the recruitment chains. Web-based RDS surveys are well advised to maximize antifraud measures and to monitor proactively to detect fraudulent enrollments. Our national-level survey may mask important differences across localities and hence may not make local surveys redundant. Our biggest limitation is perhaps the failure to collect blood samples from recruits residing in greater Bangkok. The reasons for this are unclear, as most participants residing in Bangkok indicated via the web that they would be willing to present at a clinic, in exchange for additional financial compensation. Alternative designs involving compensation (all-in-one compensation paid only after the blood draw), blood collection (mailing kits for self-collection), or testing (mailing self-tests) should be evaluated. Of note, our survey was launched during the peak period of COVID-19 transmission in Thailand. The number of persons testing for HIV at the same 10 Bangkok sites as in our survey decreased from 82,054 in 2020 to 60,124 in 2022 during the fifth wave of Omicron COVID-19 [24].

The strengths of our survey included a fully automated RDS system that fulfilled all essential functions of a “brick-and-mortar” RDS survey office, including checking coupon validity, eligibility screening, consent, interview, coupon issuance, peer recruitment training, and compensation, promising and delivering efficiency and resource savings. The web-based design also facilitated a sampling process that covered the entire country. The code for our web-based RDS system is available from the investigators and can be redeployed with minimal preparatory effort within Thailand, that is, it can easily be adapted for other target populations. The code could also be used outside Thailand, assuming IT expertise is available to adapt the survey to a different geographic setting.

While the up-front coding work to construct a virtual RDS office can be substantial, other tasks in survey preparation become obsolete, such as securing physical survey office space, or may pose substantially smaller burdens, such as the number of staff or computer and other hardware equipment needed.

The need for key population–specific population-level estimates will keep increasing as countries monitor their progress toward equitable HIV epidemic control. With internet use becoming more ubiquitous, there is potential for web-based RDS surveys to help fill these data needs. More operational research is needed though in refining web-based RDS methods, including fraud prevention and detection, facilitating a more age-diverse sample, and bridging the online-offline gap in collecting biomarker data.

Acknowledgments

This project has been supported by the President’s Emergency Plan for AIDS Relief (PEPFAR) through the Centers for Disease Control and Prevention (CDC) under the terms of Cooperative Agreement #CDC-RFA-GH16-1676.

Disclaimer

The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the funding agencies.

Data Availability

The datasets generated or analyzed during this study are not publicly available due to ethical research considerations, but are available from the corresponding author on reasonable request.

Conflicts of Interest

None declared.

  1. UNAIDS. IN DANGER: UNAIDS Global AIDS Update 2022. Joint United Nations Programme on HIV/AIDS (UNAIDS); 2022. [CrossRef]
  2. Heckathorn DD. Respondent-driven sampling: a new approach to the study of hidden populations. Soc Probl. May 1997;44(2):174-199. [CrossRef]
  3. Heckathorn DD. Respondent-driven sampling II: deriving valid population estimates from chain-referral samples of hidden populations. Soc Probl. Feb 2002;49(1):11-34. [CrossRef]
  4. Digital 2021: Thailand. DataReportal. 2021. URL: https://datareportal.com/reports/digital-2021-thailand [Accessed 2024-11-29]
  5. Boonchutima S, Sriwattana S, Rungvimolsin R, Palahan N. Gays dating applications: information disclosure and sexual behavior. J Health Res. 2017;30(4):231-239. [CrossRef]
  6. Son VH, Safarnejad A, Nga NT, et al. Estimation of the population size of men who have sex with men in Vietnam: social app multiplier method. JMIR Public Health Surveill. Apr 17, 2019;5(2):e12451. [CrossRef] [Medline]
  7. Baral S, Turner RM, Lyons CE, et al. Population size estimation of gay and bisexual men and other men who have sex with men using social media-based platforms. JMIR Public Health Surveill. Feb 8, 2018;4(1):e15. [CrossRef] [Medline]
  8. Hlatshwako TG, Shah SJ, Kosana P, et al. Online health survey research during COVID-19. Lancet Digit Health. Feb 2021;3(2):e76-e77. [CrossRef] [Medline]
  9. Wiatrek S, Zlotorzynska M, Rai R, Sullivan P, Sanchez T. The Annual American Men’s Internet Survey of behaviors of men who have sex with men in the United States: key indicators report 2018. JMIR Public Health Surveill. Mar 4, 2021;7(3):e21812. [CrossRef] [Medline]
  10. Weatherburn P, Hickson F, Reid DS, Marcus U, Schmidt AJ. European men-who-have-sex-with-men internet survey (EMIS-2017): design and methods. Sex Res Soc Policy. Dec 2020;17(4):543-557. [CrossRef]
  11. Górny A, Salamońska J. Web-based respondent-driven sampling in research on multiple migrants: challenges and opportunities. In: Pötzschke S, Rinken S, editors. Migration Research in a Digitized World: Using Innovative Technology to Tackle Methodological Challenges. Springer International Publishing; 2022:59-77. [CrossRef]
  12. Wejnert C, Heckathorn DD. Web-based network sampling: efficiency and efficacy of respondent-driven sampling for online research. Sociol Methods Res. 2008;37(1):105-134. [CrossRef]
  13. Bengtsson L, Lu X, Nguyen QC, et al. Implementation of web-based respondent-driven sampling among men who have sex with men in Vietnam. PLoS One. 2012;7(11):e49417. [CrossRef] [Medline]
  14. Global HIV Strategic Information Working Group. Biobehavioral survey guidelines for populations at risk for HIV. Centers for Disease Control and Prevention. 2017. URL: https://stacks.cdc.gov/view/cdc/50667 [Accessed 2024-11-29]
  15. UNAIDS and AIDSinfo. Country factsheets colombia 2021 HIV and AIDS estimates. UNAIDS. 2021. URL: https://www.unaids.org/en/regionscountries/countries/thailand [Accessed 2023-04-11]
  16. Estimated HIV infections, Thailand. HIV Info HUB. URL: https://hivhub.ddc.moph.go.th/epidemic.php [Accessed 2023-04-11]
  17. จันทรมณี ไ, จันทรมณี ส. A prevalence survey of HIV and sexually transmitted infection among men who have sex with men (MSM), Thailand: venue day time sampling (VDTS) and respondent driven sampling (RDS). Dis Control J. 2016;42(4 SE-Original Article):269-279. [CrossRef]
  18. van Griensven F, Phanuphak N, Manopaiboon C, et al. HIV prevalence and incidence among men who have sex with men and transgender women in Bangkok, 2014-2018: outcomes of a consensus development initiative. PLoS One. 2022;17(1):e0262694. [CrossRef] [Medline]
  19. UNAIDS data 2022. UNAIDS. 2022. URL: https://www.unaids.org/sites/default/files/media_asset/data-book-2022_en.pdf [Accessed 2023-04-11]
  20. Karuchit S, Thiengtham P, Tanpradech S, et al. A web-based, respondent-driven sampling survey among men who have sex with men (Kai Noi): description of methods and characteristics. JMIR Form Res. May 20, 2024;8:e50812. [CrossRef] [Medline]
  21. Kai noi survey. Kainoi.net. URL: http://aidsboe.moph.go.th/kainoi/ [Accessed 2023-04-11]
  22. RDS analyst: software for the analysis of respondent-driven sampling data. The Hard-to-Reach Population Methods Research Group. 2022. URL: https://hpmrg.org [Accessed 2024-11-29]
  23. Bio-behavioral survey among MSM, Thailand. AIDS Surveillance System. URL: http://aidsboe.moph.go.th/aids_system/index.php?link=bbs [Accessed 2023-04-11]
  24. National Health Security Office web report/napwebreport. NHSO. URL: http://napdl.nhso.go.th/NAPWebReport/main_rep.jsp [Accessed 2023-04-11]


CDC: Centers for Disease Control and Prevention
eCoupon: electronic coupon
MSM: men who have sex with men
PrEP: preexposure prophylaxis
RDS: respondent-driven sampling
UNAIDS: Joint United Nations Programme on HIV and AIDS


Edited by Amaryllis Mavragani; submitted 06.03.24; peer-reviewed by Brady Bennett, Elysee Tuyishime; final revised version received 24.08.24; accepted 19.09.24; published 06.12.24.

Copyright

© Watcharapol Srinor, Suvimon Tanpradech, Panupit Thiengtham, Samart Karuchit, Charif Naksuk, Thitipong Yingyong, Thananda Naiwatanakul, Sanny Northbrook, Wolfgang Hladik. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 6.12.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.