The Impact of COVID-19 on People Living with HIV: A Network Science Perspective

Christopher, Jared; Nelson, Aiden; Somerville, Paris; Patel, Simran; Matta, John

doi:10.3390/covid5080119

Open AccessArticle

The Impact of COVID-19 on People Living with HIV: A Network Science Perspective

by

Jared Christopher

,

Aiden Nelson

,

Paris Somerville

,

Simran Patel

and

John Matta

^*

Department of Computer Science, Southern Illinois University Edwardsville, Edwardsville, IL 62025, USA

^*

Author to whom correspondence should be addressed.

COVID 2025, 5(8), 119; https://doi.org/10.3390/covid5080119

Submission received: 28 June 2025 / Revised: 23 July 2025 / Accepted: 25 July 2025 / Published: 28 July 2025

(This article belongs to the Section COVID Public Health and Epidemiology)

Download

Browse Figures

Versions Notes

Abstract

People living with HIV (PLWH) faced diverse challenges during the COVID-19 pandemic, including disruptions to care, housing instability, emotional distress, and economic hardship. This study used graph-based clustering methods to analyze pandemic-era experiences of PLWH in a national sample from the NIH’s All of Us dataset (n = 242). Across three graph configurations we identified consistent subgroups shaped by social connectedness, housing stability, emotional well-being, and engagement with preventive behaviors. Comparison with an earlier local study of PLWH in Illinois confirmed recurring patterns of vulnerability and resilience while also revealing additional national-level subgroups not observed in the smaller sample. Subgroups with strong social or institutional ties were associated with greater emotional stability and proactive engagement with COVID-19 preventive behaviors, while those facing isolation and structural hardship exhibited elevated distress and limited engagement with COVID-19 preventive measures. These findings underscore the importance of precision public health strategies that reflect the heterogeneity of PLWH and suggest that strengthening social support networks, promoting housing stability, and leveraging institutional connections may enhance pandemic preparedness and HIV care in future public health crises.

Keywords:

COVID-19; HIV; syndemics; network science; graph clustering; social determinants of health; pandemic vulnerability; resilience

1. Introduction

The dual pandemics of HIV and COVID-19 have placed substantial strain on vulnerable populations, deepening existing disparities in health outcomes, service access, and social stability. People living with HIV (PLWH) have faced heightened risks, including increased susceptibility to severe COVID-19, interruptions in routine care, and intensified economic hardship [1]. These overlapping challenges point to the need for analytic frameworks capable of capturing the complex, intersecting determinants of risk and resilience among PLWH during times of crisis.

Network science and graph theory provide a useful approach to this challenge, offering tools to represent individuals as nodes linked by shared attributes or experiences. This allows for the detection of latent subgroups that may exhibit common patterns of vulnerability or resilience [2]. In contrast to traditional statistical models, graph-based methods can capture nonlinear and multidimensional relationships that emerge from the social fabric of health determinants.

Our earlier work [3], based on a local survey of HIV-positive individuals and their partners in semi-urban and rural Illinois, demonstrated that graph-based clustering could reveal meaningful subgroups with differing experiences of discrimination, healthcare access, and pandemic-related hardship. That study used a small local sample and identified distinct clusters of PLWH facing varying levels of structural vulnerability and community support.

Building on this approach, the current study expands the analysis to a national level, leveraging data from NIH’s All of Us dataset, which involved a large, diverse cohort of participants across the United States. Our focus is on the subset of HIV-positive individuals who completed both the Social Determinants of Health (SDoH) and COVID-19 Participant Experience (COPE) [4] surveys. Although the All of Us dataset is large, the number of HIV-positive respondents with completed COPE surveys during the relevant time window was modest (n = 242), reflecting persistent challenges in capturing timely data on this population.

A key strength of this work is its ability to identify both vulnerability and resilience within the population of PLWH. While some clusters were marked by housing instability, social disconnection, or emotional distress, others reflected relative stability and protective factors, such as emotionally resilient retirees or individuals with strong institutional ties. The recurrence of similar cluster types across three distinct graph configurations lends confidence to the robustness of these findings and demonstrates the capacity of graph-based methods to reveal complex, emergent social structures.

By extending graph-based clustering to a national cohort, this study contributes to the growing literature on syndemic interactions between HIV and COVID-19. It provides a flexible methodological framework for detecting meaningful subgroups among PLWH, with direct implications for targeted interventions and pandemic preparedness efforts. In particular, the results highlight the value of network-based methods in uncovering population heterogeneity that may not be readily captured by conventional analytic approaches.

2. Related Work

Research on the intersection of HIV and COVID-19 has largely emphasized clinical and public health outcomes, with relatively limited application of network science methods. Papers by Grubb et al. [5] and Lopez et al. [6] examined network characteristics in analyzing the spread of HIV. Brown et al. [7] investigated the impact of the COVID-19 pandemic on HIV prevention and treatment services, highlighting disruptions in testing, access to antiretroviral therapy, and declines in viral suppression, disruptions that were particularly pronounced in underserved communities. The present study builds on this work by further examining the social and economic consequences of COVID-19 for PLWH.

Several papers document the compounded vulnerabilities experienced by PLWH during the COVID-19 pandemic. Elevated vaccine hesitancy among PLWH has been noted [8], often linked to concerns about side effects and the lack of tailored information. Resilience-focused care models have been proposed to help mitigate the psychological impacts of the pandemic [9], which include increased anxiety, depression, and social isolation [10]. Mental health concerns are particularly pressing given the preexisting stigma and barriers to care faced by PLWH, now exacerbated by pandemic-related disruptions.

The socioeconomic impact of COVID-19 has also disproportionately affected PLWH, who have experienced higher rates of job loss [11], economic instability, and interruptions in health insurance coverage [12]. Social determinants of health as they relate to PLWH are examined in [2,13]. These trends motivated our decision to analyze COVID-related economic hardship as a central variable in our graph-theoretic approach.

Although the present study is grounded in graph-based methods, relatively few studies have applied network analysis to HIV-related data in the context of COVID-19. Most network science applications in HIV research have historically focused on transmission dynamics, contact tracing, or intervention design [14,15,16]. Social and sexual network analyses have been used to identify high-risk clusters and inform targeted outreach strategies, particularly among MSM and substance-using populations. Graph-based molecular epidemiology, in which transmission clusters are inferred from HIV genetic sequence data, has also become a critical tool for public health surveillance [17].

More recent work has begun to integrate network science with machine learning to uncover the latent community structure and improve intervention targeting. Xiang et al. [18] review the use of artificial intelligence and machine learning in HIV care, highlighting predictive models for diagnosis, adherence, and behavioral risk. However, few of these approaches explicitly use graph-theoretic methods to model inter-individual similarity across multiple dimensions of socioeconomic and health experience, as is performed in the present study.

3. Methods

3.1. Network Science and Its Use in This Study

This study applies network science and graph-theoretic methods to identify patterns among HIV-positive individuals during the COVID-19 pandemic. In this framework, individuals are represented as nodes (also called vertices), and connections between individuals, based on their similarity across multiple features, are represented as edges. Each edge is assigned a weight that reflects the degree of similarity between two participants.

We follow the approach used in previous work [3], in which networks are constructed from one-hot encoded survey responses, and community detection algorithms are applied to uncover clusters of individuals with shared characteristics or experiences. This allows us to examine whether certain traits, such as social isolation, economic hardship, or resilience, tend to co-occur and define meaningful subgroups within the population. For an illustrative example of how we convert survey answers into a graph, see Appendix A.

Unlike traditional statistical approaches that rely on large sample sizes to achieve significance, network science enables the detection of structure and community patterns, even in modestly sized datasets. This is particularly valuable in studying subpopulations, such as PLWH, where granular insights are often needed but large targeted samples may be difficult to obtain.

Graph-based methods offer a complementary perspective to traditional regression or latent class modeling by allowing the population structure to emerge organically from the data without prespecifying groupings. The overall analytic workflow, from data preparation to clustering and interpretation, is summarized in Figure 1.

3.2. Dataset: All of Us

To investigate the impact of COVID-19 on individuals living with HIV, we used data from the NIH’s All of Us Research Program [19]. Using the Cohort Builder tool [20], we identified a cohort of HIV-positive participants who had completed the Basics, Social Determinants of Health (SDoH), and COVID-19 Participant Experience (COPE) surveys [4]. These instruments were selected to capture multidimensional aspects of participants’ health, socioeconomic context, and pandemic-related experiences.

A total of 360 participants met these inclusion criteria. The cohort was demographically diverse, with a broad age range (18–65+) and approximately one-third identifying as Black or African American (Figure 2 and Figure 3). To temporally align the data with the Burden of HIV survey [3], we restricted the responses to those from the February 2021 COPE survey release. This final filtering yielded a sample of 242 individuals, whose survey responses were used for the downstream analysis.

3.3. Dataset: The Burden of HIV Survey

The Burden of HIV survey [3,21], conducted in 2021 and 2022, was modeled on established surveys of HIV and HIV-adjacent populations, including the Sexual Acquisition and Transmission of HIV Cooperative Agreement Program (SATHCAP) survey [22], conducted in 2006–2008, and the Latino MSM Community Involvement: HIV Protective Effects Survey (LMSM) [23], conducted in 2005. Owing to its extensive list of questions and the diversity of the 22 survey participants, the Burden of HIV survey functions more as an ethnographic instrument, documenting changes in social determinants of health, economic outcomes, and social and medical environments since those earlier surveys.

3.4. Data Preprocessing and Feature Selection

All question–answer pairs from the three selected surveys were one-hot encoded, yielding a binary matrix with 958 features representing individual responses. For example, a feature might represent the answer to a question such as “Were you laid off work due to COVID,” where the values are 1 for yes and 0 for no. In this context, features may also be referred to as variables.

To focus the analysis on COVID-19-related economic hardship, we selected a multi-response question (Athena Code: 1333291) that asked participants to indicate whether they had experienced various pandemic-related disruptions, including job loss, income reduction, or difficulties affording childcare [24]. Responses to this question were treated as the binary outcome variable in a logistic regression model, with the remaining one-hot encoded features serving as predictors.

We retained the top 15, 30, 45, 100, and 250 features most positively associated with economic hardship, providing flexibility for later graph construction while reducing the risk of overfitting.

3.5. Graph Construction

We constructed k-nearest neighbor (KNN) graphs based on participant similarity in the reduced feature spaces. For each feature subset (15, 30, 45, 100, and 250 features), the corresponding response matrix was used to compute KNN graphs using scikit-learn’s NearestNeighbors model [25]. Each participant was a node, and edges were formed by connecting each node to its k nearest neighbors, with k values of 2, 3, 5, 7, and 10. The reciprocal of the Euclidean distance between feature vectors was used as the edge weight such that more similar individuals were connected by stronger (higher weight) edges.

All graphs were constructed using NetworkX [26], resulting in 25 undirected, weighted graphs with 242 nodes each. Each graph was fully connected.

3.6. Clustering Methods

To identify subgroups of participants with shared characteristics, we applied three graph-based clustering methods: (1) the Louvain algorithm; (2) the NBR-Clust framework using vertex attack tolerance (VAT) as the resilience metric; and (3) the NBR-Clust framework using integrity as the resilience metric. Each of these methods reflects a distinct strategy for identifying meaningful structure within the graph, helping reveal hidden subgroups, and increasing robustness to methodological biases.

The first method was the widely used Louvain algorithm [27], which detects communities by optimizing the modularity, a measure of how well a network is divided into clusters. Modularity compares the actual density of edges within a cluster to the density that would be expected if edges were distributed randomly, given each node’s degree. Clusters with a high modularity contain more intra-cluster connections than would be expected by chance. The Louvain algorithm operates in a greedy, hierarchical manner, repeatedly grouping nodes into communities and then refining the clustering.

The second and third methods were based on the NBR-Clust framework [28], which uses resilience-based metrics to uncover meaningful subgroups in a network. Specifically, we applied NBR-Clust with two different resilience measures: vertex attack tolerance (VAT) [29,30] and integrity [31]. This approach identifies individuals who act as structural “bridges” in the network, specifically nodes whose removal would fragment the graph into smaller components. By conceptually removing these key individuals, NBR-Clust reveals latent subgroups that are tightly connected, often reflecting distinct patterns of vulnerability, resilience, or social experience.

In a public health context, this approach helps uncover subgroups linked by shared lived experiences, such as housing insecurity, barriers to care, or pandemic-related hardship. For example, NBR-Clust may identify one cluster of participants primarily affected by job loss, another experiencing severe social isolation, and another facing disruptions in healthcare access. These clusters can guide more tailored intervention strategies.

For completeness, the mathematical definitions of the VAT and integrity metrics are provided below. Readers are not expected to follow the formulas in detail; they serve to formalize how these methods detect structurally meaningful subgroups in the network.

V A T (G) = min_{S \subset V} \{\frac{| S |}{| V - S - C_{max} (V - S) | + 1}\},

(1)

I (G) = min_{S \subset V} \{| S | + C_{max} (V - S)\},

(2)

where S is the set of nodes to be removed, and

C_{max} (V - S)

is the size of the largest connected component after removing S.

Each clustering method produced a different number of clusters, reflecting its sensitivity to different aspects of network structure. The combined use of modularity- and resilience-based methods provided a richer view of the population’s heterogeneity.

3.7. Cluster Evaluation Metrics

To evaluate the quality of the resulting clusterings, we computed four complementary metrics using scikit-learn [25]:

Modularity, which quantifies partitions based on the edge density.
Calinski–Harabasz index [32], which assesses the ratio of between-cluster to within-cluster dispersion (higher values indicate better-defined clusters).
Davies–Bouldin index [33], which measures the average similarity between each cluster and its most similar counterpart (lower values indicate better separation).
Silhouette [34], which evaluates the cohesion and separation of clusters, defined as

$s (i) = \frac{b (i) - a (i)}{max {a (i), b (i)}},$

(3)

where $a (i)$ is the mean intra-cluster distance and $b (i)$ is the mean nearest-cluster distance for each point i.

Together, these metrics guided the selection of the most informative graph and clustering configurations for further analysis.

3.8. Cluster Over-Representation Calculation

To interpret the content of each cluster, we analyzed the frequency of each one-hot encoded feature within the cluster, relative to its frequency in the full sample. The percentages shown in Figure 4, Figure 5 and Figure 6, as well as in Table A1, Table A2 and Table A3, reflect this relative over-representation.

For each feature f, we computed the difference in proportions as follows:

Enrichment (f) = \frac{Frequency (f, Cluster)}{Size (Cluster)} - \frac{Frequency (f, Population)}{Size (Population)}

Positive values indicate that a feature is over-represented in the cluster compared with the overall sample, while negative values indicate under-representation. This calculation allowed us to characterize each cluster in terms of demographics, social determinants, emotional well-being, and pandemic-era hardships.

3.9. Cluster Frequency Analysis

Each graph was clustered using all three community detection methods. For each resulting set of clusters, the performance was evaluated using the four metrics described above. Full results for all graphs and clustering configurations are shown in Appendix B.

To select representative graphs for further analysis, we applied a “Rank Sum” approach: for each graph, the ranks across the four metrics (Davies–Bouldin, Silhouette, Calinski–Harabasz, and Modularity) were summed, with lower totals indicating better overall performance. Ties were resolved by the number of first-place rankings. This process allowed us to identify the most informative clustering configurations from each method.

Based on this selection procedure, the following graphs were chosen for deeper analysis and visualization: the 250-feature,

k = 7

, Louvain-clustered graph (Louvain F250K7); the 250-feature,

k = 10

, NBR-Clust with VAT (VAT F250K10); and the 15-feature,

k = 10

, NBR-Clust with integrity (INT F15K10). Cluster frequency analysis was performed on each of these graphs to identify the over- and under-represented features within each cluster.

The resulting clustered graphs are presented in Figure 4, Figure 5 and Figure 6. In accordance with the All of Us data dissemination policy, clusters with fewer than 20 members are not shown.

In summary, this study used graph-based methods to analyze the patterns of social, economic, and pandemic-related experiences among people living with HIV. The highest-ranked configurations were selected for further analysis, providing a basis for identifying latent communities of vulnerability and resilience, as presented in the following section.

4. Results

Figure 4, Figure 5 and Figure 6 present the clustering results for the three graph configurations analyzed in this study. In each case, we observed distinct subgroups of HIV-positive individuals with varying profiles of housing stability, emotional well-being, pandemic-related behaviors, and social connectedness. As described in Section 3.8, the percentages shown in each figure represent the relative over-representation of traits within a cluster, that is, the degree to which a given trait is more (or less) common in the cluster than in the full sample. Positive percentages indicate over-representation.

4.1. F250K7 Graph: Clustering with Full Feature Set and Louvain

Figure 4 shows the clustering results from the F250K7 graph constructed from the full feature set using Louvain community detection. This analysis produced five well-defined subgroups.

One cluster reflected housing instability and the inability to work. Participants unable to work were +26% over-represented, housing instability concerns were +25%, and inability to pay rent due to COVID-19 impacts was +15%. This subgroup illustrates the intersection of employment disruption and housing stress among PLWH during the pandemic.

Two clusters reflected contrasting patterns of neighborhood context and social connectedness. The “Safe Neighborhood” cluster was enriched for perceptions of neighborhood safety (+41%), although participants in this group who stayed home 3–4 days per week were over-represented by 40%. The “Community Disconnection and Isolation” cluster was enriched for social isolation (+20%), living alone (+21%), and not testing for COVID-19 (+34%), despite reporting a relatively safe environment. This suggests that objective neighborhood safety does not always translate into social well-being.

A fourth cluster reflected “COVID-cautious” behavior in higher-crime areas. Participants were +39% more likely to perceive crime in their neighborhood and were enriched for recent COVID-19 testing (+39%) and COVID-cautious behaviors.

Finally, one cluster represented individuals that received strong social support. Participants were enriched for receiving meals (+63%), bedside care (+57%), and companionship (+37%). This likely reflects individuals in assisted living settings or those with consistent caregiving support.

4.2. F250K10 Graph: Clustering Based on All Features Clustered with VAT

Figure 5 presents clustering results from the F250K10 graph constructed using NBR-Clust with VAT. This clustering yielded four subgroups larger than 20 participants.

Two clusters reflected isolation and economic hardship, though in different social contexts. The “Poor and Socially Isolated” cluster was enriched for never married (+22%), low income (+15%), and isolation (+11%), though some reported having help when confined in bed (+17%). The “Poor, Diverse, and Isolated” cluster had more women and black participants than average, and showed higher rates of disability (+36%), severe poverty (+31%), and limited mobility (+35%).

Another cluster reflected “Strong Social Support.” The participants consistently reported high levels of perceived and received support across multiple domains: always feeling loved (+47%), having meals prepared (+40%), and having companions for enjoyable activities (+38%).

The fourth cluster reflected “Household and Relationship Stability.” The participants were enriched for partnered living (+41%), long-term residence (+28%), and employment-based insurance (+34%), alongside positive emotional indicators.

4.3. F15K10 Graph: Clustering Based on Integrity

Figure 6 shows the results from the F15K10 graph clustered using NBR-Clust with integrity. Two clusters reflected emotional resilience and low pandemic distress. In the “Low-Anxiety Retirees” cluster, participants not bothered by anxiety were +42% over-represented, with similar enrichment for retirement status (+34%) and reports of being unaffected by COVID-19 (+25%). The “Neighborhood-Satisfied Retirees” cluster was enriched for positive neighborhood perceptions (+79%) and emotional well-being (+43%).

A second pair of clusters reflected institutional connection and proactive engagement. The “Employed and Insured” cluster was enriched for wage employment (+77%) and workplace-based COVID-19 testing (+21%). The “Outgoing and Vaccinated” cluster was marked by active social engagement (+54%) and elevated healthcare access.

Finally, one cluster reflected moderate anxiety with adaptive coping. Participants reported elevated anxiety (+55%) but also higher rates of positive coping indicators.

5. Discussion

This study used graph-based clustering methods to explore patterns of vulnerability and resilience among PLWH during the COVID-19 pandemic using national data from the All of Us Research Program. Across three varying graph configurations, we identified consistent subgroups shaped by housing stability, social connectedness, emotional well-being, and access to institutional resources.

A recurring theme was the importance of social and institutional connectedness as a protective factor. Subgroups characterized by strong relationships, whether through family, household partnerships, community networks, or stable employment, were consistently associated with higher emotional well-being and greater engagement with preventive behaviors, such as COVID-19 testing and vaccination. This is consistent with prior literature showing that relational stability and trust in institutions can mitigate pandemic-related stress and barriers to care.

In contrast, other subgroups faced overlapping challenges of housing instability, isolation, and emotional distress. These patterns reinforce a syndemic perspective [35], in which structural and psychosocial disadvantages compound one another to shape health risks.

5.1. Comparisonwith Prior Regional Study

It is informative to compare these results with our earlier work that analyzed a local survey of PLWH in semi-urban and rural Illinois [3]. That study used a similar network science framework, but was based on a small, locally recruited sample (n = 22), whereas the present analysis draws on a larger, nationally representative dataset (n = 242) from the All of Us research program. Table 1 summarizes the key similarities and differences between the two studies.

Despite the differences in scale and sampling, both studies revealed consistent clustering patterns involving social support, isolation, and hardship. For instance, in both datasets, we observed clusters of PLWH with strong family or community connections who also showed higher emotional well-being and greater engagement in COVID-related care behaviors. Conversely, clusters marked by housing instability or weak social networks were more likely to report emotional distress and reduced access to services. These parallels reinforce the conclusion that relational and structural factors shape both resilience and vulnerability during health crises.

There were also important differences between the studies, due in part to variations in the survey design and sampling context. The Illinois survey included more detailed questions about stigma, healthcare discrimination, and LGBT+ community belonging—factors that were not explicitly captured in the All of Us instruments. As a result, certain themes that emerged in the regional study, such as race-based healthcare discrimination or identity-based community engagement, were less prominent or absent in the national analysis. Conversely, the larger and more diverse national sample revealed new patterns, including clusters of emotionally resilient retirees and subgroups characterized by institutional trust and proactive health behaviors.

These differences underscore both the value and the limitations of comparing regional and national datasets. While the Illinois study offered rich, context-specific insights, the present study complements it by capturing population-level patterns and demonstrating the scalability of graph-based clustering approaches in large-scale public health research.

5.2. Limitations

This study has several limitations. First, although All of Us is a large and diverse national dataset, our analysis focused on a subset of 242 HIV-positive participants. This relatively modest sample size limits generalizability to the broader population of PLWH, particularly given the heterogeneity of experiences during the COVID-19 pandemic. However, the analytic approach employed, based on network science and graph-based clustering, was exploratory rather than inferential, and aimed to detect latent structure and uncover subgroups with shared patterns of vulnerability and resilience. Unlike traditional statistical methods that require large sample sizes to support significance testing, network analysis can reveal meaningful insights, even in smaller, well-characterized samples.

We also acknowledge that our reliance on self-reported data introduced the possibility of response bias, including underreporting or overreporting of health experiences. However, because our clustering approach focused on patterns across multiple dimensions rather than on individual variables, it may be more robust to isolated inaccuracies than methods that rely on precise measurement of single constructs. Additionally, by applying multiple clustering methods across a broad range of graph configurations, we increased the robustness of our findings and were better able to see hidden and hard-to-find populations.

The selection of COVID-related economic hardship as the primary target variable may have overlooked other crucial dimensions of the pandemic’s impact, including healthcare quality, stigma experiences, and more nuanced aspects of social support. These important factors were either not included or not sufficiently detailed in the available survey instruments. Future research should integrate richer measures to better capture the lived experiences of PLWH during public health crises, while also seeking to replicate these findings in larger, more targeted cohorts and explore the broader application of graph-based methods across other public health domains.

5.3. Strategic Opportunities

Taken together, these findings highlight opportunities for precision public health strategies that target support to the most vulnerable subgroups while leveraging existing sources of resilience.

Based on these results, several policy implications emerge:

Pandemic preparedness and HIV care programs should prioritize strengthening social support networks. Interventions that build connections, whether through peer groups, family-based support, or community programs, may help buffer isolation and emotional distress among PLWH during future public health emergencies [36].
Housing stability should be addressed as a core component of both HIV care and pandemic planning. Stable housing is known to support health outcomes among PLWH and may reduce compounding risks during crises [37,38].
Public health systems should leverage institutional touchpoints, including employment settings, healthcare providers, and schools, to deliver pandemic-related services, such as testing, vaccination, and mental health resources [39].
Public health agencies should adopt analytic methods that account for heterogeneity. Graph-based clustering techniques can help identify subgroups with complex, intersecting vulnerabilities and inform targeted, equity-focused interventions [40].

For subgroups marked by housing instability, evidence-based approaches, such as Housing First programs and integrated case management services, may be especially effective at reducing both the health risk and social vulnerability. For clusters characterized by social isolation, interventions should prioritize community re-engagement through peer navigation, virtual support groups, or culturally tailored outreach. Meanwhile, subgroups with strong institutional trust and care engagement represent opportunities to reinforce and scale up protective behaviors, such as testing and vaccination, through trusted settings. By aligning intervention strategies with the unique characteristics of each cluster, public health agencies can move toward a more precise, equity-driven response.

6. Conclusions

By applying graph-based clustering to a national cohort of PLWH, this study demonstrates that network science methods can robustly capture latent patterns of vulnerability and resilience. The approach provides a replicable framework for analyzing heterogeneity within public health populations and complements traditional methods by identifying subgroups whose experiences warrant targeted interventions.

Future research should extend these methods to additional datasets and pandemic contexts and further assess their utility in informing precision public health strategies. Despite the study’s limitations, the results provide new insights into the lived experiences of PLWH during COVID-19 and demonstrate the potential for graph-based methods to inform future public health responses.

Author Contributions

Conceptualization, J.M., A.N., P.S., and S.P.; methodology, J.C., A.N., and J.M.; software, A.N. and J.C.; formal analysis, J.C., J.M., A.N., and S.P; data curation, A.N. and S.P.; writing—original draft preparation, A.N., J.M., P.S., and S.P.; writing—review and editing, J.C. and J.M.; visualizations, J.C.; supervision, J.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study protocol was approved as exempt by the Institutional Review Board of Southern Illinois University Edwardsville (protocol code 2342, 7 December 2023).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data used are available to approved researchers via the NIH All of Us website at https://allofus.nih.gov/ (accessed on 24 July 2025). The Burden Of HIV survey results and analysis materials are freely available, and others are encouraged to use this data. The code used in the study can be accessed at https://github.com/SIUEComplexNetworksLab/BOHComplexNetworks (accessed on 24 July 2025). The data are available at https://www.openicpsr.org/openicpsr/project/192186/version/V1/view (accessed on 24 July 2025).

Acknowledgments

We gratefully acknowledge the All of Us participants for their contributions, without whom this research would not have been possible. We also thank the National Institutes of Health’s All of Us Research Program (https://allofus.nih.gov/ (accessed on 24 July 2025)) for making available the participant data examined in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PLWH	People living with HIV
COVID-19	Coronavirus disease 2019
COPE	COVID-19 Participant Experience Survey
SDoH	Social Determinants of Health
KNN	k-nearest neighbor
VAT	Vertex attack tolerance
NBR-Clust	Node-based Resilience Clustering
CH	Calinski–Harabasz index
DB	Davies–Bouldin index
LMSM	Latino MSM Community Involvement Survey
SATHCAP	Sexual Acquisition and Transmission of HIV Cooperative Agreement Program

Appendix A. Illustrative Example of Graph Construction

To illustrate how participant similarity is represented in graph form, consider the following simplified example. Three fictional participants respond to three yes/no survey items.

Participant	Lost Job Due to COVID	Difficulty Paying Rent	Felt Socially Isolated
Alice	Yes	Yes	No
Bob	Yes	Yes	Yes
Carol	No	No	Yes

Using one-hot encoding, each participant’s responses are converted to a binary vector, where yes is encoded as 1 and no as 0:

Alice: [1, 1, 0];
Bob: [1, 1, 1];
Carol: [0, 0, 1].

The similarity between participants can then be calculated as the distance between their response vectors using a metric such as Euclidean distance. Smaller distances correspond to greater similarity:

Distance between Alice and Bob: $\sqrt{{(1 - 1)}^{2} + {(1 - 1)}^{2} + {(0 - 1)}^{2}} = 1$ ;
Distance between Alice and Carol: $\sqrt{{(1 - 0)}^{2} + {(1 - 0)}^{2} + {(0 - 1)}^{2}} = \sqrt{3} \approx 1.73$ ;
Distance between Bob and Carol: $\sqrt{{(1 - 0)}^{2} + {(1 - 0)}^{2} + {(1 - 1)}^{2}} = \sqrt{2} \approx 1.41$ .

In the resulting graph:

Each participant is represented as a node.
Edges connect nodes based on pairwise similarity.
Edge weights are set as the inverse of the distance so that stronger (more similar) relationships have higher weights.

This example illustrates how survey response data are transformed into a similarity graph, providing the foundation for subsequent community detection and cluster analysis.

Appendix B. Clustering Selection Tables

The results include a table of clustering metric scores for each of 25 graphs containing the clustered respondents. These graphs range from using 15 to 250 features to construct neighbor relationships, and use K values of 2 to 10 to determine the maximum number of neighbors for any one node. After performing clustering and subsequent cluster analysis on each graph for each clustering method (Louvain, VAT, and INT), the following data tables resulted (broken down by clustering method). The graph chosen is highlighted in red.

Table A1. Table of Louvain graph-clustering metrics. The graph chosen for analysis is highlighted in red.

Feature	K-Value	Davies–	Silhouette	Calinski–	Modularity	Rank
Counts		Bouldin	Score	Harabasz		Sum
F250	K7	5.459	0.008	4.330	0.363	26
F250	K2	5.518	−0.006	2.812	0.56	30
F250	K5	5.527	−0.008	3.396	0.393	35
F250	K10	5.667	0.005	4.362	0.32	36
F250	K3	5.535	−0.013	2.587	0.472	43
F100	K3	5.547	−0.009	2.182	0.468	49
F45	K7	5.922	−0.007	3.124	0.387	50
F15	K2	5.119	−0.027	1.682	0.821	51
F30	K5	5.725	−0.015	2.523	0.483	51
F45	K5	5.624	−0.010	2.269	0.436	51
F45	K10	6.197	−0.003	3.362	0.349	51
F30	K7	6.195	−0.007	2.824	0.437	52
F100	K10	5.962	−0.005	3.328	0.308	53
F15	K3	5.704	−0.021	2.021	0.761	54
F30	K3	5.426	−0.029	2.140	0.576	54
F100	K5	6.008	−0.003	2.532	0.372	54
F100	K7	6.460	0.000	3.175	0.332	55
F15	K7	6.696	−0.015	2.240	0.642	61
F15	K10	6.886	−0.016	2.534	0.591	61
F30	K10	5.788	−0.011	2.308	0.347	61
F15	K5	6.123	−0.019	2.085	0.691	62
F45	K2	5.913	−0.018	1.996	0.609	62
F30	K2	5.860	−0.023	1.881	0.651	64
F100	K2	5.878	−0.019	2.001	0.578	64
F45	K3	6.079	−0.016	1.959	0.505	70

Table A2. Table of VAT graph-clustering metrics. The graph chosen for analysis is highlighted in red.

Feature	K-Value	Davies–	Silhouette	Calinski–	Modularity	Rank
Counts		Bouldin	Score	Harabasz		Sum
F250	K10	3.158	-0.040	1.690	0.201	34
F15	K3	5.389	−0.044	1.653	0.517	40
F100	K3	2.850	−0.057	1.436	0.377	40
F250	K7	2.252	−0.054	1.408	0.196	45
F15	K7	4.704	−0.027	2.185	0.083	45
F30	K5	4.082	−0.049	1.575	0.359	46
F250	K3	2.210	−0.067	1.359	0.369	47
F250	K5	2.267	−0.057	1.374	0.250	47
F15	K10	4.562	−0.031	2.005	0.066	47
F45	K5	3.809	−0.044	1.451	0.309	48
F100	K7	3.266	−0.058	1.556	0.236	49
F15	K2	3.819	−0.051	1.324	0.750	50
F100	K2	3.023	−0.075	1.336	0.478	51
F250	K2	1.908	−0.075	1.323	0.401	51
F100	K10	3.605	−0.041	1.489	0.055	51
F30	K7	4.055	−0.056	1.565	0.247	52
F30	K3	3.162	−0.083	1.382	0.442	53
F30	K2	3.408	−0.072	1.289	0.531	55
F100	K5	3.414	−0.063	1.389	0.245	58
F45	K10	2.570	−0.033	1.151	0.005	59
F30	K10	5.079	−0.031	1.480	0.027	60
F15	K5	4.143	−0.037	1.333	0.174	63
F45	K2	3.803	−0.081	1.276	0.488	66
F45	K3	3.758	−0.087	1.326	0.416	66
F45	K7	3.549	−0.062	1.213	0.053	77

Table A3. Table of INT graph-clustering metrics. The graph chosen for analysis is highlighted in red.

Feature	K-Value	Davies–	Silhouette	Calinski–	Modularity	Rank
Counts		Bouldin	Score	Harabasz		Sum
F15	K5	4.275	−0.041	1.714	0.631	33
F15	K7	4.343	−0.033	1.692	0.556	33
F15	K10	4.910	−0.022	1.896	0.506	33
F250	K3	2.099	−0.046	1.496	0.403	35
F250	K2	1.925	−0.054	1.417	0.403	44
F250	K5	1.916	−0.041	1.421	0.216	44
F100	K10	3.378	−0.040	1.692	0.197	47
F250	K10	2.846	−0.041	1.517	0.163	48
F30	K10	4.475	−0.036	1.780	0.256	49
F30	K5	3.845	−0.048	1.561	0.370	51
F30	K2	2.969	−0.062	1.356	0.554	53
F100	K2	2.484	−0.066	1.359	0.446	53
F45	K5	3.586	−0.044	1.455	0.319	54
F15	K2	2.371	−0.086	1.230	0.644	55
F250	K7	1.974	−0.048	1.357	0.170	56
F45	K7	3.131	−0.060	1.495	0.285	57
F100	K5	3.176	−0.052	1.493	0.275	57
F100	K3	2.438	−0.066	1.384	0.343	58
F100	K7	3.020	−0.053	1.513	0.214	58
F45	K10	3.592	−0.053	1.539	0.264	59
F15	K3	2.276	−0.089	1.152	0.507	60
F45	K2	2.737	−0.073	1.271	0.479	60
F30	K3	2.865	−0.080	1.306	0.403	65
F30	K7	3.589	−0.063	1.446	0.280	68
F45	K3	3.399	−0.081	1.323	0.419	70

Appendix C. Cluster Characteristics

Table A4. Table of features for the F250K7 graph, clustered with Louvain. Trait percentages represent over-representation relative to the full sample, calculated as described in Section 3.8. A visualization based on this table is shown in Figure 4, where colors in the table correspond to matching colors in the figure. Positive percentages are shown in green, while negative percentages are shown in red. Data on clusters smaller than 20 are not shown.

Class	N	Characteristics	Percentage
0	26	Were not tested for COVID-19 in the past month	29%
		Employment status: unable to work	26%
		Have stable house concerns	25%
		Do not have enough money to pay rent because of COVID-19 pandemic	15%
		Stay home every day (under-represented)	−60%
		None of the days (0 days)—crime concern (under-represented)	−53%
		None of the days (0 days)—neighborhood clean (under-represented)	−51%
		None of the days (0 days)—neighborhood safety (under-represented)	−35%
1	53	Strongly disagree that vandalism is common in your neighborhood	43%
		Strongly disagree that crime is a problem in your neighborhood	41%
		Stay home 3–4 days out of the week	40%
		Strongly disagree that too much alcohol use occurs in your neighborhood	40%
		Strongly disagree that your neighborhood is clean	39%
		Strongly agree that your neighborhood is safe	39%
		Strongly disagree that people take good care of houses/apartments	38%
		Strongly disagree with feeling unhappy when withdrawn	37%
		Received the COVID-19 vaccination	13%
2	49	Report that vandalism is not common	41%
		Report that crime occurs in your neighborhood	39%
		Were tested for COVID-19 in the past month	39%
		Had COVID test type: nasal swab	35%
		Current home ownership status: rent	33%
		Had no issue getting a test for COVID-19	22%
		Were tested for COVID-19 to get other healthcare services	20%
		Belong to a high-risk population	19%
		Strongly disagree that your neighborhood is clean (under-represented)	−37%
		Strongly disagree that your neighborhood is safe (under-represented)	−35%
3	64	Always have someone to prepare your meals	63%
		Always have someone to help with daily chores if you were sick	61%
		Always have someone to help you if confined to bed	57%
		Always have someone to help you deal with a personal problem	52%
		Always have someone to love and make you feel wanted	52%
		Always have someone to take you to the doctor if needed	50%
		Never feel there is no one you can turn to	39%
		Never feel lack of companionship	37%
		Never feel isolated from others	29%
		Personally know someone who has died of COVID-19	10%
4	50	Were not tested for COVID-19 in the past month	34%
		Disagree that too many people hang around streets near your home	34%
		Disagree that too much drug use occurs in your neighborhood	32%
		Disagree that crime is a problem in your neighborhood	31%
		Disagree that too much alcohol use occurs in your neighborhood	29%
		Agree that neighborhood is clean	29%
		Agree that neighborhood is safe	26%
		Live alone (household size excluding self = 0)	21%
		Race: White	21%
		Sometimes feel isolated from others	20%

Table A5. Table of features for the F250K10 graph, clustered with the NBR-Clust framework with VAT. Trait percentages represent over-representation relative to the full sample, calculated as described in Section 3.8. A visualization based on this table is shown in Figure 5, where colors in the table correspond to matching colors in the figure. Positive percentages are shown in green, while negative percentages are shown in red. Data on clusters smaller than 20 are not shown.

Class	N	Characteristics	Percentage
0	72	Current marital status: never married	22%
		Sometimes get help when confined to bed	17%
		Annual income: 10k–25k	15%
		Agree that neighborhood is clean	15%
		Often feel isolated from others	11%
		Strongly agree that neighborhood is clean (under-represented)	−28%
		Always have someone to prepare your meals (under-represented)	−28%
		Always have someone to love and make you feel wanted (under-represented)	−28%
		Always have someone to help if confined to bed (under-represented)	−28%
		Always have someone to help with chores when sick (under-represented)	−25%
2	23	Employment status: unable to work (disabled)	36%
		Stayed home all day in last 5 days	35%
		Annual income: <10k	31%
		Feels loved by God/higher power many times a day	31%
		Gender identity: woman	27%
		Race ethnicity: black	26%
		Wishes to be closer to God/higher power	25%
		Gender identity: man (under-represented)	−39%
		1-person living situation (under-represented)	−36%
		Sexual orientation: gay (under-represented)	−35%
3	62	Always have someone to love and feel wanted	47%
		Always have someone to help if confined to bed	42%
		Always have someone to prepare your meals	40%
		Always have someone to have a good time with	38%
		Always have someone to take you to the doctor	37%
		Always have someone to help with chores	37%
		Always have someone to turn to for help	36%
		Felt confident about handling problems	35%
		Never feel there is no one to turn to	33%
		Personally knows someone who died of COVID-19	13%
4	26	1 other person lives at home with you	41%
		Most of the time have someone to love	39%
		Insurance type: employer or union	34%
		Felt nervous/stressed last month	31%
		Lived 20+ years in current situation	28%
		Current marital status: living with partner	23%
		Current marital status: married	21%
		Received 1 dose of COVID-19 vaccine	13%
		Medicaid/government assistance plan (under-represented)	−29%
		0 people under 18 in living situation (under-represented)	−28%

Table A6. Table of features for the F15K10 graph, clustered with the NBR-Clust framework with integrity. Trait percentages represent over-representation relative to the full sample, calculated as described in Section 3.8. A visualization based on this table is shown in Figure 6, where colors in the table correspond to matching colors in the figure. Positive percentages are shown in green, while negative percentages are shown in red. Data on clusters smaller than 20 are not shown.

Class	N	Characteristics	Percentage
0	28	Not at all bothered by anxiety/nervousness	42%
		Employment status: retired	34%
		Never felt difficulties piling up	33%
		Sometimes feel outgoing	31%
		Never felt unable to control important things	30%
		Current status: retired	29%
		Never felt nervous and stressed	29%
		Were unaffected by the COVID-19 outbreak	25%
		Often feel outgoing (under-represented)	−41%
		Employment status: employed (under-represented)	−34%
6	40	Several days of anxiety/nervousness	55%
		Sometimes feel things going their way	52%
		Several days of little interest/pleasure	36%
		Sometimes feel confident about future	29%
		Sometimes feel in control of life	29%
		Sometimes feel optimistic	26%
		Did not receive the COVID-19 vaccination	14%
		Not at all bothered by anxiety (under-represented)	−39%
		Very often confident handling problems (under-represented)	−31%
		Not at all bothered by stress (under-represented)	−27%
7	29	Employment status: employed for wages	77%
		Employment status: employed (part-time or full-time)	74%
		Have insurance through employer/union	49%
		Insurance type: employer/union	45%
		Currently working	34%
		Were tested for COVID-19 because of work or school	21%
		Retired status (under-represented)	−38%
		No social assistance (under-represented)	−32%
		Medicare coverage (under-represented)	−32%
		No days off work (under-represented)	−31%
8	22	Strongly agree neighborhood has recreation facilities	79%
		Not at all bothered by anxiety/nervousness	43%
		Strongly agree about bicycle facilities	36%
		Employment status: retired	34%
		Not at all bothered by lack of interest/pleasure	31%
		Strongly disagree about neighborhood problems	31%
		Never felt unable to cope	29%
		Never felt overwhelmed by responsibilities	28%
		Received the COVID-19 vaccination	19%
		Several days of anxiety (under-represented)	−29%
9	21	Often feel outgoing	54%
		Sometimes feel things going their way	39%
		Some of the time feel supported	24%
		Have Medicare coverage	24%
		Were tested for COVID-19 to get other healthcare services	22%
		Received the COVID-19 vaccination	22%
		Sexual orientation: straight	20%
		No social issues (under-represented)	−24%
		No discrimination experiences (under-represented)	−24%
		Sexual orientation concerns (under-represented)	−24%

References

Bogart, L.M.; Ojikutu, B.O.; Tyagi, K.; Klein, D.J.; Mutchler, M.G.; Dong, L.; Lawrence, S.J.; Thomas, D.R.; Kellman, S. COVID-19 related medical mistrust, health impacts, and potential vaccine hesitancy among Black Americans living with HIV. JAIDS J. Acquir. Immune Defic. Syndr. 2021, 86, 200–207. [Google Scholar] [CrossRef] [PubMed]
Matta, J.; Singh, V.; Auten, T.; Sanjel, P. Inferred networks, machine learning, and health data. PLoS ONE 2023, 18, e0280910. [Google Scholar] [CrossRef] [PubMed]
Matta, J.; Sinha, K.; Woodard, C.; Sappington, Z.; Philbrick, J. Economic and Health Burdens of HIV and COVID-19: Insights from a Survey of Underserved Communities in Semi-Urban and Rural Illinois. In Complex Networks & Their Applications XII; Springer: Cham, Switzerland, 2023; pp. 189–201. [Google Scholar]
Schulkey, C.E.; Litwin, T.R.; Ellsworth, G.; Sansbury, H.; Ahmedani, B.K.; Choi, K.W.; Cronin, R.M.; Kloth, Y.; Ashbeck, A.W.; Sutherland, S.; et al. Design and implementation of the all of us research program COVID-19 Participant Experience (COPE) survey. Am. J. Epidemiol. 2023, 192, 972–986. [Google Scholar] [CrossRef]
Grubb, J.; Lopez, D.; Mohan, B.; Matta, J. Network centrality for the identification of biomarkers in respondent-driven sampling datasets. PLoS ONE 2021, 16, e0256601. [Google Scholar] [CrossRef]
Lopez, D.; Mohan, B.; Boone, L.; Matta, J. Preserving Multiple Homophilies in a Network Configuration Model. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Virtual, 1–5 November 2021; pp. 1781–1786. [Google Scholar]
Brown, L.B.; Spinelli, M.A.; Gandhi, M. The interplay between HIV and COVID-19: Summary of the data and responses to date. Curr. Opin. HIV AIDS 2021, 16, 63. [Google Scholar] [CrossRef]
Roman Shrestha, J.P.M.; Shenoi, S.; Khati, A.; Altice, F.L.; Mistler, C.; Aoun-Barakat, L.; Virata, M.; Olivares, M.; Wickersham, J.A. COVID-19 Vaccine Hesitancy and Associated Factors among People with HIV in the United States: Findings from a National Survey. Vaccines 2022, 10, 424. [Google Scholar] [CrossRef]
Lauren Brown, L.; Martin, E.G.; Knudsen, H.K.; Gotham, H.J.; Garner, B.R. Resilience-Focused HIV Care to Promote Psychological Well-Being During COVID-19 and Other Catastrophes. Front. Public Health 2021, 9, 705573. [Google Scholar]
Chenglin Hong, A.Q.; Hoskin, J. The impact of the COVID-19 pandemic on mental health, associated factors and coping strategies in people living with HIV: A scoping review. J. Int. AIDS Soc. 2023, 26, e26060. [Google Scholar] [CrossRef]
Elizabeth Weber Handwerker, P.B.M.; Piacentini, J.; Schultz, M.; Sveikauskas, L. Employment Recovery in the Wake of the COVID-19 Pandemic; US Bureau of Labor Statistics: Washington, DC, USA, 2020. [Google Scholar]
Ha Nguyen Thu, A.N.Q.; Hai, O.K.; Thanh, H.L.T.; Thanh, H.N. Impact of the COVID-19 pandemic on provision of HIV/AIDS services for key populations. Int. J. Health Plan. Manag. 2022, 37, 2852–2868. [Google Scholar] [CrossRef]
Sanjel, P.; Matta, J. Inferred networks and the social determinants of health. In Proceedings of the International Conference on Complex Networks and Their Applications, Madrid, Spain, 30 November–2 December 2021; Springer: Cham, Switzerland, 2021; pp. 703–715. [Google Scholar]
Latkin, C.A.; Knowlton, A. Micro-social structural approaches to HIV prevention: A social ecological perspective. AIDS Care 2005, 17, 102–113. [Google Scholar] [CrossRef] [PubMed]
Pasquale, D.K.; Doherty, I.A.; Leone, P.A.; Dennis, A.M.; Samoff, E.; Jones, C.S.; Barnhart, J.; Miller, W.C. Lost and found: Applying network analysis to public health contact tracing for HIV. Appl. Netw. Sci. 2021, 6, 13. [Google Scholar] [CrossRef]
Nikolopoulos, G.K.; Pavlitina, E.; Muth, S.Q.; Schneider, J.; Psichogiou, M.; Williams, L.D.; Paraskevis, D.; Sypsa, V.; Magiorkinis, G.; Smyrnov, P.; et al. A network intervention that locates and intervenes with recently HIV-infected persons: The Transmission Reduction Intervention Project (TRIP). Sci. Rep. 2016, 6, 38100. [Google Scholar] [CrossRef]
Worobey, M.; Gemmel, M.; Teuwen, D.E.; Haselkorn, T.; Kunstman, K.; Bunce, M.; Muyembe, J.J.; Kabongo, J.M.M.; Kalengayi, R.M.; Van Marck, E.; et al. Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960. Nature 2008, 455, 661–664. [Google Scholar] [CrossRef] [PubMed]
Xiang, Y.; Du, J.; Fujimoto, K.; Li, F.; Schneider, J.; Tao, C. Application of artificial intelligence and machine learning for HIV prevention interventions. Lancet HIV 2021, 9, e54–e62. [Google Scholar] [CrossRef] [PubMed]
National Institutes of Health (NIH). All of Us Research Program. Available online: https://allofus.nih.gov/ (accessed on 20 April 2024).
All of Us Research Program. Survey Explorer. Available online: https://researchallofus.org/data-tools/survey-explorer/ (accessed on 20 April 2024).
Matta, J. Intersectional Barriers Among PLHIV in Rural Illinois: Insights from a Pilot QCA Study. Int. J. Environ. Res. Public Health 2025, 22, 1011. [Google Scholar] [CrossRef]
Iguchi, M.; Berry, S.; Ober, A.; Fain, T.; Heckathorn, D.; Gorbach, P.; Heimer, R.; Kozlov, A.; Ouellet, L.; Shoptaw, S.; et al. Sexual Acquisition and Transmission of HIV Cooperative Agreement Program (SATHCAP), 2006–2008 [United States] Restricted Use Files. 2010. Available online: https://krimdok.uni-tuebingen.de/Record/1902791916/Description (accessed on 24 July 2025).
Ramirez-Valles, J.; Heckathorn, D.D.; Vázquez, R.; Diaz, R.M.; Campbell, R.T. From networks to populations: The development and application of respondent-driven sampling among IDUs and Latino gay men. AIDS Behav. 2005, 9, 387–402. [Google Scholar] [CrossRef]
Athena Health Database. Version 1.14.0.32.231116.1843, 2015–2024. OMOP Vocabulary Version: v5.0 29-FEB-24. Available online: https://athena.ohdsi.org/search-terms/terms/1333291 (accessed on 24 July 2025).
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Hagberg, A.A.; Schult, D.A.; Swart, P.J. Exploring network structure, dynamics, and function using NetworkX. In Proceedings of the 7th Python in Science Conference (SciPy2008), Pasadena, CA, USA, 19–24 August 2008; pp. 11–15. [Google Scholar]
Blondel, V.D.; Guillaume, J.L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef]
Matta, J.; Obafemi-Ajayi, T.; Borwey, J.; Wunsch, D.; Ercal, G. Robust graph-theoretic clustering approaches using node-based resilience measures. In Proceedings of the 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, Spain, 12–15 December 2016; pp. 320–329. [Google Scholar]
Matta, J.; Ercal, G.; Borwey, J. The vertex attack tolerance of complex networks. RAIRO-Oper. Res. 2017, 51, 1055–1076. [Google Scholar] [CrossRef]
Ercal, G.; Matta, J. Resilience notions for scale-free networks. Procedia Comput. Sci. 2013, 20, 510–515. [Google Scholar] [CrossRef]
Barefoot, C.; Entringer, R.; Swart, H. Integrity of trees and powers of cycles. Congr. Numer 1987, 58, 103–114. [Google Scholar]
Caliński, T.; Harabasz, J. A dendrite method for cluster analysis. Commun. Stat. 1974, 3, 1–27. [Google Scholar] [CrossRef]
Davies, D.L.; Bouldin, D.W. A Cluster Separation Measure. IEEE Trans. Pattern Anal. Mach. Intell. 1979, PAMI-1, 224–227. [Google Scholar] [CrossRef]
Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef]
Shiau, S.; Krause, K.D.; Valera, P.; Swaminathan, S.; Halkitis, P.N. The burden of COVID-19 in people living with HIV: A syndemic perspective. AIDS Behav. 2020, 24, 2244–2249. [Google Scholar] [CrossRef]
Sherr, L.; Clucas, C.; Harding, R.; Sibley, E.; Catalan, J. HIV and depression—A systematic review of interventions. Psychol. Health Med. 2011, 16, 493–527. [Google Scholar] [CrossRef] [PubMed]
Aidala, A.A.; Wilson, M.G.; Shubert, V.; Gogolishvili, D.; Globerman, J.; Rueda, S.; Bozack, A.K.; Caban, M.; Rourke, S.B. Housing status, medical care, and health outcomes among people living with HIV/AIDS: A systematic review. Am. J. Public Health 2016, 106, e1–e23. [Google Scholar] [CrossRef] [PubMed]
Leifheit, K.M.; Linton, S.L.; Raifman, J. Expiring eviction moratoriums and COVID-19 incidence and mortality. Am. J. Epidemiol. 2021, 190, 2503–2510. [Google Scholar] [CrossRef]
Chang, C.H.; Shao, R.; Wang, M.; Baker, N.M. Workplace interventions in response to COVID-19: An occupational health psychology perspective. Occup. Health Sci. 2021, 5, 1–23. [Google Scholar] [CrossRef] [PubMed]
Braveman, P.A.; Kumanyika, S.; Fielding, J.; LaVeist, T.; Borrell, L.N.; Manderscheid, R.; Troutman, A. Health disparities and health equity: The issue is justice. Am. J. Public Health 2011, 101, S149–S155. [Google Scholar] [CrossRef]

Figure 1. Analytic workflow used in this study. Graph-based clustering methods were applied to All of Us survey data to identify subgroups of PLWH with shared social, economic, and pandemic- related experiences.

Figure 2. Race distribution of the study participants (n = 242) based on self-reported responses from the All of Us dataset.

Figure 3. Age distribution of study participants by race based on self-reported data.

Figure 4. Clusters derived from the F250K7 graph (full feature set,

k = 7

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A4. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Figure 4. Clusters derived from the F250K7 graph (full feature set,

k = 7

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A4. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Figure 5. Clusters derived from the F250K10 graph (full feature set,

k = 10

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A5. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Figure 5. Clusters derived from the F250K10 graph (full feature set,

k = 10

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A5. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Figure 6. Clusters derived from the F15K10 graph (15 features,

k = 10

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A6. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Figure 6. Clusters derived from the F15K10 graph (15 features,

k = 10

). Percentages indicate trait over-representation relative to the full sample. A complete list of traits and percentages for this graph is contained in Appendix C, Table A6. Traits are shown for clusters with at least 20 participants, per the All of Us data dissemination policy.

Table 1. Key differences and consistencies between the Illinois HIV+ COVID-19 study [3] and the current national-level All of Us analysis.

Aspect	Illinois Paper (Small Sample)	All of Us National Paper (Large Sample)
Population	Local, semi-urban, and rural Illinois	National sample (All of Us)
Sample Size	19 completed respondents	242 respondents
Target Variables	HIV+ status, race (Black), COVID-19 economic impact	HIV+ with COPE and SDoH surveys
Clustering	Based on 30 variables chosen for specific targets	Clustering across full, VAT-ranked, and integrity-ranked features
Findings Unique to Illinois	Patterns of race-based discrimination, LGBT+ community belonging, and long COVID symptoms	These topics were not included in the All of Us survey items used
Findings Unique to National	Not applicable (small local focus)	Clear patterns of “Neighborhood-Satisfied Retirees,” proactive “Outgoing and Vaccinated” cluster, and widespread “Institutional connectedness” as a protective factor
Overlap/Consistencies	Clusters of isolation with hardship and clusters of strong social support	Same: consistent patterns of isolation vs. support across clusters

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Christopher, J.; Nelson, A.; Somerville, P.; Patel, S.; Matta, J. The Impact of COVID-19 on People Living with HIV: A Network Science Perspective. COVID 2025, 5, 119. https://doi.org/10.3390/covid5080119

AMA Style

Christopher J, Nelson A, Somerville P, Patel S, Matta J. The Impact of COVID-19 on People Living with HIV: A Network Science Perspective. COVID. 2025; 5(8):119. https://doi.org/10.3390/covid5080119

Chicago/Turabian Style

Christopher, Jared, Aiden Nelson, Paris Somerville, Simran Patel, and John Matta. 2025. "The Impact of COVID-19 on People Living with HIV: A Network Science Perspective" COVID 5, no. 8: 119. https://doi.org/10.3390/covid5080119

APA Style

Christopher, J., Nelson, A., Somerville, P., Patel, S., & Matta, J. (2025). The Impact of COVID-19 on People Living with HIV: A Network Science Perspective. COVID, 5(8), 119. https://doi.org/10.3390/covid5080119

Article Menu

The Impact of COVID-19 on People Living with HIV: A Network Science Perspective

Abstract

1. Introduction

2. Related Work

3. Methods

3.1. Network Science and Its Use in This Study

3.2. Dataset: All of Us

3.3. Dataset: The Burden of HIV Survey

3.4. Data Preprocessing and Feature Selection

3.5. Graph Construction

3.6. Clustering Methods

3.7. Cluster Evaluation Metrics

3.8. Cluster Over-Representation Calculation

3.9. Cluster Frequency Analysis

4. Results

4.1. F250K7 Graph: Clustering with Full Feature Set and Louvain

4.2. F250K10 Graph: Clustering Based on All Features Clustered with VAT

4.3. F15K10 Graph: Clustering Based on Integrity

5. Discussion

5.1. Comparisonwith Prior Regional Study

5.2. Limitations

5.3. Strategic Opportunities

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Illustrative Example of Graph Construction

Appendix B. Clustering Selection Tables

Appendix C. Cluster Characteristics

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI