Next Article in Journal
Associations between Health Literacy and Sociodemographic Factors: A Cross-Sectional Study in Malaysia Utilising the HLS-M-Q18
Next Article in Special Issue
Persistence as a Robust Indicator of Medication Adherence-Related Quality and Performance
Previous Article in Journal
The Seasonality Impact of the BTEX Pollution on the Atmosphere of Arad City, Romania
Previous Article in Special Issue
Primary Care Records of Chronic-Disease Patient Adherence to Treatment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Network Approach for the Study of Drug Prescriptions: Analysis of Administrative Records from a Local Health Unit (ASL TO4, Regione Piemonte, Italy)

1
Dipartimento di Scienza e Tecnologia del Farmaco, Università degli Studi di Torino, Via Pietro Giuria 9, 10125 Turin, Italy
2
Centro di Competenza sul Calcolo Scientifico C3S, Università degli Studi di Torino, Corso Svizzera 185, 10149 Turin, Italy
3
Struttura Complessa Farmacia Territoriale ASL TO4, Regione Piemonte, Via Po 11, 10034 Chivasso, Italy
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(9), 4859; https://doi.org/10.3390/ijerph18094859
Submission received: 9 March 2021 / Revised: 21 April 2021 / Accepted: 30 April 2021 / Published: 2 May 2021

Abstract

:
In a Drug Prescription Network (DPN), each drug is represented as a node and two drugs co-prescribed to the same patient are represented as an edge linking the nodes. The use of DPNs is a novel approach that has been proposed as a means to study the complexity of drug prescription. The aim of this study is to demonstrate the analytical power of the DPN-based approach when it is applied to the analysis of administrative data. Drug prescription data that were collected at a local health unit (ASL TO4, Regione Piemonte, Italy), over a 12-month period (July 2018–June 2019), were used to create several DPNs that correspond to the five levels of the Anatomical Therapeutic Chemical classification system. A total of 5,431,335 drugs prescribed to 361,574 patients (age 0–100 years; 54.7% females) were analysed. As indicated by our results, the DPNs were dense networks, with giant components that contain all nodes. The disassortative mixing of node degrees was observed, which implies that non-random connectivity exists in the networks. Network-based methods have proven to be a flexible and efficient approach to the analysis of administrative data on drug prescription.

1. Introduction

Drug prescription analysis is attaining resonance for multiple reasons. Firstly, the growing number of older adults means that there is an increased need to monitor comorbidity and poly-therapeutic regimens [1,2]. Secondly, the analysis of prescription patterns may be extremely useful for identifying potential inappropriate prescriptions, adverse drug events and potential drug–drug interactions, as well as for assessing epidemiological models and medication adherence on a large scale [3]. The analysis of medical records could also improve the delivery of care in healthcare settings, as it may be able to provide predictive and prescriptive knowledge. Lastly, research into drug prescription patterns may also be a way to rationalise and possibly prevent hospitalisation, and therefore reduce healthcare costs [4].
The increase in large-scale drug (co)prescription data collected by healthcare systems provides major challenges to the healthcare providers and regulators who are seeking to recognise patterns or trends from this massive amount of data. A data mining technique could be used to analyse and thus convert the collected raw data into specific information and knowledge desired by end-users. Data on drug prescriptions can be analysed by adopting inferential statistics [5], and can also be studied using network models. Network (graph) theory has been used to explore graphs since the 18th century [6]. A network consists of nodes (vertices), which represent the entities of the system. Pairs of nodes are joined by links (edges), which represent a particular kind of interconnection between those entities. In recent decades, research into complex networks has exploded thanks to the great analytical power that network models have to describe real-world applications [7,8,9]. Network models have also been used for the study of drug prescriptions, and the notion of a Drug Prescription Network (DPN) was introduced in two previous studies [10,11]. Each prescription in a DPN is represented by a node, while a pair of co-prescriptions to the same patient is represented by an edge linking the two nodes. Although the use of DPNs is still in its infancy, it has been demonstrated to be a valid means with which to highlight a number of network features, from the local scale (nodes and edges) to the global scale (the overall network).
The aim of this study is to expand upon previous findings on DPNs as a method for the study of drug prescriptions. In particular, concepts of the network theory were adopted to recognise patterns in a large dataset of (co)prescriptions.

2. Materials and Methods

2.1. Data

Drugs prescribed by general practitioners (GPs) operating in the Italian National Health Service setting in the area north of the city of Turin were considered in this study. The source of data was the local health unit (LHU; ASL TO4, Regione Piemonte, Italy) database and the work was carried out in compliance with the General Data Protection Regulation-EU 2016/679. In particular, the study was focused on the implementation of a methodological approach to describe the complexity of drug co-prescription through the adoption of network models, and the data were treated in full accordance with current privacy legislation.

2.2. Drug Prescription Networks

Data were provided in the form of DPNs. Each entry in the dataset was associated with a unique code from the Anatomical Therapeutic Chemical (ATC) classification system [12]. In particular, the ATC classification system consists of five levels ( L ), with subcategories nested within broader parent categories (Figure A1). Data on prescriptions of each ATCi,L (where i refers to drug classification at the given ATC level L ) and co-prescriptions of pairs ATCi,L–ATCj,L (with i j ) were considered, generating 5 networks (DPN-ATCL1–DPN-ATCL5). In these networks, the nodes were the ATCi,Ls found in the dataset and represented the number of patient(s) ( P i ) prescribed with drug i within ATC-level L . The edges were the pairs ATCi,L–ATCj,Ls and represented the number of patient(s) ( C i j ) co-prescribed with the drug pairs i and j within ATC-level L (Figure 1).

2.3. Analysis of Drug Prescription Networks

Several statistical and network measures were used to characterise the structure of the DPNs, including the degree ( k ), the betweenness centrality ( b ), the closeness centrality ( C l ) of a node and their average values relative to a network ( k ¯ , b ¯ and C l ¯ ); the density of the network ( d ), the assortativity coefficient ( r ) [13,14], the Pearson’s correlation coefficient for binary variables (ϕ) [11,15] and the Euclidean distance between graphs [16] were also determined. In addition, dendrograms, built using the ward. D2 linkage [17], were drawn to identify distinct clusters within the distance matrices (Appendix A).

2.4. Software

All analyses were performed using the R statistical and programming language (version 3.2.2; https://cran.r-project.org/; accessed on 12 February 2020) in combination with Cytoscape (version 3.7.2; https://cytoscape.org/; accessed on 12 February 2020). In addition, several add-on packages (and their dependencies) were used: circlize, ComplexHeatmap, ggplot2, ggpubr, igraph, httr, pdist, philentropy, RColorBrewer, RCy3 [18], RJSONIO.

3. Results

3.1. Study Population

A network-based approach was used to analyse a large dataset created from the database at an LHU (ASL TO4) in Regione Piemonte (Italy). A subset of the database containing 5,434,536 records/drugs prescribed to 361,914 patients over a 12-month period (from July 2018 to June 2019) was initially considered. Patient birth year was adopted, within the 1919–2019 range, as inclusion criteria. After excluding incomplete and/or ambiguous entries, the resulting dataset consisted of 5,431,335 entries/drugs prescribed to 361,574 patients. Therefore, this study population is a subset of the overall population (Figure A2). As shown in Figure 2, 54.7% of the patients in the study population were females. Moreover, 11.9% (6.1% and 5.8%, for males and females, respectively), 50.0% (22.4% and 27.6%) and 38.1% (16.8% and 21.3%) were of 0–21, 22–64 and ≥65 years of age, respectively. As shown in Figure 2B, 53.7% of overall drug prescriptions were attributable to female patients. In addition, 2.4% (1.3% and 1.1%), 30.1% (14.2% and 15.9%) and 67.5% (30.8% and 36.7%) of the drugs were prescribed to patients in the age ranges 0–21, 22–64 and ≥65, respectively.
Given the minimal restrictions to the inclusion criteria, this cohort of patients, with their (co)prescriptions, reflects the complexity of the real-world system and can be considered suitable for the assessment and validation of methods for the study of drug prescriptions.

3.2. Structure of Drug Prescription Networks

Pattern recognition in drug prescription datasets is a problem of interest, with relevant potential implications and applications related to prescription behaviour. The adoption of network models, together with network theory tools, may be a means to recognise such patterns. To evaluate this assumption, the structures of the five DPNs (Figure 3) created in this study were analysed. As shown in Figure 3 and Table 1, the number of nodes ( N ) and edges ( E ) in the networks increased across the ATC levels. As indicated by the graph density ( d ), DPN-ATCL1 (14 nodes and 91 edges) was a complete network (a network in which every pair of nodes is connected by a link), whereas DPN-ATCL2–DPN-ATCL5 were progressively more sparse networks, with fewer edges than a complete network.
Node connectivity was then studied. Each individual node i in a network is characterised by its degree ( k i ), which is the number of links it has with other nodes (referred to as the nearest neighbours) and thereby a “local” quantity. Hence, the degree for a distinct ATCi,L is the number of pairs ATCi,L–ATCj,L (with i j ) in the DPNs, and can be used as a measure of the variety of co-prescriptions that involve that ATCi,L. The average degree ( k ¯ ) increased across the five ATC levels (Table 1). These changes were also accompanied by an increase in the variability of k i , which means that both very low-degree and very high-degree ATCi,L can be found in the DPNs, at the higher ATC levels particularly. As shown in Figure A3, k i ranged from 7 (G01, gynecological anti-infectives and antiseptics, and R02, throat preparations) to 72 (B01, antithrombotic agents, C09, agents that act on the renin–angiotensin system, and J01, anti-bacterials for systemic use) for DPN-ATCL2; from 1 (A07B, intestinal adsorbents) to 151 (B01A, antithrombotic agents) for DNP-ATCL3; from 0 (N07AX, other parasympathomimetics) to 285 (A02BC, proton-pump inhibitors) for DPN-ATCL4; and from 0 (B03XA03, methoxy polyethylene glycol-epoetin beta, and N07AX02, choline alfoscerate) to 706 (J01CR02, amoxicillin and beta-lactamase inhibitor) for DPN-ATCL5. The same degree (13) was determined for all nodes in DPN-ATCL1. Moreover, a giant component exists ( k ¯ > 1 ), which contains all of the nodes in the network ( k ¯ > ln N ) [8], in almost all of the DPNs. The relative importance of a node i in the DPNs was also estimated by computing two “global” quantities, betweenness centrality ( b i ) and closeness centrality ( C l i ; Appendix A). As for degree, the average betweenness centrality ( b ¯ i ) and the average closeness centrality ( C l ¯ i ) of the DPNs changed across the five ATC levels (Table 1). Moreover, as shown in Figure 4, by adopting these quantities ( k , b and C l ), the relative importance of each node in the network can be characterised following distinct perspective. For example, due to its high degree and betweenness centrality, J01CR02 emerged as the most “important” node in the DPN-ATCL5. Moreover, despite the small degree, N02BE01 (paracetamol) was a central node in this network, thanks to its betweenness and closeness centrality.
In order to gain further insights into node connectivity, degree–degree correlation was then quantified by determining the assortativity coefficient, r [13,14]. Negative values were computed for almost all of the DPNs, with an inverse relationship between r and N (Table 1). It can therefore be stated that the DPNs displayed the disassortative mixing of node degrees, which implies that high-degree ATCi,Ls are preferentially linked to low-degree ATCj,Ls.
As suggested by these results, methods based on an analysis of the structural organisation of DPNs may be helpful to the study of patterns in poly-pharmacy.

3.3. Structure of Drug Prescription Networks and Popularity of Drug (Co)Prescription

Previous results that were obtained overlooked the number of patients prescribed with each drug/pairs of drugs, meaning that the “popularity” of each (co)prescription was not considered. However, as the structure of DPNs depends on the proportion of each (co)prescription, it can be hypothesised that conclusions as to the “popularity” of each (co)prescription could also be inferred by investigating the structure of the DPNs. In order to evaluate this hypothesis, the variability associated with drug prescriptions in our study population was first quantified by determining the number of patients prescribed with each ATCi ( P i ) at the five ATC levels (Figure A4). P i showed wide variety across the ATC levels and ATC codes. Anti-bacterials for systemic use that belonged to the class of penicillins (J, J01, J01C and J01CR02) were found to be the single ATCs that were most commonly prescribed within the first, second, third and fifth ATC levels, respectively. By contrast, larger variability was observed on the opposite side, where many more ATCs (60 at the ATCL5) were prescribed to only one patient over the study period.
The relationship between P i and k i was then studied to assess whether prescriptions and co-prescriptions are related variables. As shown in Figure 5 and Figure A5, the ATCis that were more frequently prescribed were also the nodes more connected in the DPNs and vice versa, regardless of ATC level. In particular, the most frequently prescribed ATCi,Ls were also the most connected nodes: group J (degree 13), group J01 (72), subgroup J01C (145), subgroup A02BC (degree 285) and chemical substance J01CR02 (degree 706), in DPN-ATCL1, -ATCL2, -ATCL3, -ATCL4, and -ATCL5, respectively. However, some drugs that were prescribed to the same number of patients were co-prescribed together with a different number of other distinct drugs, and this was more likely for the less popular ATCs. For example, in DPN-ATCL5, 13 patients were prescribed with G03GA04 (urofollitropin) and N02BE01 (paracetamol). However, the former was co-prescribed together with 9, while the latter with 140 other distinct drugs.
Finally, we investigated how, and to what extent, graph density depends on the number of patients (co)prescribed with distinct drugs and drug pairs. The Pearson’s coefficient of correlation ϕ i j was calculated for all nodes and edges in the networks [11,15]. Moreover, pre-specified values of ϕ i j ( ϕ * , from 0.00001 to 1) were adopted to include (if ϕ i j ϕ * ) or exclude (otherwise) nodes and edges. Representative results are shown in Figure 6, Figure A6 and Figure A7. The number of connected nodes and edges in the network decreased in a ϕ -correlation-dependent manner, and these changes were mainly due to the loss of the “weaker” edges, which are edges in the network that are characterised by lower values of C i j compared to the corresponding P i and P j . The relative number of connected nodes and edges in the networks at the different ϕ * values was also analysed using logistic models. These analyses allowed us to estimate ϕ m a x , which was the value of the ϕ -correlation that corresponded to the maximum for the difference between the relative number of connected nodes and edges in the network; these values were 0.063, 0.033 and 0.019 for DPN-ATCL1, -ATCL3 and ATCL5, respectively.
As indicated by these results, the structural organisation of DPNs is strictly dependent on the popularity of drug (co)prescriptions. The structure of DPNs can be manipulated and explored by considering the strength of drug co-prescriptions in order to gain further insight into prescription patterns.

3.4. Comparison of Drug Prescription Networks

The ability to discover and quantify similarity/difference in distinct (sub)systems is a problem of interest in many research fields. A method based on the computation of distance matrices [16] was therefore used to assess similarity/difference in distinct groups of the study population. The study population was divided into six groups according to gender and age (three strata, 0–21, 22–64 and 65–100 years of age), and data on drug prescriptions were then analysed to give the corresponding DPNs (for ATCL1, ATCL3 and ATCL5). Finally, DPNs were compared by calculating the Euclidean distances between the DPNs created for females and males at the different age-strata and ATC levels. Figure 7 displays examples of 14 × 14, 155 × 155 and 781 × 781 symmetric distance matrices (age strata 65–100 years) for ATCL1, ATCL3 and ATCL5, respectively. Importantly, the presence of a subset of ATCs with comparatively smaller internal distance does not imply the existence of a well-defined group, unless the group itself displays comparatively higher distance to the remaining ATCs. In addition, ATCs were classified by drawing dendrograms, built using the ward.D2 linkage, to facilitate subset identification [17]. Notably, the dendrograms identified a number of clusters in the distance matrices. However, in some cases, they included a large number of ATCs/drugs, while others included only a few ATCs/drugs characterised by a large external distance. For example, cluster 1, in the distance matrix ATCL1, consisted of seven ATCs (R, S, L, D, G, P and V) characterised by the minimal comparative external distance. On the other hand, cluster 3 included only two ATCs (A and C), characterised by the maximal comparative external distance. An even more marked pattern was observed in the distance matrix ATCL5. In particular, cluster 1 included a large number of drugs that belonged to almost all of the ATC groups, whereas cluster 10 included only two drugs, A11CC05 (cholecalciferol) and B01AC06 (acetylsalicylic acid). Additional examples can be found in Figure A8 and Figure A9. Interestingly, comparisons of distance matrices that were computed for the same ATC level, but at different age strata, reveal additional differences (Table A1).
As demonstrated by these results, DPNs provide a means to discover possible differences that may be caused by changes in drug (co)prescriptions over space and time.

4. Discussion

The great power of networks to describe real-world systems has attracted growing interest, and this modelling approach has been used in many research areas in recent decades. Several studies have introduced networks as helpful tools to investigate specific questions, even in the field of health and medical information analysis [19,20,21,22]. However, to date, only a few studies have dealt with network models to study drug prescription [10,11,23]. Together with previous findings, our results contribute to the development of the foundations upon which further studies can be designed.
Different types of DPNs (i.e., weighted graphs, bipartite graphs) can be created from the administrative drug prescription records collected by the health systems. Moreover, the structural organisation of these networks and their dynamic changes over time can be investigated to draw conclusions on specific questions of interest and related to the patient and prescriber behaviours, as well as to assess the effects of system-level interventions aimed to rationalise drug prescription (i.e., prescription drug monitoring, clinician and patient education, changes in product labelling and treatment guidelines). For example, the adoption of weighted DPNs to recognise patterns and trends in drug-prescribing data was introduced in previous reports. In particular, Cavallo et al. [10] have considered a set of drug prescriptions (for a total of 42,965 patients) written during a 6-month period by a group of 99 GPs operating in the city of Salerno (Italy). Bazzoni et al. [11] have analysed three datasets (145,072, 186,426 and 171,075 patients, respectively; time window, 1 year) extracted from the administrative prescription database of the Lombardy Region (Italy). In comparison, here we have analysed the largest dataset (5,431,335 entries/drugs prescribed to 361,574 patients over a period of 1 year) to recognise drug prescription patterns. Finally, Hu et al. [23] constructed several bipartite networks with two types of nodes—patients and prescribers—to reveal the characteristics and trends of the interaction between patients and prescribers across Queensland, Australia from 2011 to 2018 (prescribing data for fentanyl patches). Collectively, these examples provide evidence on the potential application of the network approach as a data mining technique, which could support the advisory role of healthcare providers and regulators.
By adopting DPNs, the drug prescription process is studied from the point of view of its topology. Despite the differences in both the original datasets and in the particular kind of interconnection between nodes, DPNs have been consistently demonstrated to be dense graphs [10,11]. Our results extend previous results by showing the relationship between ATC level and graph density. The fact that graph density depends on the relationship between the number of edges and the number of nodes in the network implies that the co-prescription of drugs is a very common event, which can be quantified by specific network measures. In addition, (co)prescription patterns in the original dataset are associated with DPN structure. For example, Newman’s assortative coefficient values reveal that DPNs displayed the disassortative mixing of node degrees, and this property was more pronounced at the higher ATC levels. Therefore, DPNs are characterised by non-random connectivity.
A study of the strength of co-prescriptions has demonstrated that: (a) some nodes are linked by weak edges, while others are linked by strong edges; (b) communities of densely interconnected nodes that link preferentially to each other than to other nodes in the network (modules) can be recognised by eliminating the weaker edges in the networks. Similarly to previous works [11,15], our aim was to propose a method to gain insights into the modular structure of DPNs after removing the weaker edges. In particular, ϕ m a x can be determined by analysing the following relationships: ϕ-correlation vs. number of nodes and ϕ-correlation vs. number of edges. This index can be adopted as a threshold value to decrease graph density and focus the analysis toward certain (co)prescriptions selected by means of their “popularity”. Therefore, these results support the conclusion that methods based on the study of DPNs are a flexible approach to the modelling of drug prescription data. Finally, Euclidean distance matrices [16] have been computed in order to discover similarity/difference in multiplex networks.
These analyses, albeit preliminary, have proven to be helpful in quantifying the relative contribution of each ATC in determining gender- and age-specific patterns in drug prescriptions, which likely reflect discrepancies in the prevalence of distinct conditions (e.g., bone disorders). Together with previous analyses, this method may be able to reveal changes in poly-pharmacy patterns over space and time.
Some limits of this study deserve to be considered. The use of DPNs to study the complexity of drug prescription is still in its infancy. Despite the promising results, future studies are needed to better understand the pros and cons. In addition, ASL TO4 does not collect data on drugs not reimbursed by NHS and over-the-counter medications. Moreover, whether the prescribed drugs were really taken is unknown. Therefore, our results cannot provide evidence on the actual drug use.

5. Conclusions

There is a growing need to improve our understanding of patterns and trends in drug prescription and utilization. Network models can be used as a data mining technique to study the complexity regarding drug prescription. The results obtained in this study provide further evidence for the efficiency and flexibility that DPNs show when representing drug prescription data. In particular, concepts and metrics of the network theory add novel perspectives in the path from raw data to knowledge, and could provide valuable evidence on how drugs are prescribed.

Author Contributions

Conceptualization, G.M.; methodology, G.M.; formal analysis, G.M. and L.B.; investigation, G.M., A.D., R.B.P., C.C.; data curation, A.D., R.B.P., M.E., C.R. and E.R.; writing—original draft preparation, G.M.; writing—review and editing, L.G.A., S.T., E.B. and C.C.; visualization, G.M. and L.B.; supervision, G.M. and C.C.; funding acquisition, G.M. and C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by funds from the Università degli Studi di Torino, Ricerca Locale Ex 60% 2018 to G.M. and 2019 to C.C.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We are grateful to the Management Direction of ASL TO4 (Regione Piemonte, Italy) for the possibility of a new interaction between health care providers and the University of Turin.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

Appendix A.1. Statistical and Network Measures Adopted in the Study

Appendix A.1.1. Degree

The degree of node i in a network ( k i ) is the number of nearest neighbours of that node. Moreover, given a network with N nodes, the average node degree of the network is defined by:
k ¯ = 1 N i = 1 N k i .
Degree quantifies the relative importance of a node in terms of the variety of its connections. It attributes large importance to nodes that are connected to a large number of other nodes in the network [7,15]. In addition, as you need to inspect only one node to find its degree, the degree is a “local” quantity.

Appendix A.1.2. Betweenness Centrality

Quantities which need an inspection of the whole network to be computed are “global” in nature—among them, the betweenness centrality. The betweenness centrality of node i in an undirected network ( b i ) is given by:
b i = r , s n r s i g r s ,
where n r s i is the number of shortest paths between node r and node s that pass through node i , and g r s is the total number of shortest paths between node r and node s . The average betweenness centrality of the network ( b ¯ ) is defined similarly to the average node degree of a network. In general, the betweenness centrality quantifies the relative importance of a node in the communication between other pairs of nodes. In particular, it attributes large importance to nodes that act as bridges between different regions of the network. These are nodes that are traversed by a large number of shortest paths connecting different pairs of nodes in the network. The betweenness centrality is not related to the degree of the node, as nodes of low-degree might connect two large and otherwise unconnected regions of the network and therefore acquire high betweenness centrality [7,15].

Appendix A.1.3. Closeness Centrality

The closeness centrality of a node i in an undirected network ( C l i ) is given by:
C l i = 1 1 N 1 j d i j = 1 l ¯ i ,
where l ¯ i is the average distance of the node i to the other nodes in the network. The average closeness centrality of the network ( C l ¯ ) is defined similarly to the average node degree of a network. As for the betweenness centrality, even the closeness centrality is a “global” measure, which quantifies the relative importance of a node in a network beyond the number of its connections (nearest neighbours): central nodes are those which are relatively close to all other nodes in the network.

Appendix A.1.4. Network Density

The density of a network ( d ) is defined as follows:
d = 2 E N ( N 1 ) = k ¯ N 1 ,
where E is the number of links and N is the number of nodes in the network. A criterion for considering a network as sparse is that d 1 , which is equivalent to say that the number of nodes is markedly higher than the average node degree of the network [7].

Appendix A.1.5. Network Assortativity

It has been shown that in real-world networks nodes are not connected randomly, but instead there is a significant bias for nodes of high degree to connect either to nodes of high degree (assortative networks) or to nodes of low degree (disassortative networks). In these cases, the network displays degree correlations. To establish whether a network is assortative, disassortative or without degree correlations, the assortativity coefficient ( r ) can be calculated, as proposed by Newman M.E.J. (2002, 2003) [13,14]:
r = k i , k j k i k j [ E ( k i k j ) Q ( k i ) Q ( k j ) ] σ Q 2 ,
where k i and k j are the numbers of neighbours (degree) of node i and j , respectively, minus 1, E ( k i k j ) is the fraction of links that connect a node of degree k i to a node of degree k j . Moreover, let P ( k j ) be the probability that a node selected at random in the network has degree k j , then, the distribution of the excess degree of a node at the end of a randomly chosen link is:
Q ( k j ) = ( k j + 1 ) P ( k j + 1 ) i k i P ( k i ) .
Finally, σ Q 2 is the standard deviation of the distribution Q ( k j ) . The assortativity coefficient is a number with values between −1 and 1. Negative values of the assortativity coefficient indicate that the network is overall disassortative, while positive values of r indicate that the network is overall assortative [7,15].

Appendix A.1.6. The Pearson’s Correlation Coefficient for Binary Variables

The Pearson’s correlation coefficient for binary variables (ϕ-correlation or simply ϕ) allows the quantification of the strength of co-prescription associations; it is given by:
ϕ i , j = C i j O P i P j P i P j ( O P i ) ( O P i ) ,
where C i j is the number of patients co-prescribed with both drugs, O is the total number of patients in the studied population and P i is the prevalence of the i t h prescription. When ϕ > 0, co-prescription is larger than expected by chance; when ϕ < 0, co-prescription is smaller than expected by chance [11,24].

Appendix A.1.7. Difference of the Networks (Euclidean Distance)

Given two un-directed networks G ( 1 ) = ( V ( 1 ) , E ( 1 ) , W ( 1 ) ) and G ( 2 ) = ( V ( 2 ) , E ( 2 ) , W ( 2 ) ) , where V ( 1 ) = V ( 2 ) = V are finite and identical sets of nodes, E ( · ) are finite sets of edges and W ( · ) are finite sets of weights, we define the Euclidean distance as:
d E U C = i , j V [ w i j ( 1 ) w i j ( 2 ) ] 2 .

Appendix A.2. The Anatomical Therapeutic Chemical (ATC) Classification System

The ATC classification system is a strict hierarchy, including five levels of classification (L1–L5), and for the anatomical/pharmacological groups (or 1st levels) there are 14 main groups:
  • A—Alimentary tract and metabolism;
  • B—Blood and blood forming organs;
  • C—Cardiovascular system;
  • D—Dermatologicals;
  • G—Genito-urinary system and sex hormones;
  • H—Systemic hormonal preparations, excluding sex hormones and insulins;
  • J—Anti-infectives for systemic use;
  • L—Antineoplastic and immunomodulating agents;
  • M—Musculo-skeletal system;
  • N—Nervous system;
  • P—Antiparasitic products, insecticides and repellents;
  • R—Respiratory system;
  • S—Sensory organs;
  • V—Various.
  • (see https://www.whocc.no/atc/structure_and_principles/, accessed on 25 February 2021).
Figure A1. The ATC classification system. The ATC classification system consists of five levels and several subcategories nested within broader parent categories. The resulting structural organization of the ATC codes is shown in the figure. In particular, the ATC codes were counted both in the overall system (blue columns) and, for comparison, in the DPNs (dark red columns) examined in this study.
Figure A1. The ATC classification system. The ATC classification system consists of five levels and several subcategories nested within broader parent categories. The resulting structural organization of the ATC codes is shown in the figure. In particular, the ATC codes were counted both in the overall system (blue columns) and, for comparison, in the DPNs (dark red columns) examined in this study.
Ijerph 18 04859 g0a1
Figure A2. Overall and study population. Age and sex structures of the overall population (ASL TO4) and study population (DPNs) are shown.
Figure A2. Overall and study population. Age and sex structures of the overall population (ASL TO4) and study population (DPNs) are shown.
Ijerph 18 04859 g0a2
Figure A3. Individual node degree. Data on drug prescription have been represented in the form of DPNs (ATCL1–ATCL5). Individual node degrees were computed and are displayed as bar charts. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure A3. Individual node degree. Data on drug prescription have been represented in the form of DPNs (ATCL1–ATCL5). Individual node degrees were computed and are displayed as bar charts. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g0a3
Figure A4. Number of patients prescribed with the ATCi. Data on drug prescription have been represented in the form of DPNs (ATCL1–ATCL5). The number of patients prescribed with each individual ATC was determined and is displayed in the bar plots. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure A4. Number of patients prescribed with the ATCi. Data on drug prescription have been represented in the form of DPNs (ATCL1–ATCL5). The number of patients prescribed with each individual ATC was determined and is displayed in the bar plots. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g0a4
Figure A5. Relationship prescriptions and co-prescriptions in the DPN-ATCL5. The number of patients prescribed with an ATCi and the number of neighbours of that ATCi were displayed as a 2D scatter plot. Horizontal and vertical black lines display the 50th (solid line), 2.5th and 97.5th (dashed lines) percentiles. Each node is represented as a dot; coloured (orange and red) dots highlight subsets of ATC codes endowed with the higher values for the two metrics.
Figure A5. Relationship prescriptions and co-prescriptions in the DPN-ATCL5. The number of patients prescribed with an ATCi and the number of neighbours of that ATCi were displayed as a 2D scatter plot. Horizontal and vertical black lines display the 50th (solid line), 2.5th and 97.5th (dashed lines) percentiles. Each node is represented as a dot; coloured (orange and red) dots highlight subsets of ATC codes endowed with the higher values for the two metrics.
Ijerph 18 04859 g0a5
Figure A6. Relationship between ϕ-correlation and graph density. Drug prescription networks at the first ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dotes) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x-coordinate of its maximum is denoted as ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure A6. Relationship between ϕ-correlation and graph density. Drug prescription networks at the first ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dotes) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x-coordinate of its maximum is denoted as ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g0a6
Figure A7. Relationship between ϕ-correlation and graph density. Drug prescription networks at the fifth ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dotes) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x-coordinate of its maximum is denoted as ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure A7. Relationship between ϕ-correlation and graph density. Drug prescription networks at the fifth ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dotes) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x-coordinate of its maximum is denoted as ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g0a7
Figure A8. Distance matrices for three ATC levels to compare females vs. males at the 0–21 years age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary DPN. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes are coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Figure A8. Distance matrices for three ATC levels to compare females vs. males at the 0–21 years age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary DPN. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes are coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Ijerph 18 04859 g0a8
Figure A9. Distance matrices for three ATC levels to compare females vs. males at the 22–64 years age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary DPN. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes are coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Figure A9. Distance matrices for three ATC levels to compare females vs. males at the 22–64 years age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary DPN. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes are coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Ijerph 18 04859 g0a9
Table A1. Comparisons between DPNs (females vs. males). Representative ATCs endowed with the higher distance values.
Table A1. Comparisons between DPNs (females vs. males). Representative ATCs endowed with the higher distance values.
ATCAge Strata (Years)
0–2122–6465–100
L1JJ, A and HA and C
L3J01CJ01C, H02A, A02B, M01A and A11CA02B, B01A, A11C, C07A, C10A and M01A
L5J01CR02 and H02AB01J01CR02, A11CC05 and H02AB01A11CC05 and B01AC06

References

  1. Nielsen, C.R.; Halling, A.; Andersen-Ranberg, K. Disparities in multimorbidity across Europe—Findings from the SHARE Survey. Eur. Geriatr. Med. 2017, 8, 16–21. [Google Scholar] [CrossRef]
  2. OECD. Health at a Glance 2019; OECD: Paris, France, 2019. [Google Scholar] [CrossRef]
  3. Ma, C.; Smith, H.W.; Chu, C.; Juarez, D.T. Big data in pharmacy practice: Current use, challenges, and the future. Integr. Pharm. Res. Pract. 2015, 4, 91–99. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Dalton, K.; Byrne, S. Role of the pharmacist in reducing healthcare costs: Current insights. Integr. Pharm. Res. Pract. 2017, 6, 37–46. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Hallas, J. Drug utilization statistics for individual-level pharmacy dispensing data. Pharmacoepidemiol. Drug Saf. 2005, 14, 455–463. [Google Scholar] [CrossRef] [PubMed]
  6. Euler, L. Solutio problematis ad geometriam situs pertinentis. Comment. Acad. Sci. Petropolitanae 1736, 8, 128–140. [Google Scholar]
  7. Estrada, E. The Structure of Complex. Networks: Theory and Applications; Oxford University Press: Oxford, UK, 2011. [Google Scholar] [CrossRef] [Green Version]
  8. Barabási, A.L. Network Science; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
  9. Latora, V.; Nicosia, V.; Russo, G. Complex Networks; Cambridge University Press: Cambridge, MA, USA, 2017. [Google Scholar] [CrossRef]
  10. Cavallo, P.; Pagano, S.; Boccia, G.; De Caro, F.; De Santis, M.; Capunzo, M. Network analysis of drug prescriptions. Pharmacoepidemiol. Drug Saf. 2013, 22, 130–137. [Google Scholar] [CrossRef]
  11. Bazzoni, G.; Marengoni, A.; Tettamanti, M.; Franchi, C.; Pasina, L.; Djade, C.D.; Fortino, I.; Bortolotti, A.; Merlino, L.; Nobili, A. The drug prescription network: A system-level view of drug co-prescription in community-dwelling elderly people. Rejuvenation Res. 2015, 18, 153–161. [Google Scholar] [CrossRef]
  12. WHO Collaborating Centre for Drug Statistics Methodology. Guidelines for ATC Classification and DDD Assignment 2020. Available online: https://www.whocc.no/filearchive/publications/2020_guidelines_web.pdf (accessed on 25 February 2021).
  13. Newman, M.E.J. Assortative Mixing in Networks. Phys. Rev. Lett. 2002, 89, 208701. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Newman, M.E.J. Mixing patterns in networks. Phys. Rev. E 2003, 67, 26126. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Hidalgo, C.A.; Blumm, N.; Barabási, A.L.; Christakis, N.A. A dynamic network approach for the study of human phenotypes. PLoS Comput. Biol. 2009, 5, e1000353. [Google Scholar] [CrossRef] [PubMed]
  16. Tantardini, M.; Ieva, F.; Tajoli, L.; Piccardi, C. Comparing methods for comparing networks. Sci. Rep. 2019, 9, 17557. [Google Scholar] [CrossRef] [PubMed]
  17. Murtagh, F.; Legendre, P. Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? J. Classif. 2014, 31, 274–295. [Google Scholar] [CrossRef] [Green Version]
  18. Gustavsen, J.A.; Pai, S.; Isserlin, R.; Demchak, B.; Pico, A.R. RCy3: Network biology using Cytoscape from within R. F1000Research 2019, 8, 1774. [Google Scholar] [CrossRef] [PubMed]
  19. Landon, B.E.; Keating, N.L.; Barnett, M.L.; Onnela, J.-P.; Paul, S.; O’Malley, A.J.; Keegan, T.; Christakis, N.A. Variation in patient-sharing networks of physicians across the United States. JAMA 2012, 308, 265–273. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Srinivasan, K.; Currim, F.; Ram, S. Predicting high-cost patients at point of admission using network science. IEEE J. Biomed. Health Inform. 2018, 22, 1970–1977. [Google Scholar] [CrossRef] [PubMed]
  21. Brunson, J.C.; Laubenbacher, R.C. Applications of network analysis to routinely collected health care data: A systematic review. J. Am. Med. Inform. Assoc. 2018, 25, 210–221. [Google Scholar] [CrossRef] [PubMed]
  22. Geissler, K.H.; Lubin, B.; Ericson, K.M.M. The association between patient sharing network structure and healthcare costs. PLoS ONE 2020, 15, e0234990. [Google Scholar] [CrossRef] [PubMed]
  23. Hu, X.; Gallagher, M.; Loveday, W.; Dev, A.; Connor, J.P. Network analysis and visualisation of opioid prescribing data. IEEE J. Biomed. Health Inform. 2020, 24, 1447–1455. [Google Scholar] [CrossRef] [PubMed]
  24. Bianconi, G. Multilayer Networks: Structure and Function; Oxford University Press: Oxford UK, 2018. [Google Scholar] [CrossRef]
Figure 1. Representation of a drug prescription network. Data on drug (co)prescriptions were analysed to create drug prescription networks that correspond to the five levels of the Anatomical Therapeutic Chemical (ATC) classification system. Nodes and edges in these networks were the single ATCs and ATC–ATC pairs found in the original dataset, respectively.
Figure 1. Representation of a drug prescription network. Data on drug (co)prescriptions were analysed to create drug prescription networks that correspond to the five levels of the Anatomical Therapeutic Chemical (ATC) classification system. Nodes and edges in these networks were the single ATCs and ATC–ATC pairs found in the original dataset, respectively.
Ijerph 18 04859 g001
Figure 2. Study population. A total of 5431,335 entries/drugs prescribed to 361,574 patients were analysed in this study. Panel (A) shows age and sex structures of the study population. Panel (B) shows relative prescriptions (all prescriptions/patient) classified by age and gender.
Figure 2. Study population. A total of 5431,335 entries/drugs prescribed to 361,574 patients were analysed in this study. Panel (A) shows age and sex structures of the study population. Panel (B) shows relative prescriptions (all prescriptions/patient) classified by age and gender.
Ijerph 18 04859 g002
Figure 3. Drug prescription networks. Data on drug prescriptions were analysed to generate five drug prescription networks that correspond to the levels of the ATC classification system (ATCL1–ATCL5). Node diameter and edge thickness were proportional to the number of patients prescribed with the individual ATCis and the pairs ATCi–ATCj within the ATC level L , respectively. Nodes were coloured according to the 14 groups found at the first level of the ATC classification system.
Figure 3. Drug prescription networks. Data on drug prescriptions were analysed to generate five drug prescription networks that correspond to the levels of the ATC classification system (ATCL1–ATCL5). Node diameter and edge thickness were proportional to the number of patients prescribed with the individual ATCis and the pairs ATCi–ATCj within the ATC level L , respectively. Nodes were coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g003
Figure 4. Node degree, betweenness centrality and closeness centrality. To better understand the relative importance of each node (DPN-ATCL5), degree, betweenness centrality and closeness centrality were computed and displayed in the 2D scatter plots. Horizontal and vertical black lines display the 50th (solid line), 2.5th and 97.5th (dashed lines) percentiles. Each node is represented as a dot; coloured (orange and red) dots highlight subsets of ATC codes endowed with the higher relative importance, with respect to the three metrics.
Figure 4. Node degree, betweenness centrality and closeness centrality. To better understand the relative importance of each node (DPN-ATCL5), degree, betweenness centrality and closeness centrality were computed and displayed in the 2D scatter plots. Horizontal and vertical black lines display the 50th (solid line), 2.5th and 97.5th (dashed lines) percentiles. Each node is represented as a dot; coloured (orange and red) dots highlight subsets of ATC codes endowed with the higher relative importance, with respect to the three metrics.
Ijerph 18 04859 g004
Figure 5. Relationship between number of prescriptions and variety of co-prescriptions. Scatter plots display the relationship between the number of patients prescribed with an ATCi and the number of neighbours of that ATCi in the drug prescription networks. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure 5. Relationship between number of prescriptions and variety of co-prescriptions. Scatter plots display the relationship between the number of patients prescribed with an ATCi and the number of neighbours of that ATCi in the drug prescription networks. Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g005
Figure 6. Relationship between ϕ-correlation and graph density. Drug prescription networks at the third ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dots) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x -coordinate of its maximum is denoted ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Figure 6. Relationship between ϕ-correlation and graph density. Drug prescription networks at the third ATC level were created by adopting different ϕ-correlation values (from 0.00001 to 1). The number of connected nodes (blue dots) and edges (grey dots) in these networks was determined and analysed using logistic models (blue and grey lines, respectively). The red curve displays the difference between the blue and the grey curves; the x -coordinate of its maximum is denoted ϕ m a x . Nodes are coloured according to the 14 groups found at the first level of the ATC classification system.
Ijerph 18 04859 g006
Figure 7. Distance matrices for three ATC levels to compare females vs. males at the 65–100 years-of-age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary drug prescription networks. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes were coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Figure 7. Distance matrices for three ATC levels to compare females vs. males at the 65–100 years-of-age strata. The colour in entry ( i , j ) is the Euclidean distance between layers i (females) and j (males) of the binary drug prescription networks. Euclidean distances are coded by colours from white (small distance) to red (large distance). Nodes were coloured according to the 14 groups found at the first level of the ATC classification system. Dendrograms identify a number of clusters in the distance matrices.
Ijerph 18 04859 g007
Table 1. Structural and statistical measures of drug prescription networks.
Table 1. Structural and statistical measures of drug prescription networks.
ATCL 1 N 2 ln N E 3 d 4 k ¯ [CV%] 5 b ¯ [CV%] 6 C l ¯   ( ×   10 4 ) [CV%] r 7
1142.639911.00013.000 [0.0]5.571 [374.2]0.876 [41.5]NA 8
2754.31720530.74054.747 [32.2]51.839 [140.4]50.444 [11.7]−0.194
31555.04371910.60392.787 [42.9]91.999 [104.4]26.907 [7.9]−0.189
43085.73122,7470.481147.708 [55.9]173.130 [85.3]9.712 [7.2]−0.264
57816.661104,0630.342266.487 [72.9]423.840 [74.0]2.984 [6.4]−0.332
1 ATCL: Anatomical Therapeutic Chemical level; 2  N : number of nodes; 3  E : number of edges; 4  d : network density; 5  k ¯ : average degree; 6 CV: coefficient of variation (standard deviation/average); 7  r : assortativity coefficient; 8 NA: not applicable.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Miglio, G.; Basso, L.; Armando, L.G.; Traina, S.; Benetti, E.; Diarassouba, A.; Baroetto Parisi, R.; Esiliato, M.; Rolando, C.; Remani, E.; et al. A Network Approach for the Study of Drug Prescriptions: Analysis of Administrative Records from a Local Health Unit (ASL TO4, Regione Piemonte, Italy). Int. J. Environ. Res. Public Health 2021, 18, 4859. https://doi.org/10.3390/ijerph18094859

AMA Style

Miglio G, Basso L, Armando LG, Traina S, Benetti E, Diarassouba A, Baroetto Parisi R, Esiliato M, Rolando C, Remani E, et al. A Network Approach for the Study of Drug Prescriptions: Analysis of Administrative Records from a Local Health Unit (ASL TO4, Regione Piemonte, Italy). International Journal of Environmental Research and Public Health. 2021; 18(9):4859. https://doi.org/10.3390/ijerph18094859

Chicago/Turabian Style

Miglio, Gianluca, Lara Basso, Lucrezia G. Armando, Sara Traina, Elisa Benetti, Abdoulaye Diarassouba, Raffaella Baroetto Parisi, Mariangela Esiliato, Cristina Rolando, Elisa Remani, and et al. 2021. "A Network Approach for the Study of Drug Prescriptions: Analysis of Administrative Records from a Local Health Unit (ASL TO4, Regione Piemonte, Italy)" International Journal of Environmental Research and Public Health 18, no. 9: 4859. https://doi.org/10.3390/ijerph18094859

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop