Next Article in Journal
Game Theoretic Approach for Systematic Feature Selection; Application in False Alarm Detection in Intensive Care Units
Next Article in Special Issue
Prior and Posterior Linear Pooling for Combining Expert Opinions: Uses and Impact on Bayesian Networks—The Case of the Wayfinding Model
Previous Article in Journal
Some Iterative Properties of ( F 1 , F 2 ) -Chaos in Non-Autonomous Discrete Systems
Previous Article in Special Issue
Generalized Skew-Normal Negentropy and Its Application to Fish Condition Factor Time Series
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Investigation into the Relationship among Psychiatric, Demographic and Socio-Economic Variables with Bayesian Network Modeling

1
Department of Statistics, Bitlis Eren University, 13000 Bitlis, Turkey
2
Department of Statistics, Yildiz Technical University, 34349 Istanbul, Turkey
*
Author to whom correspondence should be addressed.
Entropy 2018, 20(3), 189; https://doi.org/10.3390/e20030189
Submission received: 29 January 2018 / Revised: 6 March 2018 / Accepted: 9 March 2018 / Published: 12 March 2018
(This article belongs to the Special Issue Foundations of Statistics)

Abstract

:
The aim of this paper is to investigate the factors influencing the Beck Depression Inventory score, the Beck Hopelessness Scale score and the Rosenberg Self-Esteem score and the relationships among the psychiatric, demographic and socio-economic variables with Bayesian network modeling. The data of 823 university students consist of 21 continuous and discrete relevant psychiatric, demographic and socio-economic variables. After the discretization of the continuous variables by two approaches, two Bayesian networks models are constructed using the b n l e a r n package in R, and the results are presented via figures and probabilities. One of the most significant results is that in the first Bayesian network model, the gender of the students influences the level of depression, with female students being more depressive. In the second model, social activity directly influences the level of depression. In each model, depression influences both the level of hopelessness and self-esteem in students; additionally, as the level of depression increases, the level of hopelessness increases, but the level of self-esteem drops.

1. Introduction

Bayesian networks (BNs), also called belief networks, are graphical structures used to reason and represent knowledge in an uncertain field. They are a combination of graph theory and probability theory to explore relationships between variables [1]. The network structure is a directed acyclic graph (DAG) in which nodes represent random variables [2,3] and arcs represent direct dependencies between variables [4,5].
In psychiatry, scales are used as a helper tool to be used in diagnosis. In this study, the Beck Depression Inventory (BDI), the Beck Hopelessness Scale (BHS) and the Rosenberg Self-Esteem Scale (RSES) are used to investigate the psychiatric characteristics of university students. Bayesian networks have been widely used in many domains including engineering [6], customer behaviors [7,8,9], social behaviors [10,11], clinical decision support [12], system biology [13,14], ecology [15], and so on.
In this paper, the psychiatric variables, which are continuous, are discretized using two approaches: (i) expert knowledge in literature and (ii) a statistical method. Then, Bayesian networks are used to analyze the relationship among BDI, BHS, RSES, demographic variables and socio-economic variables.
The structure of this paper is as follows: In Section 2, the data used in this study are briefly described. In Section 3, the theory of Bayesian networks is summarized. In Section 4, the continuous data are discretized using two approaches. In Section 5, the data are analyzed, and the results are interpreted. Finally, in Section 6, the two methods are compared, and the study is concluded.

2. Data

In this study, the data are composed of three types of variables: (i) psychiatric variables, (ii) demographic variables and (iii) socio-economic variables.

2.1. Psychiatric Variables

2.1.1. Beck Depression Inventory

The Beck Depression Inventory (BDI), created by [16], is a self-report inventory used to measure the severity of depression in individuals aged 13 and over. The BDI consists of 21 multiple-choice questions, with zero being the lowest score and three being the highest score in each question. The total BDI score is calculated by summing the answers of the 21 questions. Possible BDI scores range from 0–63.

2.1.2. Beck Hopelessness Scale

The Beck Hopelessness Scale (BHS), developed by [17], is a self-report inventory that shows the level of hopelessness in adults, aged 17–80. It consists of 20 true-false questions. Eleven items are keyed true and nine false. Therefore, the answers are summed up, and the total BHS score is calculated. BHS scores range from 0–20.

2.1.3. Rosenberg Self-Esteem Scale

The Rosenberg Self-Esteem Scale (RSES), created by [18], is a self-report inventory that represents how an individual judges his or her self-worth. The RSES consists of 10 items and is rated on a four-point Likert scale with one, strongly disagree, and three, strongly agree, for the 1st, 2nd, 4th, 6th and 7th items and one, strongly agree, and three, strongly disagree, for the 3rd, 5th, 8th, 9th and 10th items. The RSES scores are calculated by summing the answers of the 10 items, and they lay in the interval 0–30.

2.2. Participants

The participants of this study are the students of Bitlis Eren University, located in Turkey, in the 2016–2017 academic year. In the first week of May 2017, the students were given a four-page form consisting of the Turkish versions of the Beck Depression Inventory, the Beck Hopelessness Inventory, the Rosenberg Self-Esteem Scale and demographic and socio-economic variables. The students were asked to fill the forms out on a voluntary basis. A total of 890 students from five schools of Bitlis Even University were included in the study, and the data of 823 students were obtained after extracting the missing data.

2.3. Demographic Variables

There are only two demographic variables in the data, which are G e n d e r and A g e .

2.4. Socio-Economic Variables

There are 16 socio-demographic variables in the data. These variables are as follows: S c h o o l _ t y p e , S m o k i n g _ s t a t u s , T y p e _ o f _ a c c o m m o d a t i o n , A l c o h o l _ u s e , S o c i a l _ a c t i v i t y , S t u d e n t s _ j o b , F a m i l y _ s t r u c t u r e , F a t h e r s _ e d u c a t i o n a l _ s t a t u s , F a m i l y _ i n c o m e , M o t h e r s _ e d u c a t i o n a l _ s t a t u s , S t u d e n t s _ i n c o m e , N u m b e r _ o f _ s i b l i n g s , T y p e _ o f _ s e t t l e m e n t , F a t h e r s _ o c c u p a t i o n , M o t h e r s _ o c c u p a t i o n and R e l a t i o n s h i p _ s t a t u s . Both demographic and socio-economic variables are described in Table 1.

3. Bayesian Networks

Let G = ( N , A ) be a directed acyclic graph (DAG), where N is a finite set of nodes and A is a finite set of arrows between the nodes. The DAG characterizes the structure of the BN.
Each node nN in the graph G represents a random variable X n . The set of variables associated with the graph G is denoted by X = ( X n ) n N . A local probability distribution ( p ( x n | x p a ( n ) ) ) is allocated to every node with parents p a ( n ) . A BN for random variables X is a combination of G and P ( G , P ), where P is the set of local distributions for all variables in the network [19].
The absence of arrows in G encodes conditional independence between the random variables X through the factorization of the joint probability distribution,
p ( x ) = n N p ( x v | x p a ( v ) ) .
In Equation (1), there are both discrete and continuous variables as shown in [20]. N consists of Ω and Ψ ( N = Ω Ψ ), where Ω and Ψ represent the sets of discrete and continuous variables, respectively. The random variables X can be redefined as X = ( X n ) n N = ( I , J ) = ( ( I ω ) ω Ω , ( J ψ ) ψ Ψ ) , where I and J are the sets of discrete and continuous variables, respectively. To ensure the availability of exact local computation methods, discrete variables are not allowed to have continuous parents. The joint probability distribution then factorizes into a discrete part and a mixed part [21],
p ( x ) = p ( i , j ) = ω Ω p ( i ω | i p a ( ω ) ) ψ Ψ ( j ψ | i p a ( ψ ) , j p a ( ψ ) ) .

3.1. Structural Learning

There are two kinds of structure learning algorithms. The first is the search-and-score algorithm, which allocates a score to every Bayesian network structure and chooses the structure model with the highest score. The latter is called constraint-based structural learning, and it creates a set of conditional independence analysis on the data and uses this analysis to produce an undirected graph. With an additional independence test, the network is converted into a Bayesian network [22]. In this study, the tabu search algorithm [23] is preferred.

3.2. Parameter Learning

After learning the structure of the BN from the data, the parameters can be estimated and updated. There are two approaches widely-used in the literature: (i) maximum likelihood estimation and (ii) Bayesian estimation [24]. In this paper, Bayesian estimation based on Dirichlet priors is used.

4. Data Discretization

The data discretization is a technique that converts continuous data into discrete data with a finite number of intervals, and it has become remarkably popular in many research areas including data mining, machine learning, artificial intelligence and Bayesian networks. There are many reasons to discretize data, among which the most important ones are (i) reducing and simplifying the dataset, (ii) making modeling fast and easy, (iii) obtaining easily interpretable outputs [25] and (iv) the statistical method to be used may operate in discrete data only as in this study.
As Figure 1 shows, the psychiatric variables are not normally distributed, which violates the Gaussian Bayesian networks (GBNs) assumption. In this case, we may specify a suitable conditional distribution for every variable and build a hybrid Bayesian network [24]. Nevertheless, this approach requires prior knowledge, which is not available in our case. Therefore, we transform the continuous variables into discrete ones, i.e., discretization, and build discrete Bayesian networks. In this paper, we discretize the continuous variables in two ways, which are given in the next two sections.

4.1. Data Discretization by Expert Knowledge

One way to discretize continuous variables is to categorize them by following the studies in the literature. The intervals for the continuous variables in this study were categorized by the domain experts in the previous studies. Therefore, we use this knowledge to discretize the psychiatric variables. Hence, the discretization of the psychiatric variables is as follows:
  • Beck Depression Inventory Score: We discretize this variable following [26] into four levels: BDI scores 0–10 show none or minimal depression, 10–18 mild depression, 19–29 moderate depression and 30–63 severe depression.
  • Beck Hopelessness Scale Score: We discretize this variable following [27] into four levels: The BHS scores 0–3 correspond to none or minimal hopelessness, 4–8 to mild hopelessness, 9–14 to moderate hopelessness and 15–20 to severe hopelessness.
  • Rosenberg Self-Esteem Scale Score: We discretize this variable following [28] into three levels: The RSES scores below 15 indicate low self-esteem, 15–25 normal self-esteem and 26–30 high self-esteem.

4.2. Data Discretization by Statistical Methods

When no prior knowledge from domain experts is available, discretization methods are used to discretize the continuous variables. In the literature, there exist many discretization methods. Some of them are entropy-based discretization [29], error-based discretization [30], the one-rule discretizer (1RD) [31], equal frequency discretization [32] and information-preserving discretisation [33].
In this paper, the information-preserving discretization method is used to discretize the continuous variables. In this method, the basic notion is to initially discretize each variable into large numbers of intervals, say k 1 . Hence, the amount of information lost is kept minimum. Then, the algorithm repeats over the variables and retains, for each of them, the pair of attached intervals minimizing the loss of pairwise mutual information. Once all variables have k 2 , which is the number of intervals the user specifies, ≤ k 1 intervals left, the algorithm stops [24]. One of the reasons to use this method in this paper is that it enables us to specify the number of intervals. As the continuous variables in our study measure the severity of depression, hopelessness and self-esteem, it is reasonable to discretize them into three categories as low, normal and high. Another reason is that this method can be easily applied in R .
When we discretize the psychiatric variables using the discretize function in R (see Appendix A), the intervals are obtained as follows:
  • Beck Depression Inventory Score:
    • (0, 11.5]      : low
    • (11.5, 20.8] : normal
    • (20.8, 63]    : high
  • Beck Hopelessness Scale Score:
    • (0, 6.89]     : low
    • (6.89, 13]   : normal
    • (13, 20]      : high
  • Rosenberg Self-Esteem Score:
    • (0, 15.8]       : low
    • (15.8, 19.8] : normal
    • (19.8, 30]    : high

5. Analysis

In this section, we analyze the datasets by constructing suitable BNs. All of the analysis is conducted in statistical software R using b n l e a r n [34], l a t t i c e [35], g R a i n [36] and R g r a p h v i z [37] packages.
First of all, to test the reliability of the psychiatric scales, Cronbach’s alpha internal consistency coefficients [38] need checking. Cronbach’s alpha internal consistency coefficients for BDI, BHS and RSES are 0.89, 0.87 and 0.83, respectively, meaning these scales are safe (greater than 0.70) for use in the analysis.
We build two different discrete BNs using the same method as two different approaches of discretization were used. We call the BN whose data were discretized by expert knowledge Bayesian Network 1 (BN1) and call the BN whose data were discretized by Information-Preserving Discretization Bayesian Network 2 (BN2).

5.1. Bayesian Network 1

We build the BN using R , and all the code used is given in Appendix B. Before learning the BN, we must specify a prior BN. Since we have no prior knowledge available, we specify an empty DAG. In addition, we must block all the arrows towards G e n d e r and A g e since no other variables can affect these two variables.
Figure 2 shows the DAG of the BN1 and the network score of BN1 is calculated as −15,214.8.
To summarize the significant results, M o t h e r s _ o c c u p a t i o n is independent of all variables. This is because 93% of the mothers are unemployed.
A g e and G e n d e r are independent of all variables as we specified in the model. However, A g e directly affects S t u d e n t s _ i n c o m e , and G e n d e r directly affects D e p r e s s i o n , S c h o o l _ t y p e , T y p e _ o f _ a c c o m m o d a t i o n , S o c i a l _ a c t i v i t y and S m o k i n g _ s t a t u s .
While D e p r e s s i o n is dependent on G e n d e r , the other two psychiatric variables H o p e l e s s n e s s and S e l f _ e s t e e m are conditionally independent of all demographic and socio-economic variables (excluding M o t h e r s _ o c c u p a t i o n ) given D e p r e s s i o n . However, H o p e l e s s n e s s and S e l f _ e s t e e m are directly dependent on D e p r e s s i o n .
Figure 3 represents the conditional probabilities of D e p r e s s i o n given G e n d e r , i.e., P ( D e p r e s s i o n | G e n d e r ) . It is clear that female and male students have approximately equal probabilities of having severe depression ( 10 % ) and moderate depression ( 19 % ). While the probability of a male student having none or minimal depression is 43 % , it is 27 % for his female counterpart. However, female students are more likely to have mild depression ( 44 % ) than male students ( 29 % ). From Figure 3, it can be said that female students are more likely to have a higher level of depression than male students.
Figure 4 shows the bar charts of the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n . As the level of depression increases from none or minimal to severe depression, the probability of having a low level of self-esteem increases. For the depression levels none or minimal, mild, moderate and severe depression, the probabilities of students having low self-esteem are 0.01, 0.04, 0.15 and 0.33, respectively, while the probabilities of having high self-esteem are 0.35, 0.20, 0.08 and 0.11, respectively. Therefore, it can be concluded that the level of depression has a negative influence on self-esteem.
Figure 5 presents the bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n . Unlike with S e l f _ e s t e e m , generally speaking, there is a positive relationship between H o p e l e s s n e s s and D e p r e s s i o n . As the level of depression in students increases, so does the probability of having a higher level of hopelessness. The probabilities of having none or minimal hopelessness, mild hopelessness, moderate hopelessness and severe hopeless are respectively 0.61, 0.29, 0.09 and 0.01 in students with none or minimal depression, 0.42, 0.37, 0.18 and 0.03 in students with mild depression, 0.18, 0.35, 0.37 and 0.10 in students with moderate depression and 0.07, 0.17, 0.38 and 0.38 in students with severe depression. These figures show that a higher level of depression in students results in a higher level of hopelessness.

5.2. Bayesian Network 2

Now, we build a BN just like we did in the previous section. Again, we specify an empty DAG as the prior DAG and block all the arrows towards A g e and G e n d e r (see Appendix C).
Figure 6 represents the DAG of BN2, and the network score of BN2 is −14,748.93.
To abstract the notable relationships, A g e directly affects S t u d e n t s _ i n c o m e , while G e n d e r directly affects T y p e _ o f _ a c c o m m o d a t i o n , S c h o o l _ t y p e , S o c i a l _ a c t i v i t y and S m o k i n g _ s t a t u s .
Of all the demographic and socio-economic variables, the only variable that has a direct influence on one of the psychiatric variables, that is D e p r e s s i o n , is S o c i a l _ a c t i v i t y . In addition, S e l f _ e s t e e m and H o p e l e s s n e s s are directly dependent on D e p r e s s i o n , but both of them are conditionally independent of all the demographic and socioeconomic variables (excluding M o t h e r s _ o c c u p a t i o n ) when the status of D e p r e s s i o n is known.
Figure 7 is the bar chart of the conditional probabilities of D e p r e s s i o n given S o c i a l _ a c t i v i t y . It shows that the presence of S o c i a l _ a c t i v i t y has a negative impact on D e p r e s s i o n . While only approximately 17% of the students with social activity have a high level of depression, this proportion is about 28% in the students with no social activity. In addition, the proportion of students with a low level of depression is significantly lower in the students with social activity compared to those with no social activity.
Figure 8 is the bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n . When a student is known to have a low level of depression, the probability that he/she has a low level of hopelessness is around 0.83, while the probability of his/her having a high level of hopelessness is merely 0.02. In the students with high level of depression, the proportions of students with low and high levels of hopelessness are approximately 0.27 and 0.35, respectively. Therefore, it is concluded that there is a positive correlation between D e p r e s s i o n and H o p e l e s s n e s s .
Figure 9 illustrates the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n . At first glance, it is noted that the students who have a high level of self-esteem dominate the others in each group. The probabilities that a student has high and low levels of self-esteem are about 85% and 2%, respectively, when he/she has a low level of depression, whereas these probabilities are 41% and 33%, respectively, when the student has a high level of depression. These numbers indicate that as the severity of D e p r e s s i o n increases, the severity of self-esteem decreases.

6. Discussion and Conclusions

In this paper, the relationship among the 3 psychiatric, 2 demographic and 16 socio-economic variables was investigated through BN modeling in R . The data were obtained from 823 university students via a survey.
In the analysis, firstly, the Beck Depression Inventory, the Beck Hopelessness Scale and the Rosenberg Self-Esteem Scale scores were calculated from the answers of students. Then, Cronbach’s alpha internal consistency coefficients for the these psychiatric scales were tested and found satisfying. Next, the continuous data were discretized by two approaches, which led to two BN models (BN1 and BN2). Finally, after building the BNs with the tabu search algorithm, inference and queries were made through graphs and conditional probabilities.
To summarize the results, only the occupational status of the mother has no relation with the other variables in both models.
According to BN1, the gender of the students directly influences the choices of students on school type, accommodation type, social activity and smoking status, while the age directly influences only the income of the student. The gender also directly affects the level of depression of students, and female students have a higher chance of being more depressive than male students. This finding is supported by similar studies [39,40,41,42].
According to BN2, similar to BN1, only the income of the students is directly dependent on the age. The gender has direct influence on accommodation type, school type, social activity and smoking status. In addition, depression is directly dependent on social activity. The presence of social activity has a negative effect on depression, which is also supported by [43,44]. Furthermore, self-esteem and hopelessness are directly dependent on depression.
To compare BN1 and BN2, firstly, BN2’s network score is greater than that of BN1, meaning BN2 is more likely to produce better predictions than BN1. While depression is directly dependent on the gender in BN1, it is conditionally independent of the gender given social activity in BN2. In each model, hopelessness and self-esteem are directly dependent on depression and conditionally independent of all the demographic and socio-economic variables excluding the occupational status of the mother, which is independent of all the variables.
In both models, one of the most important results is that as the level of depression rises, so does the level of hopelessness. This finding is consistent with other studies in the literature [45,46]. Another one is that, in contrast to hopelessness, the rise in the severity of depression results in lower self-esteem scores in students. This finding is in line with other studies on the relationship between depression and self-esteem [43,47,48].

Acknowledgments

The authors are grateful to Bitlis Eren University’s students for voluntarily participating in this study, to the academic and administrative staff for helping in the collection of the data and to the authors of the R packages used in this study.

Author Contributions

Both authors equally participated in the design, methodology, writing and interpretation of the results. Gunal Bilek collected the data and conducted the analysis in R as part of his Ph.D. thesis. Both authors have read and approved the paper before the submission.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. R Code for Section 4.2

#read and attach data
data <- read.table("data.txt",header=T)
attach(data)
#discretize continuous variables
d.frame <- data.frame(Self_esteem_score, BDI_score, BHS_score)
discrete.data <- discretize(d.frame, method = "hartemink",breaks = 3, ibreaks = 100,
       idisc = "interval")

Appendix B. R Code for Section 5.1

#load required packages
library(bnlearn)
library(Rgraphviz)
library(gRain)
library(lattice)
#prepare data
data1 <- data.frame(Depression, Hopelessness, Self_esteem, Age, Gender,
         School_type,Family_structure, Fathers_educational_status,
         Mothers_educational_status,Family_income,Students_income,
         Number_of_siblings, Type_of_settlement,Fathers_occupation,
         Mothers_occupation,Type_of_accommodation, Smoking_status
         Alcohol_use, Students_job,Relationship_status,Social_activity)
#create ban list
banlist = matrix(c("Self_esteem","Age","Depression","Age","Hopelessness","Age",
      "School_type","Age","Family_structure","Age","Number_of_siblings",
      "Age","Fathers_educational_status","Age",
      "Mothers_educational_status", "Age","Family_income","Age",
      "Students_income","Age","Smoking_status","Age",
      "Type_of_settlement","Age","Fathers_occupation","Age",
      "Mothers_occupation","Age","Type_of_accommodation","Age",
      "Alcohol_use", "Age", "Students_job", "Age", "Relationship_status",
      "Age","Social_activity","Age","Self_esteem", "Gender","Depression",
       "Gender","Hopelessness","Gender","School_type","Gender",
      "Family_structure","Gender","Fathers_educational_status","Gender",
      "Mothers_educational_status", "Gender", "Family_income", "Gender",
      "Students_income","Gender","Number_of_siblings","Gender",
      "Type_of_settlement","Gender","Fathers_occupation","Gender",
      "Mothers_occupation","Gender","Type_of_accommodation","Gender",
      "Smoking_status","Gender","Alcohol_use","Gender","Students_job",
      "Gender","Relationship_status","Gender","Social_activity",
      "Gender","Age","Gender", "Gender","Age"), ncol = 2, byrow = TRUE)
#Build bayesian network and calculate network score
bn1 <- tabu(data1, blacklist = banlist)
fit1 <- bn.fit(bn1, data1, method="bayes")
score(bn1,data1)
#Plot Figure 2
highlight1 <- list(nodes = nodes(bn1),arcs = arcs(bn1),col="black")
g1 <- graphviz.plot(bn1, highlight = highlight1, shape="rectangle")
#Plot Figures 3, 4 and 5 respectively
bn.fit.barchart(fit1$Depression, ylab = "Levels of Depression")
bn.fit.barchart(fit1$Self_esteem, ylab = "Levels of Self_esteem")
bn.fit.barchart(fit1$Hopelessness, ylab = "Levels of Hopelessness")

Appendix C. R Code for Section 5.2

#Create data frame by discretized intervals
Depression <- ifelse(discrete.data$BDI_score=="(0.938,11.5]","Low",
      ifelse(discrete.data$BDI_score=="(11.5,20.8]","Normal","High"))
Hopelessness <- ifelse(discrete.data$BHS_score=="(0.981,6.89]","Low",
      ifelse(discrete.data$BHS_score=="(6.89,13]" ,"Normal","High"))
Self_esteem <- ifelse(discrete.data$Self_esteem_score=="(0.971,15.8]","Low",
     ifelse(discrete.data$Self_esteem_score=="(15.8,19.8]","Normal","High"))
data2 <- data.frame(Depression, Hopelessness, Self_esteem, Age, Gender,
         School_type,Family_structure,Fathers_educational_status,
         Mothers_educational_status,Family_income,Students_income,
         Number_of_siblings, Type_of_settlement,Fathers_occupation,
         Mothers_occupation, Type_of_accommodation,Smoking_status,
         Alcohol_use, Students_job,Relationship_status,Social_activity)
#Build Bayesian network and calculate network score
bn2 <- tabu(data2, blacklist = banlist)
fit2 <- bn.fit(bn2, data2, method="bayes")
score(bn2, data2)
#Plot Figure 6
highlight2 <- list(nodes = nodes(bn2),arcs = arcs(bn2),col="black")
g2 <- graphviz.plot(bn2, highlight = highlight2, shape="rectangle")
#Plot Figures 7, 8 and 9 respectively
bn.fit.barchart(fit2$Depression, ylab = "Levels of Depression")
bn.fit.barchart(fit2$Self_esteem, ylab = "Levels of Self_esteem")
bn.fit.barchart(fit2$Hopelessness, ylab = "Levels of Hopelessness")

References

  1. Larranaga, P.; Moral, S. Probabilistic graphical models in artificial intelligence. Appl. Soft Comput. 2011, 11, 1511–1528. [Google Scholar] [CrossRef]
  2. Butz, C.; Hua, S.; Chen, J.; Yao, H. A simple graphical approach for understanding probabilistic inference in Bayesian networks. Inf. Sci. 2009, 179, 699–716. [Google Scholar] [CrossRef]
  3. Glymour, C.N. Discovering Causal Structure: Artificial Intelligence, Philosophy of Science, and Statistical Modeling; Academic Press: Cambridge, MA, USA, 1987; p. 394. [Google Scholar]
  4. Spirtes, P.; Glymour, C.; Scheines, R. Causation, Predictionand Search, Adaptive Computation and Machine Learning, 2nd ed.; The MIT Press: Cambridge, MA, USA, 2001. [Google Scholar]
  5. Korb, K.; Nicholson, A. Bayesian Artificial Intelligence; Chapman and Hall: London, UK, 2004; pp. 29–32. [Google Scholar]
  6. Zweig, G. Bayesian network structures and inference techniques for automatic speech recognition. Comput. Speech Lang. 2003, 17, 173–193. [Google Scholar] [CrossRef]
  7. Cene, E.; Karaman, F. Analysing organic food buyers’ perceptions with Bayesian networks: A case study in Turkey. J. Appl. Stat. 2015, 42, 1572–1590. [Google Scholar] [CrossRef]
  8. Renzi, M.; Vicard, P.; Guglielmetti, R.; Musella, F. Probabilistic expert systems for managing information to improve services. TQM J. 2009, 21, 429–442. [Google Scholar] [CrossRef]
  9. Salini, S.; Kenett, R.S. Bayesian networks of customer satisfaction survey data. J. Appl. Stat. 2009, 36, 1177–1189. [Google Scholar] [CrossRef]
  10. Koc, T.; Ulas, E.; Koc, H.; Keskin, B. Analysis of Social Media Usage Habits among the People of Samsun. J. Basic Appl. Res. Int. 2017, 22, 93–98. [Google Scholar]
  11. Koc, T.; Koc, H.; Ulas, E. Üniversite Öğrencilerinin Kötü Alışkanlıklarını Bayesci Ağ yöntemi İle Belirlenmesi. Ç.Ü. Sos. Bilim. Derg. 2017, 26, 230–240. [Google Scholar]
  12. Sesen, M.B.; Nicholson, A.E.; Banares-Alcantara, R.; Kadir, T.; Brady, M. Bayesian Networks for Clinical Decision Support in Lung Cancer Care. PLoS ONE 2013, 8, e82349. [Google Scholar] [CrossRef] [PubMed]
  13. Jansen, R.; Yu, H.; Greenbaum, D.; Kluger, Y.; Krogan, N.J.; Chung, S.; Emili, A.; Snyder, M.; Greenblatt, J.F.; Gerstein, M. A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data. Science 2003, 302, 449–453. [Google Scholar] [CrossRef] [PubMed]
  14. Yu, J.; Smith, V.A.; Wang, P.P.; Hartemink, A.J.; Jarvis, E.D. Advances to Bayesian network inference for generating causal networks from observational biological data. Bioinformatics 2004, 20, 3594–3603. [Google Scholar] [CrossRef] [PubMed]
  15. Edis, S.; Ulas, E. Using Bayesian Network to predict the watershed land use type of Çankırı Acıçay-Tatlıçay. Turkish J. For. 2017, 18, 212–218. [Google Scholar]
  16. Beck, A.T.; Ward, C.H.; Mendelson, M.; Mock, J.; Erbaugh, J. An inventory for measuring depression. Arch. Gen. Psychiatry 1961, 4, 561–571. [Google Scholar] [CrossRef] [PubMed]
  17. Beck, A.T.; Weissman, A.; Lester, D.; Trexler, L. The measurement of pessimism: The Hopelessness Scale. J. Consult. Clin. Psychol. 1974, 42, 861–865. [Google Scholar] [CrossRef] [PubMed]
  18. Rosenberg, M. Society and the Adolescent Self-Image; Princeton University Press: Princeton, NJ, USA, 1965. [Google Scholar]
  19. Fuster-Parra, P.; Tauler, P.; Bennasar-Veny, M.; Ligȩza, A.; López-González, A.; Aguiló, A. Bayesian network modeling: A case study of an epidemiologic system analysis of cardiovascular risk. Comput. Methods Programs Biomed. 2015, 6, 1–15. [Google Scholar] [CrossRef] [PubMed]
  20. Lauritzen, S.L. Propagation of Probabilities, Means, and Variances in Mixed Graphical Association Models. J. Am. Stat. Assoc. 1992, 87, 1098–1108. [Google Scholar] [CrossRef]
  21. Bøtcher, S.G.; Dethlefsen, C. deal: A Package for Learning Bayesian Networks. J. Stat. Softw. 2003, 8, 1–40. [Google Scholar] [CrossRef]
  22. Daly, R.; Shen, Q.; Aitken, S. Learning Bayesian networks: Approaches and issues. Knowl. Eng. Rev. 2011, 26, 99–157. [Google Scholar] [CrossRef]
  23. Bouckaert, R.R. Bayesian Belief Networks: From Construction to Inference. Ph.D. Thesis, Utrecht University, Utrecht, The Netherlands, 1995. [Google Scholar]
  24. Scutari, M.; Denis, J.B. Bayesian Networks: With Examples in R; Chapman and Hall: London, UK, 2014; p. 225. [Google Scholar]
  25. Ropero, R.; Renooij, S.; van der Gaag, L. Discretizing environmental data for learning Bayesian-network classifiers. Ecol. Model. 2018, 368, 391–403. [Google Scholar] [CrossRef]
  26. Beck, A.T.; Beamesderfer, A. Assessment of depression: The depression inventory. Mod. Probl. Pharmacopsychiatry 1974, 7, 151–169. [Google Scholar] [PubMed]
  27. Beck, A.T.; Steer, R.A. Manual for the Beck Hopelessness Scale; Psychological Corp.: San Antonio, TX, USA, 1988. [Google Scholar]
  28. Abraham, D. A Study on the Self-Esteem and Social Relations of Adolescents with Learning Disability. Master’s Thesis, Christ University, Utrecht, The Netherlands, 2010. [Google Scholar]
  29. Fayyad, U.; Irani, K. Multi-interval discretization of continuous-valued attributes for classification learning. In Proceedings of the 13th International Joint Conference on Artificial Intelligence (IJCAI), Chambery, France, 28 August–3 September 1993; pp. 1022–1027. [Google Scholar]
  30. Maass, W. Efficient agnostic PAC-learning with simple hypothesis. In Proceedings of the Seventh Annual Conference on Computational Learning Theory—COLT ’94, New Brunswick, NJ, USA, 12–15 July 1994; ACM Press: New York, NY, USA, 1994; pp. 67–75. [Google Scholar]
  31. Holte, R.C. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets. Mach. Learn. 1993, 11, 63–90. [Google Scholar] [CrossRef]
  32. Dougherty, J.; Kohavi, R.; Sahami, M. Supervised and Unsupervised Discretization of Continuous Features. In Machine Learning Proceedings 1995; Elsevier: Amsterdam, The Netherlands, 1995; pp. 194–202. [Google Scholar]
  33. Hartemink, A.J. Principled Computational Methods for the Validation and Discovery of Genetic Regulatory Networks. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2001. [Google Scholar]
  34. Scutari, M. Learning Bayesian Networks with the bnlearn R Package. J. Stat. Softw. 2010, 35, 1–22. [Google Scholar] [CrossRef]
  35. Sarkar, D. Lattice: Multivariate Data Visualization with R; Springer: Berlin, Germany, 2008; p. 25. [Google Scholar]
  36. Højsgaard, S. Graphical Independence Networks with the gRain Package for R. J. Stat. Softw. 2012, 46, 1–26. [Google Scholar]
  37. Gentry, J.; Long, L.; Gentleman, R.; Falcon, S.; Hahne, F.; Sarkar, D.; Rgraphviz, K.D.H. Provides Plotting Capabilities for R Graph Objects, R package version 2.0; 2009. [Google Scholar]
  38. Cronbach, L.J. Coefficient alpha and the internal structure of tests. Psychometrika 1951, 16, 297–334. [Google Scholar] [CrossRef]
  39. Bebbington, P. The origins of sex differences in depressive disorder: Bridging the gap. Int. Rev. Psychiatry 1996, 8, 295–332. [Google Scholar] [CrossRef]
  40. Nolen-Hoeksema, S. Sex differences in unipolar depression: Evidence and theory. Psychol. Bull. 1987, 101, 259–282. [Google Scholar] [CrossRef] [PubMed]
  41. Nazroo, J.Y.; Edwards, A.C.; Brown, G.W. Gender differences in the prevalence of depression: Artefact, alternative disorders, biology or roles? Sociol. Health Illn. 1998, 20, 312–330. [Google Scholar] [CrossRef]
  42. Angeleri, F.; Angeleri, V.A.; Foschi, N.; Giaquinto, S.; Nolfe, G. The influence of depression, social activity, and family stress on functional outcome after stroke. Stroke 1993, 24, 1478–1483. [Google Scholar] [CrossRef] [PubMed]
  43. Dumont, M.; Provost, M.A. Resilience in Adolescents: Protective Role of Social Support, Coping Strategies, Self-Esteem, and Social Activities on Experience of Stress and Depression. J. Youth Adolesc. 1999, 28, 343–363. [Google Scholar] [CrossRef]
  44. Bozo, O.; Toksabay, N.E.; Kurum, O. Activities of Daily Living, Depression, and Social Support Among Elderly Turkish People. J. Psychol. 2009, 143, 193–206. [Google Scholar] [CrossRef] [PubMed]
  45. Fanaj, N.; Melonashi, E.; Drevinja, F.; Haxhibeqiri, S. Depression, Anxiety, Hopelessness and Self-esteem in a Clinical Sample of Adolescents in Kosovo. Eur. Psychiatry 2015, 30, 1189. [Google Scholar] [CrossRef]
  46. Dori, G.A.; Overholser, J.C. Depression, hopelessness, and self-esteem: Accounting for suicidality in adolescent psychiatric inpatients. Suicide Life-Threat. Behav. 1999, 29, 309–318. [Google Scholar] [PubMed]
  47. De Santis, J.; Gonzalez-Guardo, R.M.; Vasquez, E.P. Psychosocial and cultural correlates of depression among Hispanic men with HIV infection: A pilot study. J. Psychiatr. Ment. Health Nurs. 2012, 19, 860–869. [Google Scholar] [CrossRef] [PubMed]
  48. Provencio-Vasquez, E.; Mata, H.J.; Tomaka, J.; De Santis, J.P. Depression, Self-Esteem, and Childhood Abuse Among Hispanic Men Residing in the U.S.-Mexico Border Region. J. Assoc. Nurses AIDS Care 2017, 28, 795–806. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Histograms of psychiatric variables. BDI, Beck Depression Inventory; BHS, Beck Hopelessness Scale; RSES, Rosenberg Self-Esteem Scale.
Figure 1. Histograms of psychiatric variables. BDI, Beck Depression Inventory; BHS, Beck Hopelessness Scale; RSES, Rosenberg Self-Esteem Scale.
Entropy 20 00189 g001
Figure 2. DAG of BN1.
Figure 2. DAG of BN1.
Entropy 20 00189 g002
Figure 3. Bar chart of conditional probabilities of D e p r e s s i o n given G e n d e r .
Figure 3. Bar chart of conditional probabilities of D e p r e s s i o n given G e n d e r .
Entropy 20 00189 g003
Figure 4. Bar chart of the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n .
Figure 4. Bar chart of the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n .
Entropy 20 00189 g004
Figure 5. Bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n .
Figure 5. Bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n .
Entropy 20 00189 g005
Figure 6. DAG of BN2.
Figure 6. DAG of BN2.
Entropy 20 00189 g006
Figure 7. Bar chart of the conditional probabilities of D e p r e s s i o n given S o c i a l _ a c t i v i t y .
Figure 7. Bar chart of the conditional probabilities of D e p r e s s i o n given S o c i a l _ a c t i v i t y .
Entropy 20 00189 g007
Figure 8. Bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n .
Figure 8. Bar chart of the conditional probabilities of H o p e l e s s n e s s given D e p r e s s i o n .
Entropy 20 00189 g008
Figure 9. Bar chart of the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n .
Figure 9. Bar chart of the conditional probabilities of S e l f _ e s t e e m given D e p r e s s i o n .
Entropy 20 00189 g009
Table 1. Demographic and socio-economic data description.
Table 1. Demographic and socio-economic data description.
VariableDescriptionState Description
A g e Age of student20−, 20–23, 23+
G e n d e r Gender of studentMale, female
F a m i l y _ s t r u c t u r e Family structure of the studentNuclear, extended, single parent, mother or father died
F a t h e r s _ e d u c a t i o n a l _ s t a t u s Educational level of student’s fatherIlliterate, primary school, secondary school, high school, university, Master’s or PhD
M o t h e r s _ e d u c a t i o n a l _ s t a t u s Educational level of student’s motherIlliterate, primary school, secondary school, high school, university, Master’s or PhD
F a m i l y _ i n c o m e Student’s family’s monthly income in TL*0–1400TL, 1401–3000TL, 3001–5000TL, 5000 + TL
S t u d e n t s _ i n c o m e Student’s monthly income in TL*0–500TL, 501–750TL, 751–1000TL, 1000 + TL
N u m b e r _ o f _ s i b l i n g s Number of siblings student has1, 2, 3, 4, 5+
T y p e _ o f _ s e t t l e m e n t Type of settlement students were born inVillage or town, district, city, metropolis
F a t h e r s _ o c c u p a t i o n Occupation of student’s fatherUnemployed, self-employed, public sector employee, private sector employee
M o t h e r s _ o c c u p a t i o n Occupation of student’s motherUnemployed, self-employed, public sector employee, private sector employee
S c h o o l _ t y p e Type of school student studying atFaculty of Arts and Science, Faculty of Engineering, Faculty of Economics and Administrative sciences, Health School, Vocational School
T y p e _ o f _ a c c o m m o d a t i o n Where student is livingGovernment dormitory, private dormitory, flat
S m o k i n g _ s t a t u s Whether or not student is a smokerSmoker, not smoker
A l c o h o l _ u s e Whether or not student consumes alcoholAlcohol user, not alcohol user
S o c i a l _ a c t i v i t y Whether or not student has any social activityHas social activity, no social activity
S t u d e n t s _ j o b Whether or not student has a part-time or full-time jobHas job, no job
R e l a t i o n s h i p _ s t a t u s Student’s relationship statusSingle, in a relationship
TL: Turkish Lira (currency)

Share and Cite

MDPI and ACS Style

Bilek, G.; Karaman, F. An Investigation into the Relationship among Psychiatric, Demographic and Socio-Economic Variables with Bayesian Network Modeling. Entropy 2018, 20, 189. https://doi.org/10.3390/e20030189

AMA Style

Bilek G, Karaman F. An Investigation into the Relationship among Psychiatric, Demographic and Socio-Economic Variables with Bayesian Network Modeling. Entropy. 2018; 20(3):189. https://doi.org/10.3390/e20030189

Chicago/Turabian Style

Bilek, Gunal, and Filiz Karaman. 2018. "An Investigation into the Relationship among Psychiatric, Demographic and Socio-Economic Variables with Bayesian Network Modeling" Entropy 20, no. 3: 189. https://doi.org/10.3390/e20030189

APA Style

Bilek, G., & Karaman, F. (2018). An Investigation into the Relationship among Psychiatric, Demographic and Socio-Economic Variables with Bayesian Network Modeling. Entropy, 20(3), 189. https://doi.org/10.3390/e20030189

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop