Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures

Rupšys, Petras

doi:10.3390/math7080761

Open AccessFeature PaperArticle

Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures

by

Petras Rupšys

Agriculture Academy, Vytautas Magnus University, Universiteto g. 20-214, 53361 LT Akademija, Kaunas District, Lithuania

Mathematics 2019, 7(8), 761; https://doi.org/10.3390/math7080761

Submission received: 22 July 2019 / Revised: 15 August 2019 / Accepted: 16 August 2019 / Published: 19 August 2019

(This article belongs to the Special Issue Stochastic Differential Equations and Their Applications)

Download

Browse Figures

Versions Notes

Abstract

:

This study focuses on the stochastic differential calculus of Itô, as an effective tool for the analysis of noise in forest growth and yield modeling. Idea of modeling state (tree size) variable in terms of univariate stochastic differential equation is exposed to a multivariate stochastic differential equation. The new developed multivariate probability density function and its marginal univariate, bivariate and trivariate distributions, and conditional univariate, bivariate and trivariate probability density functions can be applied for the modeling of tree size variables and various stand attributes such as the mean diameter, height, crown base height, crown width, volume, basal area, slenderness ratio, increments, and much more. This study introduces generalized multivariate interaction information measures based on the differential entropy to capture multivariate dependencies between state variables. The present study experimentally confirms the effectiveness of using multivariate interaction information measures to reconstruct multivariate relationships of state variables using measurements obtained from a real-world data set.

Keywords:

multivariate bertalanffy-type stochastic differential equation; marginal distributions; conditional distributions; entropy; normalized interaction information

1. Introduction

Stand attributes prediction has been a popular and challenging research topic in both forestry science and economics due to its importance to forest managers, governments, as well as economic stakeholders in recent years. Sustainable forest management process requires growth and yield models that enable prediction of the development of forest stands under different natural environment, economic and sociocultural pillars. Diameter at the breast height, total tree height, crown base height and crown width size dimensions (in the sequel—tree size variables), and the number of trees per hectare are substantial components of stand growth and yield models whose evolution provide details on stand development [1]. These tree size variables are the most important predictor variables for the estimation of stem volume, biomass and carbon storage in natural forests. Rational management needs the dynamical individual tree growth and yield models because they provide the evolution for forward and backward directions, and produce detailed information about changes of stand structure [2]. Tree size variables can be modeled as a complex system, each with its own regulatory mechanism and all continuously interacting between them. The mathematical and numerical methods used to describe the dynamic of biological system are largely concerned with the derivation, and use of ordinary stochastic and partial differential equations [3,4]. Individual-tree and stand-level growth models traditionally are represented by a system of ordinary differential equations [5]. The basic idea is to describe a system of ordinary differential equations, which specifies changes of a suitable number of tree or stand size variables via age (time) and to summarize the relevant information about the size variables’ dependence. One of the advantages of ordinary differential equation approach lies in parameter interpretation, which simplifies the results’ interpretation by using asymptotic and inflection points. Unfortunately, an ordinary differential equations approach has some limitations, including the absence of the factor of tree size variables dependence and the variance-covariance matrix of tree size variables [6,7,8]. In addition, eco-regional growth and yield models must be updated by including the random factor of a stand quality [9]. Statistical analysis of developed relationships between tree size components at a given stand or location is usually performed based on statistical indexes and tests for an observed dataset. However, no single equation for tree size variables has gained global acceptance. The traditional method is to try a variety of models and choose the best fitted equation based on a particular mathematical norm, such as the least square error or a likelihood norm. The disadvantages of this method of choosing are that it is laborious because too many equations need to be tried and empirical choices of candidate equations make the results subjective. In order to overcome these disadvantages, the multivariate stochastic differential equations have recently gained a lot of attention. Stochastic differential equations are often used in the modeling of population dynamic [10,11,12], tumor growth [13], chemical reaction networks [14], environmental pollution [15,16], forest growth and yield [17]. The deterministic differential equation carries its solution, which is completely determined in the value sense by knowledge of boundary and initial conditions. It means that the identical initial and boundary conditions generate identical solutions. Conversely, a stochastic differential equation (SDE) is a differential equation with a solution which is a stochastic process. Because tree diameter at breast height, total tree height, crown base height and crown width are empirically correlated, the multivariate SDE models should be considered [6,8,18].

The greatest advantage of multivariate SDE approach is that it provides sufficient flexibility to fit a large variety of nested models for a separate tree size and stand size variable, which facilitates the selection and comparison of newly developed models by using information measures technique [19,20,21]. In order to construct the multi-information measures, it is necessary to obtain the probability density function of tree size variable, which in this study are obtained from a 4-variate Bertalanffy SDE describing the development of the tree size variables against the age. SDEs models are much more flexible than deterministic models, but come at a computational cost. The problem of representing the mechanisms governing the evolution of univariate tree size distribution have been directed using univariate SDEs in fluid mechanics [22,23]. Central research finding in tree size distributions by Kohyama et al. [24] is the fact that they are positively skewed. Theoretical studies of tree size growth confirmed that the size frequency distribution of trees is inverse J-shaped, with many small trees and few larger trees due asymmetric competition [25]. The Vasicek type 4-variate fixed effects SDE presented by Rupšys and Petrauskas [6] defines changes in stem diameter and height distribution with age of a stand, which takes into account the 4-variate normal distribution at a given stand age, t. This study focuses on the alternative nonsymetric Bertalanfy type 4-variate diffusion process which links between tree diameter, height, crown base height and crown width dynamics, and their 4-variate lognormal probability density function development. Traditionally stochastic tree growth processes are observed in multiple populations (stands), so to quantify of both between and within stand variation the framework of the random effect parameters have been studied [6,11,26,27,28]. In this basis, the introduction of only one additional random effect parameter allows capturing arbitrary wide stand dependencies without increasing model order, hence retaining model simplicity and ease of parameters estimation. The fixed and mixed parameters estimation for discretely observed SDE is a complex problem and during the past decades it has attracted the attention a lot of researchers. Taking the applicability and generality into account, maximum-likelihood estimation is in the lead among others [29,30]. Generally, when both system noise and random effects are considered, the exact form of the maximum likelihood function is unavailable, and then an approximated maximum likelihood procedure is used [31].

This study focuses on a mixed-effects parameters 4-variate Bertalanffy type diffusion process satisfying an Itô [32] SDE conditional on an initial value taken at a fixed initial time (age) point. In an even-aged stand tree size components’ distribution shows some asymmetry. The goal of this paper is to present a unified perspective of the tree growth in a forest stand network as a nonsymetric Markov process in a multidimensional vector space. Another goal is to study in a general way the main methods of cross comparisons of all new developed growth models by using the Shannon type differential entropy. In the Results and Discussion, we consider possible application to the study of information sharing amongst tree size variables using a dataset of the diameter at breast height, tree height, crown base height and crown width measurements in Scots pine (Pinus Sylvestris L.) stands in Lithuania. All results are implemented in symbolic algebra system MAPLE.

2. Materials and Methods

This paper focuses on a 4-variate Bertalanffy type SDE to study the tree size variables (diameter at breast height, D(t), tree height, H(t), crown base height, CH, and crown width, CW) distribution problem in forest stands. This results in an exact 4-variate asymmetrical conditional (transition) probability density function, whose parameters can be estimated by maximum likelihood procedure based on discrete time observations. The random effects are included to describe between-stand variability. Proceeding as we have in the bivariate Bertalanffy type SDE model [8] that describes the development of the tree size variables evolving in M different stands, the mixed effect parameters 4-variate Bertalanffy type SDE model in a general manner are defined by:

d X^{i} (t) = A (X^{i} (t)) d t + Q {(X^{i} (t))}^{\frac{1}{2}} \cdot d W (t), i = 1, 2, \dots, M

(1)

here: M is the total number of stands used for model fitting, t is the time (stand age),

X (t) = {(X_{1} (t), X_{2} (t), X_{3} (t), X_{4} (t))}^{T} = {(D (t), H (t), C H (t), C W (t))}^{T}

,

t \in [t_{0}; T]

,

t_{0} \geq 0

,

X (t_{0}) = x_{0} = {(x_{10}, x_{20}, x_{30}, x_{40}))}^{T}

,

x_{s 0} \geq 0

,

1 \leq s \leq 4

, the drift vector Aⁱ (x) is defined as:

A^{i} (x) = {(\frac{(α_{1} + φ_{1}^{i}) β_{1} γ_{1}}{e^{β_{1} (t - t_{0})} - γ_{1}} x_{1}, \frac{(α_{2} + φ_{2}^{i}) β_{2} γ_{2}}{e^{β_{2} (t - t_{0})} - γ_{2}} x_{2}, \frac{(α_{3} + φ_{3}^{i}) β_{3} γ_{3}}{e^{β_{3} (t - t_{0})} - γ_{3}} x_{3}, \frac{(α_{4} + φ_{4}^{i}) β_{4} γ_{4}}{e^{β_{4} (t - t_{0})} - γ_{4}} x_{4})}^{T}

(2)

the diffusion matrix

Q (x)

is defined as:

Q (x) = (C (x) B^{\frac{1}{2}}) {(C (x) B^{\frac{1}{2}})}^{T} = C (x) B C (x) = (\begin{matrix} σ_{11} x_{1}^{2} & σ_{12} x_{1} x_{2} & σ_{13} x_{1} x_{3} & σ_{14} x_{1} x_{4} \\ σ_{21} x_{1} x_{2} & σ_{22} x_{2}^{2} & σ_{23} x_{2} x_{3} & σ_{24} x_{2} x_{4} \\ σ_{31} x_{1} x_{3} & σ_{32} x_{2} x_{3} & σ_{33} x_{3}^{2} & σ_{31} x_{3} x_{4} \\ σ_{41} x_{1} x_{4} & σ_{42} x_{2} x_{4} & σ_{43} x_{3} x_{4} & σ_{44} x_{4}^{2} \end{matrix})

(3)

B = (\begin{matrix} σ_{11} & σ_{12} & σ_{13} & σ_{14} \\ σ_{12} & σ_{22} & σ_{23} & σ_{24} \\ σ_{13} & σ_{23} & σ_{33} & σ_{34} \\ σ_{14} & σ_{24} & σ_{34} & σ_{44} \end{matrix}), C (x) = (\begin{matrix} x_{1} & 0 & 0 & 0 \\ 0 & x_{2} & 0 & 0 \\ 0 & 0 & x_{3} & 0 \\ 0 & 0 & 0 & x_{4} \end{matrix})

W^{i} (t) = {(W_{1}^{i} (t), W_{2}^{i} (t), W_{3}^{i} (t), W_{4}^{i} (t))}^{T}

,

t \in [t_{0}; T]

, i = 1, 2, …, M, are independent 4-variate Brownian motions,

φ_{s}^{i}

,

1 \leq s \leq 4

, i = 1, 2, …, M, are independent and normally distributed random variables with zero mean and constant variances (

φ_{s}^{i} ~ N (0; σ_{s}^{2})

),

{α_{1}, α_{2}, α_{3}, α_{4}, β_{1}, β_{2}, β_{3}, β_{4}, γ_{1}, γ_{2}, γ_{3}, γ_{4}, σ_{11}, σ_{12}, σ_{13}, σ_{14}, σ_{22}, σ_{23}, σ_{24}, σ_{33}, σ_{34}, σ_{44}, σ_{1}, σ_{2}, σ_{3}, σ_{4}}

are fixed effect parameters to be estimated which fulfill conditions:

t \geq t_{0} > \min {\frac{\ln (γ_{1})}{β_{1}}, \frac{\ln (γ_{2})}{β_{2}}, \frac{\ln (γ_{3})}{β_{3}}, \frac{\ln (γ_{4})}{β_{4}}}

,

β_{1}, β_{2}, β_{3}, β_{4} > 0

,

α_{1} + φ_{1}^{i}, α_{2} + φ_{2}^{i}, α_{3} + φ_{3}^{i}, α_{4} + φ_{4}^{i} \geq 1

, and

W^{i} (t)

,

φ_{s}^{i}

, are mutually independent for all 1 ≤ i ≤ M,

1 \leq s \leq 4

. The Bertalanffy type 4-variate SDE can be converted into a well-studied 4-variate Ornstein-Uhlenbeck (1930) [33] process by the transformation

Y (t) = {(e^{β_{i} t} \ln X_{i} (t), i = 1, \dots, 4)}^{T}

and solved explicitly. The solution is a conditional random vector

(X^{i} (t) | X^{i} (t_{0}) = x_{0}) = {(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, s = 1, \dots, 4)}^{T}

that has a 4-variate lognormal distribution

L N_{4} (μ^{i} (t); Σ (t))

, i = 1, 2, … M, with the mean vector

μ^{i} (t)

:

μ^{i} (t) = {(μ_{s}^{i} (t), 1 \leq s \leq 4)}^{T} = {(\ln (x_{s 0}) + (α_{s} + φ_{s}^{i}) \ln (\frac{1 - γ_{s} \exp (- β_{s} t)}{1 - γ_{s} \exp (- β_{s} t_{0})}) - \frac{σ_{s s}}{2} (t - t_{0}), 1 \leq s \leq 4)}^{T}

(4)

the variance-covariance matrix

Σ (t)

:

Σ (t) = {(v_{s u} (t))}_{s, u = 1, \dots, 4} = {(σ_{s u} (t - t_{0}))}_{s, u = 1, \dots, 4}, σ_{s u} = σ_{u s}, v_{s u} (t) = v_{u s} (t)

(5)

and the probability density function:

f (x_{1}, x_{2}, x_{3}, x_{4}, t | θ^{f}, φ^{i}) = \frac{1}{{(2 π)}^{2} {| Σ (t) |}^{\frac{1}{2}} (x_{1} \cdot x_{2} \cdot x_{3} \cdot x_{4})} \exp (- \frac{1}{2} Ω (x_{1}, x_{2}, x_{3}, x_{4}, t))

(6)

Here

Ω (x_{1}, x_{2}, x_{3}, x_{4}, t) = {(\ln (x) - μ^{i} (t))}^{T} {(Σ (t))}^{- 1} (\ln (x) - μ^{i} (t)),

θ^{f} = {α_{1}, α_{2}, α_{3}, α_{4}, β_{1}, β_{2}, β_{3}, β_{4}, γ_{2}, γ_{3}, γ_{4}, σ_{11}, σ_{12}, σ_{13}, σ_{14}, σ_{22}, σ_{23}, σ_{24}, σ_{33}, σ_{34}, σ_{44}}

φ^{i} = {φ_{1}^{i}, φ_{2}^{i}, φ_{3}^{i}, φ_{4}^{i}}

3. Results

3.1. Marginal Distribution

Allowing that the random vector

(X^{i} (t) | X^{i} (t_{0}) = x_{0}) = {(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, s = 1, \dots, 4)}^{T}

, i = 1, 2, …, M has a 4-variate lognormal distribution,

L N_{4} (μ^{i} (t); Σ (t))

, defined by Equations (4)–(6) and referred to properties of multivariate lognormal distribution [34,35], the marginal univariate distribution of

(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0})

,

1 \leq s \leq 4

is also lognormal

L N_{1} (μ_{s}^{i} (t); v_{s s} (t))

with mean and variance given by the following forms:

μ_{s}^{i} (t) = \ln (x_{s 0}) + (α_{s} + φ_{s}^{i}) \ln (\frac{1 - γ_{s} \exp (- β_{s} t)}{1 - γ_{s} \exp (- β_{s} t_{0})}) - \frac{σ_{s s}}{2} (t - t_{0})

(7)

v_{s s} (t) = σ_{s s} (t - t_{0})

(8)

The marginal mean, median, mode, p-quantile (0 < p < 1) and variance trajectories

m_{s}^{i} (t)

,

m e_{s}^{i} (t)

,

m o_{s}^{i} (t)

,

m q_{s}^{i} (t, p)

and

w_{s} (t)

,

1 \leq s \leq 4

are defined by [34]:

m_{s}^{i} (t) \equiv Ε (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) = \exp (μ_{s}^{i} (t) + \frac{1}{2} v_{s s} (t))

(9)

m e_{s}^{i} (t) \equiv M e d i a n (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) = \exp (μ_{s}^{i} (t))

(10)

m o_{s}^{i} (t) \equiv M o d e (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) = \exp (μ_{s}^{i} (t) - v_{s s} (t))

(11)

m q_{s}^{i} (t, p) \equiv Q u a n t i l e (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) = \exp (μ_{s}^{i} (t) + \sqrt{v_{s s} (t)} Φ^{- 1} (p))

(12)

w_{s}^{i} (t) \equiv V a r (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) = \exp (2 μ_{s}^{i} (t) + v_{s s} (t)) \cdot (\exp (v_{s s} (t)) - 1)

(13)

where:

Φ^{- 1} (\cdot)

is the inverse of standard normal distribution function.

The marginal bivariate distribution of

(X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0})

,

1 \leq s, u \leq 4

, i = 1, 2, …, M is also lognormal

N_{2} (μ_{s u}^{i} (t); Σ_{s u} (t))

with mean vector

μ_{s u}^{i} (t)

and covariance matrix

Σ_{s u} (t)

given by the following forms:

μ_{s u}^{i} (t) = (\begin{array}{l} \ln (x_{s 0}) + (α_{s} + φ_{s}^{i}) \ln (\frac{1 - γ_{s} \exp (- β_{s} t)}{1 - γ_{s} \exp (- β_{s} t_{0})}) - \frac{σ_{s s}}{2} (t - t_{0}) \\ \ln (x_{u 0}) + (α_{u} + φ_{u}^{i}) \ln (\frac{1 - γ_{u} \exp (- β_{u} t)}{1 - γ_{u} \exp (- β_{u} t_{0})}) - \frac{σ_{u u}}{2} (t - t_{0}) \end{array})

(14)

Σ_{s u} (t) = (\begin{matrix} v_{s s} (t) & v_{s u} (t) \\ v_{u s} (t) & v_{u u} (t) \end{matrix})

(15)

The covariance and correlation functions are given by:

\begin{array}{l} {cov}_{s u} (t) \equiv C o v (X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0}) \\ = \exp (μ_{s} (t) + μ_{u} (t) + \frac{1}{2} (v_{s s} (t) + v_{u u} (t)) + v_{s u} (t)) (\exp (v_{s u} (t)) - 1) \end{array}

(16)

\begin{array}{l} ρ_{i j} (t) \equiv C o r (X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0}) \\ = \frac{C o v (X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0})}{\sqrt{V a r (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}) \cdot V a r (X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0})}} = \frac{\exp (v_{s u} (t)) - 1}{\sqrt{(\exp (v_{s s} (t)) - 1)} \sqrt{(\exp (v_{u u} (t)) - 1)}} \end{array}

(17)

The marginal trivariate distribution of

(X_{s}^{i} (t), X_{u}^{i} (t), X_{z}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0}, X_{z}^{i} (t_{0}) = x_{z 0})

,

1 \leq s, u, z \leq 4

, i = 1, 2, …, M is also lognormal

L N_{3} (μ_{s u z}^{i} (t); Σ_{s u z} (t))

with mean vector,

μ_{s u z}^{i} (t)

, and covariance matrix,

Σ_{s u z} (t)

, given by the following forms:

μ_{s u z}^{i} (t) = (\begin{array}{l} \ln (x_{s 0}) + (α_{s} + φ_{s}^{i}) \ln (\frac{1 - γ_{s} \exp (- β_{s} t)}{1 - γ_{s} \exp (- β_{s} t_{0})}) - \frac{σ_{s s}}{2} (t - t_{0}) \\ \ln (x_{u 0}) + (α_{u} + φ_{u}^{i}) \ln (\frac{1 - γ_{u} \exp (- β_{u} t)}{1 - γ_{u} \exp (- β_{u} t_{0})}) - \frac{σ_{u u}}{2} (t - t_{0}) \\ \ln (x_{z 0}) + (α_{z} + φ_{z}^{i}) \ln (\frac{1 - γ_{z} \exp (- β_{z} t)}{1 - γ_{z} \exp (- β_{z} t_{0})}) - \frac{σ_{z z}}{2} (t - t_{0}) \end{array})

(18)

Σ_{s u z} (t) = (\begin{matrix} v_{s s} (t) & v_{s u} (t) & v_{s z} (t) \\ v_{u s} (t) & v_{u u} (t) & v_{u z} (t) \\ v_{z s} (t) & v_{z u} (t) & v_{z z} (t) \end{matrix})

(19)

3.2. Conditional Distributions

The conditional univariate distribution of

(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0})

,

1 \leq s \leq 4

, i = 1, 2, …, M at a given

(X_{u}^{i} (t) = x_{u})

,

u \in {1, 2, 3, 4} \ {s}

is a univariate lognormal

L N_{1} (η^{i} (t, x_{u}); λ_{s u} (t))

, respectively, with mean and variance given by the following forms [34,35]:

η^{i} (t, x_{u}) = μ_{s}^{i} (t) + \frac{v_{s u} (t)}{v_{u u} (t)} (\ln (x_{u}) - μ_{u}^{i} (t))

(20)

λ_{s u} (t) = v_{s s} (t) - \frac{{(v_{s u} (t))}^{2}}{v_{u u} (t)}

(21)

The conditional mean, median, mode, p-quantile (0 < p < 1) and variance functions,

m_{s}^{i} (t, x_{u})

,

m e_{s}^{i} (t, x_{u})

,

m o_{s}^{i} (t, x_{u})

,

m q_{s}^{i} (t, p, x_{u})

and

w_{s}^{i} (t, x_{u})

,

1 \leq s \leq 4

, i = 1, 2, …, M, are defined by Equations (9)–(12) after plugging the mean and variance given by Equations (20) and (21).

The conditional univariate distribution of

(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0})

,

1 \leq s \leq 4

, i = 1, 2, …, M at a given

(X_{u}^{i} (t) = x_{u}, X_{z}^{i} (t) = x_{z})

,

u, z \in {1, 2, 3, 4} \ {s}

is a univariate lognormal

L N_{1} (η^{i} (t, x_{u}, x_{z}); λ_{s, u z} (t))

, here:

\ln (x_{u z}) = {(\ln (x_{u}), \ln (x_{z}))}^{T}

(22)

η^{i} (t, x_{u}, x_{z}) = μ_{s}^{i} (t) + Σ_{s, u z} (t) {[Σ_{u z} (t)]}^{- 1} (\ln (x_{u z}) - μ_{u z}^{2} (t))

(23)

λ_{s, u z} (t) = v_{s s} (t) - Σ_{s, u z} (t) {[Σ_{u z} (t)]}^{- 1} {(Σ_{s, u z} (t))}^{T}

(24)

Σ_{s, u z} (t) = (\begin{matrix} v_{s u} (t) & v_{s z} (t) \end{matrix})

(25)

The conditional mean, median, mode, p-quantile (0 < p < 1) and variance functions,

m_{s}^{i} (t, x_{u}, x_{z})

,

m e_{s}^{i} (t, x_{u}, x_{z})

,

m o_{s}^{i} (t, x_{u}, x_{z})

,

m q_{s}^{i} (t, p, x_{u}, x_{z})

and

w_{s}^{i} (t, x_{u}, x_{z})

,

1 \leq s \leq 4

, i = 1, 2, …, M,

u, z \in {1, 2, 3, 4} \ {s}

, are defined by Equations (9)–(12) after plugging the mean and variance given by Equations (23) and (24).

The conditional univariate distribution of

(X_{s}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0})

,

1 \leq s \leq 4

, i = 1, 2, …, M at a given

(X_{u}^{i} (t) = x_{u}, X_{z}^{i} (t) = x_{z}, X_{y}^{i} (t) = x_{y})

,

u, z, y \in {1, 2, 3, 4} \ {s}

is a univariate lognormal

N_{1} (η^{i} (t, x_{u}, x_{z}, x_{y}); λ_{s, u z y} (t))

, here:

\ln (x_{u z y}) = {(\ln (x_{u}), \ln (x_{z}), \ln (x_{y}))}^{T}

(26)

η^{i} (t, x_{u}, x_{z}, x_{y}) = μ_{s}^{i} (t) + Σ_{s, u z y} (t) {[Σ_{u z y} (t)]}^{- 1} [\ln (x_{u z y}) - μ_{u z y}^{3, i} (t)]

(27)

λ_{s, u z y} (t) = v_{s s} (t) - Σ_{s, u z y} (t) {[Σ_{u z y} (t)]}^{- 1} {(Σ_{s, u z y} (t))}^{T}

(28)

Σ_{s, u z y} (t) = (\begin{matrix} v_{s u} (t) & v_{s z} (t) & v_{s y} (t) \end{matrix})

(29)

The conditional mean, median, mode, p-quantile (0 < p < 1) and variance functions,

m_{s}^{i} (t, x_{u}, x_{z}, x_{y})

,

m e_{s}^{i} (t, x_{u}, x_{z}, x_{y})

,

m o_{s}^{i} (t, x_{u}, x_{z}, x_{y})

,

m q_{s}^{i} (t, p, x_{u}, x_{z}, x_{y})

and

w_{s}^{i} (t, x_{u}, x_{z}, x_{y})

,

1 \leq s \leq 4

, i = 1, 2, …, M,

u, z, y \in {1, 2, 3, 4} \ {s}

, are defined by Equations (9)–(12) after plugging the mean and variance given by Equations (27) and (28).

The conditional bivariate distribution of

(X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0})

,

1 \leq s, u \leq 4

, i = 1, 2, …, M at a given

(X_{z}^{i} (t) = x_{z})

,

z \in {1, 2, 3, 4} \ {s, u}

is a bivariate lognormal

L N_{2} (Η_{s u z}^{i} (t, x_{z}); Λ_{s u z} (t))

, here:

Η_{s u z}^{i} (t, x_{z}) = μ_{s u}^{i} (t) + Σ_{s u z}^{2} (t) \frac{\ln (x_{z}) - μ_{z}^{i} (t)}{v_{z z} (t)}

(30)

Λ_{s u z} (t) = Σ_{s u z}^{2} (t) - Σ_{21} (t) {[v_{z z} (t)]}^{- 1} {(Σ_{s u z}^{2} (t))}^{T}

(31)

Σ_{s u}^{2} (t) = (\begin{matrix} v_{s z} (t) \\ v_{u z} (t) \end{matrix})

(32)

The conditional bivariate distribution of

(X_{s}^{i} (t), X_{u}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0})

,

1 \leq s, u \leq 4

, i = 1, 2, …, M at a given

(X_{z}^{i} (t) = x_{z}, X_{y}^{i} (t) = x_{y})

,

z, y \in {1, 2, 3, 4} \ {s, u}

is a bivariate lognormal

L N_{2} (Η_{s u, z y}^{i} (t, x_{z}, x_{y}); Λ_{s u, z y} (t))

, here:

Η_{s u, z y}^{i} (t, x_{z}, x_{y}) = μ_{s u}^{i} (t) + Σ_{s u, z y} (t) {[Σ_{z y} (t)]}^{- 1} (\begin{matrix} \ln (x_{z}) - μ_{z}^{i} (t) \\ \ln (x_{y}) - μ_{y}^{i} (t) \end{matrix})

(33)

Λ_{s u, z y} (t) = Σ_{s u} (t) - Σ_{s u, z y} {[Σ_{z y} (t)]}^{- 1} {(Σ_{s u, z y})}^{T}

(34)

Σ_{s u, z y} (t) = (\begin{matrix} v_{s z} (t) & v_{s y} (t) \\ v_{u z} (t) & v_{u y} (t) \end{matrix})

(35)

The conditional trivariate distribution of

(X_{s}^{i} (t), X_{u}^{i} (t), X_{z}^{i} (t) | X_{s}^{i} (t_{0}) = x_{s 0}, X_{u}^{i} (t_{0}) = x_{u 0}, X_{z}^{i} (t_{0}) = x_{z 0})

,

1 \leq s, u, z \leq 4

, i = 1, 2, …, M at a given

(X_{y}^{i} (t) = x_{y})

,

y \in {1, 2, 3, 4} \ {s, u, z}

is a trivariate lognormal

L N_{3} (Η_{s u z, y}^{i} (t, x_{z}); Λ_{s u z, y} (t))

, here:

Η_{s u z, y}^{i} (t, x_{y}) = μ_{s u z}^{3, i} (t) + Σ_{s u z, y} (t) \frac{\ln (x_{y}) - μ_{y}^{i} (t)}{v_{y y} (t)}

(36)

Λ_{s u z, y} (t) = Σ_{s u z} (t) - Σ_{s u z, y} (t) {[v_{y y} (t)]}^{- 1} {(Σ_{s u z, y} (t))}^{T}

(37)

Σ_{s u z, y} (t) = (\begin{matrix} v_{s y} (t) \\ v_{u y} (t) \\ w_{z y}^{2} (t) \end{matrix})

(38)

3.3. Maximum Likelihood Procedure

Most natural processes evolve in continuous time, but they are observed in discrete time. To examine practical applications of the Bertalanffy type 4-variate stochastic process defined by Equation (1) suppose that we observe the process at discrete time points

{t_{1}^{i}, t_{2}^{i}, \dots, t_{n_{i}}^{i}}

composing an estimation dataset

{(x_{11}^{i}, x_{21}^{i}, x_{31}^{i}, x_{41}^{i}), (x_{12}^{i}, x_{22}^{i}, x_{32}^{i}, x_{42}^{i}), \dots, (x_{1 n_{i}}^{i}, x_{2 n_{i}}^{i}, x_{3 n_{i}}^{i}, x_{4 n_{i}}^{i})}

(n_i is the number of observed trees of the ith stand, i = 1, 2, …, M). The associated maximum log-likelihood function for the fixed effect scenario model takes the following form:

L L_{f} (θ^{f}) = \sum_{i = 1}^{M} \sum_{j = 1}^{n_{i}} \ln (f (x_{1 j}^{i}, x_{2 j}^{i}, x_{3 j}^{i}, x_{4 j}^{i}, t_{j}^{i} | θ^{f}, 0, 0, 0, 0)) .

(39)

and for the mixed effect scenario model takes the following form:

L L_{m} (θ^{m}) = \sum_{i = 1}^{M} \int_{R^{4}} (\sum_{j = 1}^{n_{i}} \ln (f (x_{1 j}^{i}, x_{2 j}^{i}, x_{3 j}^{i}, x_{4 j}^{i}, t_{j}^{i} | θ, φ_{1}^{i}, φ_{2}^{i}, φ_{3}^{i}, φ_{4}^{i}) + \sum_{l = 1}^{4} \ln (p (φ_{l}^{i} | σ_{l}^{2})))) d φ_{1}^{i} d φ_{2}^{i} d φ_{3}^{i} d φ_{4}^{i}

(40)

here

θ^{m} = θ^{f} \cup {σ_{1}, σ_{2}, σ_{3}, σ_{4}}

. As the 4-variate integral in Equation (40) does not have a closed-form solution and the analytic expression is known, the maximum log-likelihood function for the 4-variate mixed effect scenario model by using the Laplace expansion is approximately given in the following form [36]:

L L_{m} (θ^{m}, \hat{ψ}) \approx \sum_{i = 1}^{M} (g (φ^{i} | θ^{m}) + 2 \ln (2 π) - \frac{1}{2} \ln (\det {([- \frac{\partial^{2} g (φ^{i} | θ^{m})}{\partial φ_{j}^{i} \partial φ_{k}^{i}}])}_{φ^{i} = \hat{φ^{i}}}))

(41)

here:

φ^{i} = (φ_{1}^{i}, φ_{2}^{i}, φ_{3}^{i}, φ_{4}^{i})

. The random effects

ψ = (φ^{1}, φ^{2}, \dots, φ^{M})

are estimated by maximization:

\hat{ψ} = \underset{φ^{i}}{\arg \max} g (φ^{i} | \hat{θ^{m}}), i = 1, 2, \dots, M,

(42)

g (φ^{i} | θ^{m}) = \sum_{j = 1}^{n_{l}} \ln (f (x_{1 j}^{i}, x_{2 j}^{i}, x_{3 j}^{i}, x_{4 j}^{i}, t_{j}^{i} | θ^{m}, φ^{i})) + \sum_{k = 1}^{4} \ln (p (φ_{k}^{i} | σ_{k}^{2})) .

(43)

The maximization of

L L_{m} (θ^{m}, ψ)

is a two-step optimization problem. The internal optimization step estimates the vector

φ^{i}

for every stand i = 1, 2, …, M with Equation (42). The external optimization step maximizes

L L_{m} (θ^{m}, \hat{ψ})

after plugging the

\hat{φ^{i}}

into Equation (41). These two steps are iterated until convergence.

3.4. Random Effects Calibration

A key feature of mixed effects models is that, by introducing random effects in addition to fixed effects, they allow us to correctly account both within- and between forest stand variations. In the forestry literature, calibration means that random effects are calibrated using a supplementary sample of observations taken from the previous observations

{(x_{11}, x_{21}, x_{31}, x_{41}), (x_{12}, x_{22}, x_{3 n}, x_{42}), \dots, (x_{1 m}, x_{2 m}, x_{3 m}, x_{4 m})}

at discrete previous times (ages)

{t_{1}^{}, t_{2}^{}, \dots, t_{m}^{}}

. The random effects can be calibrated by:

\hat{Ψ} = \underset{(φ_{1}, φ_{2}, φ_{3}, φ_{4})}{\arg \max} \sum_{j = 1}^{m} \ln (f (x_{1 j}, x_{2 j}, x_{3 j}, x_{4 j}, t_{j} | \hat{θ^{m}}, φ_{1}, φ_{2}, φ_{3}, φ_{4})) + \sum_{i = 1}^{4} \ln (p (φ_{i} | \hat{σ_{i}^{2}}))

(44)

In the previous study [17], calibration relies on the mean trend Equation (9) to predict the random effects in relation to fixed effects parameters

\hat{θ^{m}}

estimated by approximated maximum likelihood procedure (see Equations (41) and (42)). Both alternative techniques deal adequately with random effects calibration, whose are essential for analyzing large observed datasets.

3.5. Estimating Results

The data used were obtained from 17 permanent experimental Scots pine (Pinus sylvestris L.) stands [6]. The measurements of the diameter at breast height (D), total height (H), crown base height (CH), and crown width (CW) are presented in Figure 1.

The results of the parameter estimates using the NLPSolve procedure in the symbolic algebra system MAPLE [37] are summarized in Table 1.

3.6. Information Measures

Multivariate data analysis presents a wide range of mathematical and practical problems, particularly in forestry. Datasets sampled from forest stands measurements reflect complex biological systems confronted with diverse multiple interactions and dependencies between tree size variables. Therefore, full-scale analysis of dependencies between size components requires the use of the multivariable information measures. The inference of tree structure evolution, defined by Equation (1), is related to the estimation of the information flow between tree size variables. Entropy is a useful concept to measure the uncertainty in the multivariate stochastic systems and it can be applied to measure multivariate dependences between tree size variables. An information theoretic approach assumes that the development of tree size components will exhibit some dependencies among them, and therefore such statistical dependencies among tree size components can be used to construct the new theoretical growth and yield models. Central to this area is to determine the best relationship of target variable with one another of tree size variables by using an entropy-based technique or more generally to decide whether a subset of multiple tree size components is interdependent. This study will focus on the amount of information transmitted by a set of tree size variables. The simplest information theory measure between two variables is the interaction (mutual) information, which defines the information contained in one variable about another, defined by McGill (1954) [38], can also be interpreted in terms of the Shannon entropy:

I (X_{s}, X_{u}, t) = H (X_{s}, t) + H (X_{u}, t) - H (X_{s}, X_{u}, t), 1 \leq s, u \leq 4

(45)

The definition of the differential Shannon entropy of a stochastic process directly follows that of a continuous random variable. Differential entropy cannot represent the uncertainty of continuous random processes and does not have the point of information. However, mutual information retains its interpretability in the continuous case.

Since the random vector (X₁, X₂, X₃, X₄) is lognormally distributed and for fixed effect scenario the random effects,

φ_{i}

,

i = 1, \dots, M

, are assumed to be equal its mean value

Ε (φ_{i}) = 0

,

μ_{s}^{i} (t) \equiv μ_{s} (t)

,

1 \leq s \leq 4

, moreover, taking into account stochastic representations of log-skew elliptical random vectors [39], the expressions for univariate and multivariate Shannon entropies (measured in nats) take the following forms [40]:

H (X_{s}, t) \equiv - \int_{0}^{+ \infty} f (x_{s}, t | {\hat{θ}}_{s}, 0) \ln (f (x_{s}, t | {\hat{θ}}_{s}, 0)) d x_{s} = \frac{1}{2} \ln (2 π e σ_{s s} (t)) + μ_{s} (t)

(46)

1 \leq s \leq 4, {\hat{θ}}_{s} = {{\hat{α}}_{s}, {\hat{β}}_{s}, {\hat{γ}}_{s}, {\hat{σ}}_{s s}}

\begin{array}{l} H (X_{s}, X_{u}, t) \equiv - \int_{0}^{+ \infty} \int_{0}^{+ \infty} f (x_{s}, x_{u}, t | {\hat{θ}}_{s u}, 0) \ln (f (x_{s}, x_{u}, t | {\hat{θ}}_{s u}, 0)) d x_{s} d x_{u} \\ = \frac{1}{2} \ln (| Σ_{s u} (t) |) + \ln (2 π e) + μ_{s} (t) + μ_{u} (t) \end{array}

(47)

1 \leq s, u \leq 4, {\hat{θ}}_{s u} = {{\hat{α}}_{s}, {\hat{β}}_{s}, {\hat{γ}}_{s}, {\hat{σ}}_{s s}, {\hat{α}}_{u}, {\hat{β}}_{u}, {\hat{γ}}_{u}, {\hat{σ}}_{u u}, {\hat{σ}}_{s u}}

\begin{array}{l} H (X_{s}, X_{u}, X_{z}, t) \equiv - \int_{0}^{+ \infty} \int_{0}^{+ \infty} \int_{0}^{+ \infty} f (x_{s}, x_{u}, x_{z}, t | {\hat{θ}}_{s u z}, 0) \ln (f (x_{s}, x_{u}, x_{z}, t | {\hat{θ}}_{s u z}, 0)) d x_{s} d x_{u} d x_{z} \\ = \frac{1}{2} \ln (| Σ_{s u z} (t) |) + \frac{3}{2} \ln (2 π e) + μ_{s} (t) + μ_{u} (t) + μ_{z} (t) \end{array}

(48)

1 \leq s, u, z \leq 4, {\hat{θ}}_{s u} = {{\hat{α}}_{s}, {\hat{β}}_{s}, {\hat{γ}}_{s}, {\hat{σ}}_{s s}, {\hat{α}}_{u}, {\hat{β}}_{u}, {\hat{γ}}_{u}, {\hat{σ}}_{u u}, {\hat{α}}_{z}, {\hat{β}}_{z}, {\hat{γ}}_{z}, {\hat{σ}}_{z z}, {\hat{σ}}_{s u}, {\hat{σ}}_{s z}, {\hat{σ}}_{u z}}

\begin{array}{l} H (X_{1}, X_{2}, X_{3}, X_{4}, t) \equiv - \int_{0}^{+ \infty} \int_{0}^{+ \infty} \int_{0}^{+ \infty} \int_{0}^{+ \infty} f (x_{1}, x_{2}, x_{3}, x_{4}, t | \hat{θ}, 0) \ln (f (x_{1}, x_{2}, x_{3}, x_{4}, t | {\hat{θ}}_{s u z}, 0)) d x_{1} d x_{2} d x_{3} d x_{4} \\ = \frac{1}{2} \ln (| Σ (t) |) + 2 \ln (2 π e) + μ_{1} (t) + μ_{2} (t) + μ_{3} (t) + μ_{4} (t) \end{array}

(49)

\hat{θ} = {{\hat{α}}_{1}, {\hat{β}}_{1}, {\hat{γ}}_{1}, {\hat{σ}}_{11}, {\hat{α}}_{2}, {\hat{β}}_{2}, {\hat{γ}}_{2}, {\hat{σ}}_{22}, {\hat{α}}_{3}, {\hat{β}}_{3}, {\hat{γ}}_{3}, {\hat{σ}}_{33}, {\hat{α}}_{4}, {\hat{β}}_{4}, {\hat{γ}}_{4}, {\hat{σ}}_{44}, {\hat{σ}}_{12}, {\hat{σ}}_{13}, {\hat{σ}}_{14}, {\hat{σ}}_{23}, {\hat{σ}}_{24}, {\hat{σ}}_{34}}

As we can see from Equation (45), the mutual information, I, is calculated directly by summing the individual entropies and subtracting the joint entropy. Mutual information, I, between two random variables, X_s and X_u, compares the uncertainty of measuring variables jointly with the uncertainty of measuring the two variables independently, identifies nonlinear dependence between two variables [41,42,43], and is non-negative and symmetrical. A generalization of bivariate mutual information to more than two variables have been analyzed in few different scenarios [20,21,41,42,43]. A direct multivariate extension of bivariate mutual information expressed by Equation (45) to n variables X₁, X₂, and X_n is named as the multi-information [44,45], also known as total correlation, and is defined by:

Ω (X_{1}, X_{2}, \dots, X_{n}, t) = \sum_{i = 1}^{n} H (X_{i}, t) - H (X_{1}, X_{2}, \dots, X_{n}, t) .

(50)

The multi-information is always non-negative and a near-zero value indicates that the variables are essentially statistically independent. Two special cases of mutual information take the normalized forms, respectively [46,47,48]:

N I_{\min} (X_{s}, X_{u}, t) = \frac{I (X_{s}, X_{u}, t)}{\max (H (X_{s}, t), H (X_{u}, t))}, 1 \leq s, u \leq 4

(51)

N I_{D} (X_{s}, X_{u}, t) = 1 - \frac{I (X_{s}, X_{u}, t)}{H (X_{s}, X_{u}, t)}, 1 \leq s, u \leq 4

(52)

Next normalized variant of the mutual information is provided by the correlation coefficient in the following form [49]:

N I_{C} (X_{s}, X_{u}, t) = \sqrt{2 - \frac{2 \cdot I (X_{s}, X_{u}, t)}{H (X_{s}, t) + H (X_{u}, t)}}, 1 \leq s, u \leq 4

(53)

A simple generalization of the normalized mutual information, defined by Equations (51)–(53), for three variables with the target variable X_s (s = 1, …, 4) takes the following forms:

N I_{\min}^{2} (X_{s}, X_{u}, X_{z}, t) = \frac{H (X_{s}, t) + H (X_{u}, X_{z}, t) - H (X_{s}, X_{u}, X_{z}, t)}{\max (H (X_{s}, t), H (X_{u}, X_{z}, t))}, u, z \in {1, 2, 3, 4} / {s},

(54)

N I_{D}^{2} (X_{s}, X_{u}, X_{z}, t) = \frac{H (X_{s}, t) + H (X_{u}, X_{z}, t) - H (X_{s}, X_{u}, X_{z}, t)}{H (X_{s}, X_{u}, X_{z}, t)}, u, z \in {1, 2, 3, 4} / {s}

(55)

\begin{matrix} N I_{C}^{2} (X_{s}, X_{u}, X_{z}, t) = \sqrt{2 - \frac{2 (H (X_{s}, t) + H (X_{u}, X_{z}, t) - H (X_{s}, X_{u}, X_{z}, t))}{H (X_{s}, t) + H (X_{u}, X_{z}, t)}}, \\ u, z \in {1, 2, 3, 4} / {s} \end{matrix}

(56)

Generalized forms of mutual information to more than two variables are called interaction information [21,47]. The relationship between multi-information and interaction information for the trivariate and 4-variate cases are defined in the following forms [21]:

I (X_{1}, X_{2}, X_{3}, t) = \sum_{s > u} I (X_{s}, X_{u}, t) - Ω (X_{1}, X_{2}, X_{3}, t),

(57)

I (X_{1}, X_{2}, X_{3}, X_{4}, t) = \sum_{z > s > u} I (X_{s}, X_{u}, X_{z}, t) - \sum_{s > u} I (X_{s}, X_{u}) + Ω (X_{1}, X_{2}, X_{3}, X_{4}, t) .

(58)

Providing that the target variable X_s (

1 \leq s \leq 4

) is added to the set of

ν_{n - 1} = (X_{1}, ...., X_{s - 1}, X_{s + 1}, ...., X_{n})

variables, the differential interaction information [21,47] is defined as:

Δ_{n} (ν_{n - 1}; X_{s}, t) = I (ν_{n}, t) - I (ν_{n - 1}, t), ν_{n} = (X_{1}, \dots, X_{n})

(59)

In consequence of Equations (45), (46) and (59), the deltas for two variables

ν_{2} = (X_{s}, X_{u})

,

1 \leq s \leq 4

,

u \in {1, 2, 3, 4} / {s}

(the target variable X_s) are defined as:

Δ_{2} (ν_{1}; X_{s}, t) = I (ν_{2}, t) - H (X_{s}, t), ν_{2} = (X_{s}, X_{u}), ν_{1} = (X_{u})

(60)

for three variables

ν_{3} = (X_{s}, X_{u}, X_{z})

,

1 \leq s \leq 4

,

u, z \in {1, 2, 3, 4} / {s}

(the target variable X_s) are defined as:

Δ_{3} (ν_{2}; X_{s}, t) = I (ν_{3}, t) - I (ν_{2}, t), ν_{2} = (X_{u}, X_{z})

(61)

for four variables

ν_{4} = (X_{s}, X_{u}, X_{z}, X_{y})

1 \leq s \leq 4

,

u, z, y \in {1, 2, 3, 4} / {s}

(the target variable X_s) are defined as:

Δ_{4} (ν_{3}; X_{s}, t) = I (ν_{4}, t) - I (ν_{3}, t), ν_{3} = (X_{u}, X_{z}, X_{y})

(62)

4. Discussion

Traditionally, the used statistical metrics for goodness-of-fit linear and nonlinear regression models mostly reflect only fitting criteria (not goodness of fit), which was used in an optimization process to get the best-fit parameters. For example, the coefficient of determination cannot determine whether the parameter estimates and predictions are biased. Similarly, a low value of the coefficient of determination can produce a good model. Consequently, the best metrics possessed model is not necessarily the one that fits best the data. Evaluation of the model fit within information measures relies on the detection of variable dependence, estimation of the significance of such dependence and inference of the functional form of the dependence [21,47]. Moreover, information measures operate on probability distributions rather than directly on data. In this study, the functional forms of inter-variable relationships are deducible using marginal densities (Equations (7), (8), (14), (15), (18) and (19)) of the 4-variate probability density function which is a solution of a diffusion process defined by Equation (1). In this study, the problem of inter-variable dependencies and correlations of new developed functional forms of tree size variable dynamic is analyzed by using entropy-based measures like interaction information, normalized interaction information, multi-information and differential interaction information (see Equations (45)–(62)). The Shannon entropies of the evolution of tree diameter, height, crown base height and crown width in univariate, bivariate, trivariate and four-variate cases are graphically charted in Figure 2.

It is evident that, to some extent, entropy can be viewed as the amount of information that can be gathered through observed dataset. It is understandable that the lower is the Shannon entropy of the tree size variable the less information about the evolution of a tree we are missing and providing more information about the tree development. Figure 2 shows that for all scenarios (univariate, bivariate, trivariate and four-variate) the Shannon entropy increases against the time (age). Hence, an information available about the tree development is actually losing with acceding a tree age. Moreover, the differences of the Shannon entropy (uncertainty measures) between different scenarios of tree size variables can be interpreted as an information gain or loss. It is important to note, if a tree size variable have a small single uncertainty measure, then contribution to the multivariate entropy turns out to be negligible. The univariate probability density function of the diameter, defined by Equations (7) and (8), produced the supreme entropy relationship whereas the univariate probability density function of the crown width produced the least entropy relationship. The bivariate probability density function of the diameter and height, defined by Equations (14) and (15), produced the supreme entropy relationship whereas the bivariate probability density function of the crown base height and crown width produced the least entropy relationship. Lastly, the trivariate probability density function of the diameter, height and crown base height, defined by Equations (18) and (19), produced the supreme entropy relationship whereas the trivariate probability density function of the height, crown base height and crown width produced the minimal entropy relationship.

A complex way of quantifying statistical dependencies between tree size variables comes from the definition of multi-information, which is defined as the difference between the sum of single entropies for each tree size variables and the joint entropy of all tree size variables. The multi-information defined by Equation (55) quantifies the total amount of information carried by correlations between the variables. As a measure of overall multivariable dependence or redundancy, this quantity goes to zero if all variables are independent. The information measure defined by Equations (45) and (50) is named as the total multi-information of two and n random variables, respectively. Figure 3 shows that multi-information is positive, remains stable against the age and gathers bigger values by increasing the number of tree size variables. It is obvious that the multi-information is equal to the mutual information when n = 2. If we range all relationships by using multi-information measure, then, for example, examine Figure 3 we could choose the most important predictor variables for quantifying other response variable. It follows that for a single response tree size variable—diameter (black, blue and green curves) the superior relationship could be defined using a height (black—diameter and height) as a predictor variable; for the height (black, red and cyan curves) the superior relationship could be defined using a diameter (black—diameter and height); for the crown base height (blue, cyan and pink curves) the superior relationship could be defined using height (cyan—height and crown base height); and for the crown width (green, pink and cyan curves) the superior relationship could be defined using a diameter (green—diameter and crown diameter). However, such ranging procedure is not successful, as the low joint entropy value of n variables will also produce low the value of multi-information even if the all the variables are perfectly related. Figure 3 shows that the amount of multi-information is apparently constant via age.

The concept of causality is commonly understandable as the capacity of one variable to influence another. As was noted by Wiener [50] for two simultaneously measured variables, ‘if we can predict the first variable better by using the past information from the second one than by using the information without it, then we call the second variable causal to the first one’. In Wiener’s formulation, the causality is a statistical concept that is based on prediction. Consequently, the best-ranked relationship is not necessarily the one that fits best the data, but that carries superior causality information. Recognizing the statement about causality as an information [51] about the effect of nonlinear relationship we can examine and compare new developed nonlinear relationships by using intersection information measures defined by Equations (45) and (51)–(62). Modeling of the evolution of tree size variables requires better understanding which predictor exerts primary control on response tree size variable. Mutual information quantifies the amount of information that one tree size variable reveals about another and thus the strength of their codependency. Interconnecting causality with mutual information we can measure how much knowing one of these tree size variables reduces uncertainty about the other. If two tree size variables are independent, then knowing single tree size variable does not give any information about another tree size variable and vice versa, so their mutual information is zero. Unfortunately, the value of mutual information depends on the absolute magnitude of joint entropy between the two chosen tree size variables and is not appropriate to use directly for relative comparisons. Therefore, for the ranging of all developed models is advisable to use normalized interaction measures, defined by Equations (51)–(56). The higher normalized information measure values of the bivariate and trivariate mutual information (Equations (51), (53), (54) and (56)) show stronger relationship. The normalized mutual information interpreted in typical distance metric form (see Equations (52) and (55)) is closer to zero in case of a stronger similarity. Figure 4 presents the evolution of the bivariate normalized mutual information defined by Equations (51)–(56). Eventually, the results presented in Figure 4, provide strong evidence that information measures are powerful tools to quantify and explain the relevance of different nonlinear relationships for tree size variables modeling. It follows, that for the quantifying tree diameter relationship against a single predictor variable it must be the crown width or the height, as the corresponding bivariate normalized mutual information curves are sufficiently close or intersects. Following this overall result for the quantifying tree height relationship against a single predictor variable it must be the crown base height or the diameter, as the corresponding bivariate normalized mutual information curves are sufficiently close or intersects. For the quantifying tree crown base height relationship against a single predictor variable it must be the height and, eventually, for the quantifying tree crown base height relationship best a single predictor variable must be the diameter.

The evolution of the trivariate normalized mutual information defined by Equations (54)–(56) is presented in Figure 5. In parallel with bivariate normalized mutual information scenario we can compare nonlinear relationships using trivariate normalized mutual information (see Equations (54)–(56)). Consequently, for the quantifying of tree diameter relationship against two predictor variables it must be the crown base height and crown width. For the quantifying of tree height relationship against two predictor variables it must be the diameter and crown base height. Moreover, for the quantifying of tree crown base height relationship against two predictor variables it must be the height and crown width. Eventually, for the quantifying of tree crown width relationship the best two predictor variables must be the diameter and crown base height.

Given the initial state of the tree size variables, the solution of the SDE (1) determines the dynamic of the univariate, marginal and conditional probability density functions of state variables. These density functions of tree size variables are updated at each age. The dynamic of the univariate marginal and conditional density functions of tree size variables provides the updated prediction by using the mean and the conditional mean trend. For the test on predictive capacity of new derived nonlinear relationships previously were discussed the concepts of the normalized mutual information. To make SDE models comparison more precise, the difference of the intersection information (deltas) defined by Equations (58)–(62) prevails over previous discussed decisions. Figure 6 presents the evolution of the difference of the intersection information (deltas). For tree growth modeling the general problem is to guarantee the maximum degree of dependence to be considered and to determine the number of the best predictor variables involved. The three non-linear scenarios (one, two and three predictors) for the mean trend of a response variable were developed in the present study. The further discussion deals with the adequacy of the deltas defined by Equations (59)–(62) in describing the dependence and causality of the tree size variables. The non-linear models showed that more of predictor variables is included the higher deltas value is achieved. Therefore, the nonlinear models with three predictors (see Figure 6 the third column) provided the best reveal of dependence of all response variables (diameter, height, crown base height and crown width) due to the higher values of deltas than other models (one or two predictors). The shape of the deltas curves in the first column of Figure 6 showed that height reveals the most part of dependence between diameter and other tree size variables (but remains very close result when used crown width as a predictor variable). The diameter reveals the most part of dependence between height and other tree size variables; the height reveals the most part of dependence between crown base height and other tree size variables; eventually, the diameter reveals the most part of dependence between crown width and other tree size variables.

In the second column of Figure 6 the deltas curve showed that the height and crown width reveal the most part of dependence between diameter and other two tree size variables. The diameter and crown base height reveal the most part of dependence between height and other two tree size variables; the height and crown width reveal the most part of dependence between crown base height and other two tree size variables; eventually, the diameter and crown base height reveal the most part of dependence between crown width and other two tree size variables.

The ranking and selection of the mean tree size curves

m_{s}^{i} (t)

,

m_{s}^{i} (t, x_{u})

,

m_{s}^{i} (t, x_{u}, x_{z})

and

m_{s}^{i} (t, x_{u}, x_{z}, x_{y})

can be alternately performed using basic statistical measures, for example, mean bias (MB), mean absolute bias (MAB), root mean square error (RMSE), adjusted coefficient of determination R², and Akaike’s information criterion (AIC) [6,17]. Statistical measures and the ranking for both fixed- and mixed effects scenarios presented in Table 2. Therefore, from a statistical point of view (see Table 2), for the fixed effects scenario all relationships attained lower values of statistical indexes than the mixed effects scenario models.

New developed nested statistical models, defined as a set of probability distributions on the sample space (dataset), support growth and yield modeling by facilitating individualized outcomes conditional on predictor variables. The goodness-of-fit of a model to dataset evaluated through the use of numerical statistical measures and presented in Table 2 provides summary measures of the overall accuracy of the predictions. The goodness-of-fit of new developed models to data assessed by using numerical statistical measures MB, MAB, RMSE, R² and AIC presented in Table 2, and information measures defined by Equations (45)–(62) and visualized in Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5 showed very similar results. This study presents that the interaction information measure approach is particularly powerful, but has less general applicability because of the complicated calculations required, which are not always presently solvable.

Forests statisticians who use mathematics but are not completely at ease with abstract mathematical notations and formulas presented in Equations (1)–(62) can often better understand the new derived mathematical models if these models are visually embodiment. Using the univariate marginal distributions (Equations (7) and (8)), Figure 7 shows the mean, mode, median and both quartiles trends via the mean stand age (in the mixed-effect scenario for randomly selected two stands). The implementation of abstract mathematical Equations (9)–(12) visually reveals nonsymmetry that was not apparent from the observed discrete datasets. Just such a hidden nonsymmetry, disclosed by visualization, confirmed the fact that tree size variables are positively skewed.

Funding

This research received no external funding.

Acknowledgments

The author is grateful to the Editor and two anonymous reviewers for handling the full submission of the manuscript.

Conflicts of Interest

The author declares no conflict of interest.

References

Sharma, R.P.; Vacek, Z.; Vacek, S.; Kučera, M. A Nonlinear Mixed-Effects Height-to-Diameter Ratio Model for Several Tree Species Based on Czech National Forest Inventory Data. Forests 2019, 10, 70. [Google Scholar] [CrossRef]
Gangying, H.; Zhang, G.; Zhao, Z.; Yang, Y. Methods of Forest Structure Research: A Review. Curr. For. Rep. 2019, 5, 69–78. [Google Scholar]
Román-Román, P.; Serrano-Pérez, J.J.; Torres-Ruiz, F. A Note on Estimation of Multi-Sigmoidal Gompertz Functions with Random Noise. Mathematics 2019, 7, 541. [Google Scholar] [CrossRef]
Nucci, M.C.; Sanchini, G. Noether Symmetries Quantization and Superintegrability of Biological Models. Symmetry 2016, 8, 155. [Google Scholar] [CrossRef]
Garcıa, O. Forest Stands as Dynamical Systems: An Introduction. Mod. Appl. Sci. 2013, 7, 32–38. [Google Scholar] [CrossRef]
Rupšys, P.; Petrauskas, E. A Linkage among Tree Diameter, Height, Crown Base Height, and Crown Width 4-variate Distribution and Their Growth Models: A 4-variate Diffusion Process Approach. Forests 2017, 8, 479. [Google Scholar] [CrossRef]
Rupšys, P.; Petrauskas, E. A New Paradigm in Modelling the Evolution of a Stand via the Distribution of Tree Sizes. Sci. Rep. 2017, 7, 12154. [Google Scholar] [CrossRef] [PubMed]
Rupšys, P. Modeling Dynamics of Structural Components of Forest Stands Based on Trivariate Stochastic Differential Equation. Forests 2019, 10, 506. [Google Scholar] [CrossRef]
Duan, G.; Gao, Z.; Wang, Q.; Fu, L. Comparison of Different Height–Diameter Modelling Techniques for Prediction of Site Productivity in Natural Uneven-Aged Pure Stands. Forests 2018, 9, 29. [Google Scholar] [CrossRef]
Román-Román, P.; Serrano-Pérez, J.J.; Torres-Ruiz, F. Some Notes about Inference for the Lognormal Diffusion Process with Exogenous Factors. Mathematics 2018, 6, 85. [Google Scholar] [CrossRef]
Rupšys, P.; Petrauskas, E. Evolution of Bivariate Tree Diameter and Height Distribution via Stand Age: Von Bertalanffy Bivariate Diffusion Process Approach. J. Forest Res. Jap. 2019, 24, 16–26. [Google Scholar] [CrossRef]
Di Crescenzo, A.; Paraggio, P. Logistic Growth Described by Birth-Death and Diffusion Processes. Mathematics 2019, 7, 489. [Google Scholar] [CrossRef]
Rupšys, P. Time Delay Stochastic Logistic Growth Laws in Single-Species Population Growth Modeling. In Proceedings of the 4th WSEAS International Conference on Mathematical Biology and Ecology, Acapulco, Mexico, 25–27 January 2008; pp. 29–34. [Google Scholar]
Muñoz-Cobo, J.L.; Berna, C. Chemical Kinetics Roots and Methods to Obtain the Probability Distribution Function Evolution of Reactants and Products in Chemical Networks Governed by a Master Equation. Entropy 2019, 21, 181. [Google Scholar] [CrossRef]
Visalga, G.; Rupšys, P.; Petrauskas, E. Influence of Noise on Decay Predictions in Standing Trees. AIP Conf. Proc. 2017, 1895, 030006. [Google Scholar]
Cai, W.; Pan, J. Stochastic Differential Equation Models for the Price of European CO2 Emissions Allowances. Sustainability 2017, 9, 207. [Google Scholar] [CrossRef]
Rupšys, P. New Insights into Tree Height Distribution Based on Mixed Effects Univariate Diffusion Processes. PLoS ONE 2016, 11, e0168507. [Google Scholar] [CrossRef]
Rupšys, P. The Use of Copulas to Practical Estimation of Multivariate Stochastic Differential Equation Mixed Effects Models. AIP Conf. Proc. 2015, 1684, 080011. [Google Scholar]
Ju, B.; Zhang, H.; Liu, Y.; Pan, D.; Zheng, P.; Xu, L.; Li, G. A Method for Detecting Dynamic Mutation of Complex Systems Using Improved Information Entropy. Entropy 2019, 21, 115. [Google Scholar] [CrossRef]
Sakhanenko, N.A.; Galas, D.J. Biological Data Analysis as an Information Theory Problem: Multivariable Dependence Measures and the Shadows Algorithm. J. Comput. Biol. 2015, 22, 1005–1024. [Google Scholar] [CrossRef] [Green Version]
Galas, D.J.; Sakhanenko, N.A. Symmetries among Multivariate Information Measures Explored Using Möbius Operators. Entropy 2019, 21, 88. [Google Scholar] [CrossRef]
Hara, T. A Stochastic Model and the Moment Dynamics of the Growth and Size Distribution in Plant Populations. J. Theor. Biol. 1984, 109, 173–190. [Google Scholar] [CrossRef]
Kohyama, T.; Hara, T. Frequency Distribution of Tree Growth Rate in Natural Forest Stands. Ann. Bot. London 1989, 64, 47–57. [Google Scholar] [CrossRef]
Kohyama, T.S.; Potts, M.D.; Kohyama, T.I.; Kassim, A.R.; Ashton, P.S. Demographic Properties Shape Tree Size Distribution in a Malaysian Rain Forest. Am. Nat. 2015, 185, 367–379. [Google Scholar] [CrossRef] [Green Version]
Hozumi, K.; Shinozaki, K.; Tadaki, Y. Studies on the Frequency Distribution of the Weight of Individual Trees in a Forest Stand I. A New Approach Toward the Analysis of the Distribution Function and the-3/2th Power Distribution. Jpn. J. Ecol. 1968, 18, 10–20. [Google Scholar]
Rupšys, P. Generalized Fixed-Effects and Mixed-Effects Parameters Height–Diameter Models with Diffusion Processes. Int. J. Biomath. 2015, 8, 1550060. [Google Scholar] [CrossRef]
Rupšys, P. Height–Diameter Models with Stochastic Differential Equations and Mixed-Effects Parameters. J. For. Res. Jap. 2015, 20, 9–17. [Google Scholar] [CrossRef]
Rupšys, P. Stochastic Mixed-Effects Parameters Bertalanffy Process, with Applications to Tree Crown Width Modeling. Math. Probl. Eng. 2015, 2015, 375270. [Google Scholar] [CrossRef]
Li, C. Maximum-Likelihood Estimation for Diffusion Processes via Closed-Form density Expansions. Ann. Statist. 2013, 41, 1350–1380. [Google Scholar] [CrossRef]
García, O. Estimating Reducible Stochastic Differential Equations by Conversion to a Least-Squares Problem. Comput. Stat. 2019, 34, 23–46. [Google Scholar] [CrossRef]
Picchini, U.; Gaetano, A.; Ditlevsen, S. Stochastic Differential Mixed-Effects Models. Scand. J. Stat. 2010, 37, 67–90. [Google Scholar] [CrossRef]
Itô, K. On stochastic processes. Jap. J. Math. 1942, 18, 261–301. [Google Scholar] [CrossRef]
Uhlenbeck, G.E.; Ornstein, L.S. On the Theory of Brownian Motion. Phys. Rev. 1930, 36, 823–841. [Google Scholar] [CrossRef]
Garvey, P.R.; Book, S.A.; Covert, R.P. Probability Methods for Cost Uncertainty Analysis: A Systems Engineering Perspective, 2nd ed.; Chapman and Hall/CRC: New York, NY, USA, 2016. [Google Scholar]
Garvey, P.R. Garvey: A Family of Joint Probability Models for Cost and Schedule Uncertainties. J. Cost Anal. 1995, 12, 156–200. [Google Scholar] [CrossRef]
Joe, H. Accuracy of Laplace Approximation for Discrete Response Mixed Models. Comput. Stat. Data An. 2008, 52, 5066–5074. [Google Scholar] [CrossRef]
Monagan, M.B.; Geddes, K.O.; Heal, K.M.; Labahn, G.; Vorkoetter, S.M.; Mccarron, J. Maple Advanced Programming Guide; Maplesoft: Waterloo, ON, Canada, 2007. [Google Scholar]
McGill, W. Multivariate Information Transmission. Psychometrika 1954, 19, 97–116. [Google Scholar] [CrossRef]
Marchenko, Y.V.; Genton, M.G. Multivariate Log-Skew-Elliptical Distributions with Applications to Precipitation Data. Environmetrics 2010, 21, 318–340. [Google Scholar] [CrossRef]
De Queiroz, M.M.; Silva, R.W.; Loschi, R.H. Shannon Entropy and Kullback–Leibler Divergence in Multivariate Log Fundamental Skew-Normal and Related Distributions. Can. J. Stat. 2016, 44, 219–237. [Google Scholar] [CrossRef]
Eskandarzadeh, M.; Di Crescenzo, A.; Tahmasebi, S. Cumulative Measure of Inaccuracy and Mutual Information in k-th Lower Record Values. Mathematics 2019, 7, 175. [Google Scholar] [CrossRef]
Li, W. Mutual Information Functions Versus Correlation Functions. J. Stat. Phys. 1990, 60, 823–837. [Google Scholar] [CrossRef]
Wing, S.; Johnson, J.R. Applications of Information Theory in Solar and Space Physics. Entropy 2019, 21, 140. [Google Scholar] [CrossRef]
Watanabe, S. Information Theoretical Analysis of Multivariate Correlation. IBM J. Res. Dev. 1960, 4, 66–82. [Google Scholar] [CrossRef]
Galas, D.J.; Dewey, G.; Kunert-Graf, J.; Sakhanenko, N.A. Expansion of the Kullback-Leibler Divergence, and a New Class of Information Metrics. Axioms 2017, 6, 8. [Google Scholar] [CrossRef]
Maes, F.; Collignon, A.; Vandermeulen, D.; Marchal, G.; Suetens, P. Multimodality Image Registration by Maximization of Mutual Information. IEEE Trans. Med. Imag. 1997, 16, 187–198. [Google Scholar] [CrossRef]
Sakhanenko, N.A.; Kunert-Graf, J.; Galas, D.J. The Information Content of Discrete Functions and Their Application in Genetic Data Analysis. J. Comput. Biol. 2017, 24, 1153–1178. [Google Scholar] [CrossRef] [Green Version]
Kvålseth, T.O. On Normalized Mutual Information: Measure Derivations and Properties. Entropy 2017, 19, 631. [Google Scholar] [CrossRef]
Cahill, N.D. Normalized Measures of Mutual Information with General Definitions of Entropy for Multimodal Image Registration. Lect. Notes Comput. Sci. 2010, 6204, 258–268. [Google Scholar]
Wiener, N. The Theory of Prediction. In Modern Mathematics for Engineers; McGraw-Hill: New York, NY, USA, 1956. [Google Scholar]
Granger, C.W.J. Time Series Analysis, Cointegration, and Applications. Am. Econ. Rev. 2004, 94, 421–425. [Google Scholar] [CrossRef]

Figure 1. Observed datasets.

Figure 2. Evolution of the Shannon entropy. (a) Univariate: black—diameter; blue—height; green—crown base height; red—crown width. (b) Bivariate: black—diameter and height; blue—diameter and crown base height; green—diameter and crown width; red—height and crown base height; cyan—height and crown width; pink—crown base height and crown width. (c) Trivariate: black—diameter, height and crown base height; blue—diameter, height and crown width; green—diameter, crown base height and crown width; red—height, crown base height and crown width. (d) 4-variate: diameter, height, crown base height and crown width.

Figure 3. Evolution of multi-information. (a) Bivariate; black—diameter and height; blue—diameter and crown base height; green—diameter and crown width; red—height and crown base height; cyan—height and crown width; pink—crown base height and crown width. (b) Trivariate; black—diameter, height and crown base height; blue—diameter, height and crown width; green—diameter, crown base height and crown width; red—height, crown base height and crown width. (c) 4-variate, diameter, height, crown base height and crown width.

Figure 4. Evolution of bivariate normalized mutual information (Equations (51)–(53)). Left column–Equation 51. Middle column–Equation (52). Right column—Equation (53). First row for diameter as a response variable and predictors: black—height; blue—crown base height; green—crown width. Second row for height as a response variable and predictors: black—diameter; red—crown base height; cyan—crown width. Third row for crown base height as a response variable and predictors: blue—diameter, red—height, pink—crown width. Fourth row for crown width as a response variable and predictors: green—diameter, cyan—height, pink—crown base height.

Figure 5. Evolution of trivariate normalized interaction information (Equations (54)–(56)). Left column—Equation (54). Middle column—Equation (55). Right column—Equation (56). First row for diameter as a response variable and predictors: black—height and crown base height; blue—crown base height and crown width; green—crown base height and crown width. Second row for height as a response variable and predictors: black—diameter and crown base height; blue—diameter and crown width; red—crown base height and crown width. Third row for crown base height as a response variable and predictors: black—diameter and height; green—diameter and crown width; red—height and crown width. Fourth row for crown width as a response variable and predictors: blue—diameter and crown base height; green—diameter and crown base height; red—height and crown base height.

Figure 6. Evolution of deltas (Equations (60)–(62)). Left column—Equation (60). Middle column—Equation (61). Right column—Equation (62). First row demonstrates for diameter as a target variable. Second demonstrates for height as a target variable. Third row demonstrates for crown base height as a target variable. Fourth row demonstrates for crown width as a target variable. (a1,b1,c1,d1): black—diameter and height; blue—diameter and crown base height; green—diameter and crown width; red—height and crown base height; cyan—height and crown width; pink—crown base height and crown width. (a2,b2,c2,d2): black—height, diameter and crown base height; blue—height, diameter and crown width; green—diameter, crown base height and crown width; red—height, crown base height and crown width. (a3,b3,c3,d3): all tree size variables.

Figure 7. Evolution of the marginal mean, mode, median and both quartiles (Equations (9)–(12)) within the observed datasets: mean—solid line; median—dotted line; mode—dashed line; quartiles—dashed–dotted line; first column—fixed effect scenario; second column—mixed effect scenario and first randomly selected stand; and third column—mixed effect scenario and second randomly selected stand.

Table 1. Estimates of fixed effect parameters.

Model

Parameters of Drift Term

α₁

β₁

γ₁

α₂

β₂

γ₂

α₃

β₃

γ₃

α₄

β₄

γ₄

Fixed

1.3869

0.0269

1.1267

3.1491

0.0441

0.7664

4.7343

0.0392

0.6427

0.3816

3.5 × 10⁻⁴

1.0015

Mixed

1.3878

0.0269

1.1265

3.1504

0.0441

0.7663

4.7215

0.0394

0.6436

0.3818

3.5 × 10⁻⁴

1.0015

Model

Parameters of Diffusion Term

σ₁₁

σ₁₂

σ₁₃

σ₁₄

σ₂₂

σ₂₃

σ₂₄

σ₃₃

σ₃₄

σ₄₄

Fixed

0.0019

9.0 × 10⁻⁴

5.4 × 10⁻⁴

0.0016

6.4 × 10⁻⁴

5.6 × 10⁻⁴

5.9 × 10⁻⁴

8.3 × 10⁻⁴

0.8 × 10⁻⁴

0.0023

Mixed

0.0019

9.6 × 10⁻⁴

6.2 × 10⁻⁴

0.0015

6.9 × 10⁻⁴

5.7 × 10⁻⁴

7.2 × 10⁻⁴

1.1 × 10⁻⁴

0.0023

Parameters of Random Effects

Mixed

σ₁

σ₂

σ₃

σ₄

0.0306

0.1200

0.2240

0.0257

Table 2. Goodness-of fit numerical statistical measures for the fixed- and mixed effects scenario models.

Predictors	Fixed Effect Scenario					Mixed Effect Scenario
Predictors	B (Rank)	MAB (Rank)	RMSE Rank)	R² (Rank)	AIC (Rank)	MB (Rank)	MAB (Rank)	RMSE (Rank)	R² (Rank)	AIC (Rank)
Diameter
(t)	−0.619 (7)	5.472 (8)	6.765 (8)	0.355 (8)	26967 (8)	−0.837 (8)	4.823 (8)	5.869 (8)	0.515 (8)	26306 (8)
(t,H)	0.097 (2)	3.350 (5)	4.292 (5)	0.741 (5)	24852 (5)	−0.048 (1)	2.956 (6)	3.784 (6)	0.798 (6)	24267 (6)
(t,CH)	−0.012 (1)	4.786 (7)	5.951 (7)	0.501 (7)	26374 (7)	−0.493 (6)	4.491 (7)	5.570 (7)	0.563 (7)	26068 (7)
(t,CW)	−0.734 (8)	3.380 (6)	4.304 (6)	0.739 (6)	24864 (6)	−0.551 (7)	2.852 (5)	3.744 (5)	0.803 (5)	24217 (5)
(t,H,CH)	−0.178 (4)	2.887 (4)	3.711 (4)	0.806 (4)	24184 (4)	−0.073 (2)	2.664 (4)	3.470 (4)	0.830 (4)	23974 (4)
(t,H,CW)	−0.165 (3)	2.204 (2)	2.892 (2)	0.881 (2)	23019 (2)	−0.083 (4)	1.974 (2)	2.623 (2)	0.903 (2)	22567 (2)
(t,CH,CW)	−0.223 (6)	2.707 (3)	3.584 (3)	0.819 (3)	24018 (3)	−0.261 (5)	2.600 (3)	3.446 (3)	0.833 (3)	23838 (3)
(t,H,CH,CW)	−0.217 (5)	2.135 (1)	2.797 (1)	0.890 (1)	22870 (1)	−0.081 (3)	1.942 (1)	2.590 (1)	0.906 (1)	22515 (1)
Height
(t)	−0.376 (7)	3.038 (8)	3.766 (8)	0.497 (8)	24238 (8)	−0.351 (8)	1.984 (8)	2.561 (8)	0.767 (8)	22433 (8)
(t,D)	−0.257 (6)	1.883 (5)	2.350 (5)	0.804 (5)	22045 (5)	−0.180 (5)	1.380 (4)	1.752 (4)	0.891 (4)	20680 (4)
(t,CH)	0.166 (5)	1.676 (4)	2.114 (4)	0.841 (4)	21553 (4)	−0.088 (4)	1.521 (5)	1.929 (5)	0.868 (5)	21129 (5)
(t,CW)	−0.461 (8)	2.709 (7)	3.349 (7)	0.602 (7)	23696 (7)	−0.331 (7)	1.703 (7)	2.235 (7)	0.823 (6–7)	21815 (6)
(t,D,CH)	0.065 (2)	1.029 (2)	1.312 (1–2)	0.939 (1–2)	19336 (1)	−0.048 (2)	0.939 (2)	1.215 (2)	0.948 (1–2)	18981 (1–2)
(t,D,CW)	−0.118 (4)	2.189 (6)	2.857 (6)	0.710 (6)	22962 (6)	−0.203 (6)	1.668 (6)	2.234 (6)	0.823 (6–7)	21819 (7)
(t,CH,CW)	0.060 (1)	1.293 (3)	1.678 (3)	0.900 (3)	20482 (3)	−0.082 (3)	1.219 (3)	1.592 (3)	0.910 (3)	20240 (3)
(t,D,CH,CW)	0.067 (3)	1.027 (1)	1.312 (1–2)	0.939 (1–2)	19345 (2)	−0.046 (1)	0.936 (1)	1.213 (1)	0.948 (1–2)	18981 (1–2)
Crown base height
(t)	−0.711 (7)	2.575 (8)	3.042 (8)	0.488 (8)	23243 (8)	−0.299 (7)	1.329 (4)	1.722 (5)	0.836 (5)	20595 (5)
(t,D)	−0.685 (6)	2.231 (6)	2.689 (6)	0.600 (6)	22674 (6)	−0.254 (6)	1.371 (8)	1.748 (6)	0.831 (6)	20668 (6)
(t,H)	−0.470 (4)	1.353 (4)	1.683 (4)	0.843 (4)	20492 (4)	−0.126 (3)	1.029 (3)	1.342 (3)	0.900 (3)	19437 (3)
(t,CW)	−0.724 (8)	2.558 (7)	3.025 (7)	0.494 (7)	23223 (7)	−0.300 (8)	1.330 (5)	1.718 (4)	0.837 (4)	20590 (4)
(t,D,H)	−0.369 (3)	1.154 (2)	1.456 (1)	0.883 (1)	19822 (2)	−0.135 (4)	1.375 (7)	1.770 (7)	0.827 (7)	20735 (7)
(t,D,CW)	−0.506 (5)	2.012 (5)	2.490 (5)	0.657 (5)	22321 (5)	−0.191 (5)	1.362 (7)	1.797 (8)	0.821 (8)	20805 (8)
(t,H,CW)	−0.326 (1–2)	1.181 (3)	1.514 (3)	0.873 (3)	20004 (3)	−0.081 (2)	0.948 (2)	1.264 (2)	0.912 (2)	19166 (2)
(t,D,H,CW)	−0.326 (1–2)	1.146 (1)	1.461 (2)	0.882 (2)	19485 (1)	−0.078 (1)	0.933 (1)	1.243 (1)	0.915 (1)	19096 (1)
Crown width
(t)	−0.009 (4)	0.884 (8)	1.108 (8)	0.180 (8)	18540 (8)	−0.091 (8)	0.834 (8)	1.038 (8)	0.281 (8)	18237 (7)
(t,D)	0.044 (8)	0.542 (4)	0.700 (4)	0.673 (4)	16402 (4)	−0.018 (1)	0.506 (4)	0.660 (4)	0.710 (4)	16129 (4)
(t,H)	0.036 (7)	0.770 (6)	0.983 (6)	0.355 (6)	17980 (6)	−0.055 (6)	0.720 (6)	0.916 (6)	0.440 (6)	17660 (6)
(t,CH)	0.001 (1)	0.878 (7)	1.102 (7)	0.189 (7)	18515 (7)	−0.090 (7)	0.831 (7)	1.037 (7)	0.282 (7)	18238 (8)
(t,D,H)	0.024 (6)	0.498 (3)	0.650 (3)	0.718 (3)	16058 (2)	−0.025 (4)	0.481 (3)	0.631 (3)	0.734 (3)	15929 (3)
(t,D,CH)	0.007 (2–3)	0.491 (2)	0.640 (1)	0.726 (1–2)	15989 (1)	−0.020 (2)	0.471 (2)	0.621 (2)	0.742 (2)	15859 (2)
(t,H,CH)	−0.010 (5)	0.667 (5)	0.855 (5)	0.512 (5)	17339 (5)	−0.040 (5)	0.638 (5)	0.835 (5)	0.535 (5)	17234 (5)
(t,D,H,CH)	0.007 (2–3)	0.490 (1)	0.641 (2)	0.726 (1–2)	16293 (3)	−0.021 (3)	0.469 (1)	0.618 (1)	0.745 (1)	15842 (1)

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rupšys, P. Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures. Mathematics 2019, 7, 761. https://doi.org/10.3390/math7080761

AMA Style

Rupšys P. Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures. Mathematics. 2019; 7(8):761. https://doi.org/10.3390/math7080761

Chicago/Turabian Style

Rupšys, Petras. 2019. "Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures" Mathematics 7, no. 8: 761. https://doi.org/10.3390/math7080761

APA Style

Rupšys, P. (2019). Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures. Mathematics, 7(8), 761. https://doi.org/10.3390/math7080761

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Understanding the Evolution of Tree Size Diversity within the Multivariate Nonsymmetrical Diffusion Process and Information Measures

Abstract

1. Introduction

2. Materials and Methods

3. Results

3.1. Marginal Distribution

3.2. Conditional Distributions

3.3. Maximum Likelihood Procedure

3.4. Random Effects Calibration

3.5. Estimating Results

3.6. Information Measures

4. Discussion

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI