4.2. Benefit Distribution Model of the Carrier of the Network Freight Platform
There are n agents involved in the benefit distribution, so the research object in this paper is to obtain the benefit distribution agent, and the optimization target Z = , i ∊ n, t represents the time variable.
(1) Contribution and contribution rate
In the preceding order distribution problem there are multiple subjects of interest, the network freight platform carrier drivers will receive the corresponding compensation after completing the order, which constitute the overall benefits of the network freight platform, but because each network freight platform carrier drivers make different contributions in the process of order distribution, so their expected benefits differ, where the contribution reference to the distribution costs in the preceding section, specifically The expression is as follows:
In the formula, represents the contribution value made by agenti in the order distribution process, represents the final transportation cost incurred by agenti in the order distribution problem, is the cost of carbon emissions and the weight coefficient corresponds to each index . It can be seen that the contribution value is directly proportional to the input cost.
The contribution rate is expressed by the proportion of the input of the stakeholders in the whole group, which is as follows:
In this action, represents the total contribution of all the online freight platform carrier drivers involved in order delivery. It can be seen that the contribution rate represents the agenti contribution ratio to some extent. Since the total contribution value of the entire order delivery remains constant, is mainly determined by the size of .
(2) Earnings and expected earnings
The benefits in the model are the benefits obtained by the network freight platform carrier drivers in the whole order distribution process. This value is given by the above order distribution model, and the expected benefits are equal to the product of the contribution rate and the total benefits of the group. The specific expression is as follows:
In the formula, is used to represent the expected benefit of agenti and represents the total revenue obtained by the online freight platform carrying drivers participating in the order delivery after the total order allocation in the previous article. Because the total return is a fixed value, it can be seen that agenti expected return is mainly determined by the contribution rate and proportional.
(3) Degree of expected benefit realization and relative deprivation
After (4)–(16), (4)–(17) and (4)–(18) calculations, the expected benefit realization degree of
agenti is further calculated. The relative deprivation sense
was also calculated for each
agent on the basis of
Ai, details are as follows:
In the formula, the degree of expected return realization is equal to divided by and multiplied by 100%, while and on the right side of the relative deprivation equation represent the mean value of the agent and the proportion of the higher agent in the whole group. It can be seen that the sense of relative deprivation is caused by the injustice of benefit distribution. When all the expected benefit realization degree of agent is equal, is equal to 0. At this time, the benefit distribution is the optimal distribution scheme for the ant colony division of labor model of group benefit distribution.
In benefit distribution, because it is redistributed, the total income of the group is constant, so once the
agent profit increases, there will be
agent profit reduction, for the benefit distribution model of environmental stimulus is divided into two kinds, namely respectively consider the interest increase and reduce the benefits of environmental stimulus value, The specific expression is as follows:
At the increase of the benefits allocated to the agenti, it is in a state of more contribution and less return. At this time, > , so is selected as the molecule divided by the sum of . In contrast, when the benefits allocated by the agenti are reduced, it is a state of less contribution and more return, and < . Therefore, was selected as the molecules divided by the sum of .
(4) Benefit realization capability and response threshold
In the order distribution, the benefit realization ability is reflected in the
agent, that is, “the greater the ability, the greater the responsibility”. Here, using the
agent, the response threshold represents the perception degree of the benefit subject to response to the benefit change, it sets the response threshold for the response increase and benefit reduction, as follows:
In the process of order transportation, the larger the vehicle agent, the more cargo it wants to transport, the more benefits it wants, and the more the unit transportation cost and carbon emission coefficient, the higher the consumption cost is. In conclusion, is the proportional relationship with , and the inverse relationships with and , is the benefit of the realization ability weight coefficient.
In the process of profit distribution, the more powerful the agent is, the more benefits it desires to get; on the contrary, the agent with weaker ability to realize benefits will have less desired benefits. Therefore, the larger the , the greater the possibility of increasing the response benefit and the smaller the response threshold is. On the contrary, if the is greater, the possibility of increasing the response benefit is smaller, and the response threshold is more. In order to simplify the model, Xiao Renbin et al. assumed that and are inversely proportional to each other and that and are directly proportional to each other. In Equations (24) and (25), k is the transformation coefficient of the range of adjustment threshold.
(5) Response probability and benefit change
In the process of interest after allocation, the interests of the
agent may have three changes—interests increase, decrease and interests are the same—so in the
agent distribution of interests in the process with time, there are three possible response probabilities, namely benefit increases the probability of the response
, benefit reduces the probability of the response
and the response to the interests of the same probability is
. The specific expression is as follows:
Based on the response probability formula in the fixed threshold response model, the slight difference is that it chooses different behaviors and different variables were set up to calculate the corresponding response probabilities. It can be seen that the probability of increasing agent’s interest is directly proportional to and inversely proportional to, respectively; the probability of decreasing agent interest is also directly proportional to and , and inversely proportional to and , and the probability of unchanged agent interest is jointly determined with both and , respectively.
After the
agent choice of benefit change behavior, and benefits need to be regulated, the specific formula is as follows:
In Equations (4)–(29), represents the change value of agenti interest, is the value increase in the process of interest regulation, and on the contrary, is the value of decrease. In Equations (4)–(30) and (4)–(31) are decision state variables, equal to 1 or 0. When the agent choice benefits increase, (), , and vice versa. Likewise, when the agent choice benefits decrease (), , and vice versa, When the agent does not change (), . Considering the positive and negative ratio between the interest realization ability and the expected interest of the interest subject, the value of is taken when the interest increases and when the interest decreases. Since the total interest is unchanged, the adjustment value is compared with 1% of the total interest. and respectively represent the change value of the increase and decrease of interest in the process of the complex distribution of interest. It means is that any value is taken from the interval (0, ), and represents any value that is taken from the interval .
In Equation (32), the new income of agenti after redistribution is equal to the interest obtained at the initial time plus the change in interest .
(6) Mean and variance of group relative deprivation and variance of expectation attainment
Given that the benefit realization ability of the network freight platform carrier driver in this model is mainly determined by the vehicle model, the recovery factor is not considered in this model. Considering the interests of the interests in the process of distribution in dynamic coordination,
agent relative deprivation has been changing in order to consider the final interests distribution coordination effect. The model proposed group of relative deprivation mean and variance as the effectiveness of the model index, specific expression is as follows:
where
is the average of the group relative deprivation,
is the squared difference of the group relative deprivation, when
>
indicates that the relative deprivation at this time is still in the higher range, so the benefit redistribution is not yet over; on the contrary, it indicates that the relative deprivation at this time has reached the acceptable range of the network freight platform carrier group, and the benefit redistribution is completely over. At this time, the benefit distribution plan is the final plan, and the acceptability of the distribution result can be tested by
, and a large
indicates that the effect is less desirable, and vice versa.
is the squared difference of the standard expected benefit realization of the group, and since the standard expected benefit realization is 1, it is used to calculate the stability of the realization, and the smaller the
becomes, the better the regulation effect is.
4.3. Algorithm Implementation
The flow chart of the algorithm is shown in
Figure 2:
The specific steps are as follows:
Step 1, set the following variables: called agenti in the above order allocation problem of the initial costs, transportation costs and carbon emissions earnings , maximum bearing capacity , unit costs , a carbon tax w, coefficient of carbon emissions , fuel cost carbon conversion coefficient α and conversion coefficient of β, take the initial simulation time t = 0, tmax for maximum operation frequency, the conversion coefficient of setting the threshold value k, is the critical value of satisfaction with group relative deprivation.
Step 2, calculate the contribution and contribution rate of agenti according to Equations (16) and (17), and the expected revenue of agenti according to Equation (18).
Step 3, calculate the realization degree of expected income and relative deprivation according to Equations (19) and (20);
Step 4, calculate the mean and variance of the relative deprivation of the group and the variance of the degree of realization of desired benefits according to Equations (33)–(35), respectively.
Step 5, calculate the environmental stimulus value and response threshold according to Equations (21)~(22) and (23)~(25).
Step 6, calculate the choice probability of agent interest change according to Equations (26)–(28), and the behavior choice of agent interest change is made according to the principle of probability Max, and the income of agent is updated according to Equations (29)–(32).
Step 7, if the mean of group relative deprivation () is > the critical value and t + 1 < tmax, let t = t + 1, go to step 3. Otherwise, go to step 8.
Step 8, statistics and output simulation results.