1. Introduction
In this paper, two new approaches to formulate machine scheduling problems are proposed. These new formulations extend and mix parts of previous models. We focus on two scenarios, namely a single machine and identical parallel machines, in order to be able to study a wide range of problems. The set of problems to be considered is 1||LMAX, 1||ΣUi, 1||ΣwiCi, 1||ΣTi, 1||ΣwiTi, Pm||ΣwiCi, and Pm||ΣwiTi.
From a general point of view, the main formulations for single-machine scheduling have been the following:
DC: disjunctive constraints formulation [
1];
LO: linear ordering formulation [
2,
3];
TI: time-indexed formulation [
4];
SP: sequence position formulation [
5,
6].
An experimental study of the previous formulations on a single machine was carried out [
7] by considering objectives such as 1||Σw
jC
j, 1||L
MAX, 1||Σw
jT
j, and 1||ΣU
j, as well as the same problems with release dates. Another experimental study of these formulations on the single-machine total tardiness problem was performed in [
8], including a hybrid formulation between the LO formulation and the SP formulation, with an adaptation of the traveling salesman problem formulation with Desrochers and Laporte anti-loop constraints [
9]. Although these last two approaches are more recent, neither of them achieves any relevant improvements over the other formulations. The DC and SP formulations were also compared empirically in other single-machine scenarios [
10].
However, as far as we are concerned, there has not been a relevant contribution to the formulation of scheduling problems for years. Therefore, the main novelty of this paper is to develop new strategies for modeling scheduling problems, aimed at both single-machine and parallel-machine scenarios.
Considering the results from previous studies and the results presented in this work, it can be concluded that the TI formulation is the best formulation for short job sizes, such as the uniform distribution from the interval [1, 10]. However, with the increase in job size, it becomes a very inefficient formulation. For instance, the time to build the model in a single-machine problem with no more than 100 jobs can take several hours if the job size is in the uniform distribution from the interval [1, 100].
When the problem objective does not imply making the job completion time explicit, SP is the best formulation for any job size [
8], but the objective of the problem can be modeled using the completion time of each position, which happens in problems like 1||ΣC
j, 1||ΣT
j, 1||L
MAX, or 1||ΣU
j. However, weighted objectives in linear programming cannot be modeled without obtaining the completion time for each job. For these cases, the SP formulation loses efficiency, although its behavior depends on the specific weighted objective. Regarding weighted completion times, its behavior becomes very inefficient, whereas its results improve for weighted tardiness purposes.
LO is a good formulation for problems with few jobs [
7]. The set of θ(n
3) transitivity constraints, where n refers to the number of jobs, significantly worsens its performance when the number of jobs increases. Finally, DC is the formulation with the worst behavior, even for problems with the computational complexity of class P [
8].
The next section shows the modeling of these formulations for single-machine problems. In
Section 3, we present the new approaches to formulate single-machine problems. In
Section 4, we adapt these formulations for a parallel-machine scenario. Finally, in
Section 5, an empirical analysis of the formulations is developed.
2. Overview of Previous Formulations
In this section, a summary is presented for each of the four to-date best approaches for weighted objectives, namely the DC, LO, TI, and SP formulations.
Let N = {1..n} be the set of jobs to be sequenced.
; let pj be the processing time, wj be the weight of the job, and dj be the due date of the job. Also, .
2.1. Disjunctive Constraints Formulation (DC)
The variables of the formulation are defined as follows:
if job j precedes job k; 0 otherwise;
completion time of job j; maximum lateness;
tardiness time of job j; if job j is late; 0 otherwise.
Common constraints in all DC single-machine models are (1)–(3), i.e., constraints (1)–(3) are included for all objectives:
2.2. Linear Ordering Formulation (LO)
It uses the same variables that were defined for the DC formulation. Common constraints in all LO single-machine models are the following:
Calculation of lower bound for the time completion:
Transitivity constraints:
All objectives for scheduling problems are modeled in the same way as that for the DC formulation (
Section 2). The only difference is the replacement of common constraints (2) and (3) by constraints (12) and (13).
2.3. Time-Indexed Formulation (TI)
For job processing, let t = 1, 2, …T be the set of time periods, where .
The variables of the formulation are defined as follows:
if job j starts processing at time t; 0 otherwise;
completion time of job j.
Common constraints in all TI single machine models are:
No more than one job is processed at each time:
Objective functions for each type of problem are:
2.4. Sequence Position Formulation (SP)
Let S = {1…n} be the set of positions in the sequence. Since n is the number of jobs, n is also the number of positions in the sequence.
The variables of the SP formulation are defined as follows:
if job j is processed at position s; 0 otherwise;
completion time of position s;
1 if job in position s is late, 0 otherwise;
completion time of job j;
tardiness time of position s;
tardiness time of job j.
Common constraints in all SP single-machine models are:
Calculation of the lower bound for the completion time of each position:
In the previous cases, where no weighted objectives were imposed, there was no need to work with variables considering the job. In other words, the model could include only variables relative to the position. However, in this case, we need to calculate the lower bound for the completion time of each job from the completion time of each position:
3. Novel Formulations—Single-Machine Scenario
In this section, the formulations of the novel approaches are presented for the case of single-machine problems. The parallel-machine scenario will be developed in
Section 4.
3.1. Order-Position Hybrid Formulation (OPH)
This novel approach takes elements from both the LO formulation and SP formulation, so that the transitive constraints can be replaced. It uses the following variables:
if job j precedes job k; 0 otherwise;
if job j is processed at position s; 0 otherwise;
completion time of job j;
tardiness time of job j.
Common constraints in all DC single-machine models are expressions (1) and (12) from the LO formulation and (22) from the SP formulation:
To establish a connection between formulations and prevent precedents from forming a loop, there is a need to incorporate the following relationship between both formulations into the model:
Constraint (30) ensures that the position in the sequence occupied by job j is equal to the number of jobs preceding job j, plus one. Constraint (30) also forces each job to have a position, thus making it unnecessary to include constraint (21).
Proof. Because of constraint (22), there must be a job in each position. If a job were to be in more than one position, some job j would not be able to be placed in any position; hence, . By constraint (30), for that job j: , which is impossible. Therefore, each job has a unique integer position in the sequence, which corresponds to the integer . □
In addition, the set of constraint (30) allows us to relax variables .
Proof. Let the established order be:
k = 1 Job o1; k = 2 Job o2; …; k = i Job oi; …; k = n Job on
There is a bijective mapping between {o1, o2, …, on} and the set of jobs {1…n}.
…
…
Therefore, all variables are assigned a value of 1 or 0, even if they are defined as continuous. □
For all objectives, the constraints and expressions to be included correspond to those proposed for the DC or LO formulations.
3.2. Order-Disjunctive Hybrid Formulation (ODH)
The LO formulation problem shows a high number of O(n
3) of transitive constraints (constraint (13)), which prevents precedence loops amongst jobs. We propose to replace transitive constraints with the following disjunctive constraint (32), which only employs n
2 constraints while guaranteeing the proper sequence of the jobs:
which are equivalent to constraint (3).
Therefore, common constraints of the ODH formulation will be expressions (1) and (12). The other expressions correspond to those of the DC or LO formulation.
4. Formulations for Parallel-Machine Scenario
In this scenario, we need to incorporate set
M={1…
m} of parallel machines as data. Precedence relationships for LO, OPH, and ODH formulations must only happen amongst jobs processed on the same machine. The machine where each job is processed needs to be considered a decision variable for the LO formulation, but not for the OPH formulation, where it can be treated as an auxiliary calculation of the position in the sequence of each machine, as in the SP formulation. The DC formulation is not going to be studied in this section because of its poor behavior. Finally, we are going to focus on the problems with objectives
Pm||Σ
wjCj and
Pm||Σ
wjTj. Other objectives for the study of identical parallel machines were considered in [
11,
12,
13].
4.1. Linear Ordering Formulation (LO)
The variables of the formulation are defined as follows:
if job j precedes job k; 0 otherwise;
if job j is processed in machine i; 0 otherwise;
if jobs j and k are processed in the same machine; 0 otherwise;
completion time of job j;
Tardiness time of job j.
Common constraints in all LO parallel-machine models are:
In addition, constraints (12) and (13) need to be incorporated. The objective functions for every problem are the same as those of the single-machine scenario.
4.2. Time-Indexed Formulation (TI)
For job processing, let t = 1, 2, …T be the set of time periods, where .
The variables of the formulation are defined as follows:
;
If job j starts processing at time t in machine i; 0 otherwise;
completion time of job j.
Common constraints in all TI parallel-machine models are:
No more than one job is processed at each time:
Objective functions for each type of problem are:
4.3. Sequence Position Formulation (SP)
The variables of the SP formulation are defined as follows:
if job j is processed at position s in machine i; 0 otherwise;
completion time of position s;
completion time of job j;
tardiness time of job j.
Common constraints in all SP parallel-machine models are:
No more than one job in each position:
Calculation of lower bound for the time completion of each position:
Calculation of lower bound for the time completion of each job:
The objective functions for each type of problem are the same as those defined in the LO formulation.
4.4. Order-Position Hybrid Formulation (OPH)
All variables from the LO formulation and variables from the SP formulation are used in the OPH formulation.
Expressions (33) and (34) from the LO formulation and (40) and (41) from the SP formulation need to be incorporated. The relationship between the LO and SP formulations is defined as:
As in expression (30), variables
can be considered continuous. Regarding variables
, their value is set to 0 whenever any variable
or
is equal to 0. Therefore, the next constraints must be added:
However, variables
are not decision variables in this model but are calculated from variables
by using the following expression:
The objective functions for each type of problem are the same as those defined in the LO formulation.
4.5. Order-Disjunctive Hybrid Formulation (ODH)
The ODH formulation with parallel machines uses variables from the LO formulation, along with its constraints, (33)–(35) and (12). In addition, constraint (3) must be added to guarantee proper sequences on each machine. Constraint (32), which was defined for the ODH formulation in a single machine, cannot be applied in the case of parallel machines, since two jobs on different machines do not have precedence relationships.
The objective functions for each type of problem are the same as those defined in the LO formulation.
5. Computational Results
Experiments have been performed by following the experimental analysis proposed by [
14,
15]. Therefore, the following parameters have been considered:
Processing time pj: generated from two uniform distributions: [1, 100] and [1, 10];
Weight wj: generated from the uniform distribution [1, 10];
Due date dj: an integer generated from the uniform distribution [P(L − R/2),P(L + R/2)], where P is the sum of the processing times of all jobs. Parameters L and R correspond to relative measures of the location and range of the distribution, respectively, and take three values each: {0.2, 0.4, 0.6} and {0.6, 1, 1.4};
Number of jobs (n): six sizes have been considered {20, 40, 60, 80, 100, and 200}, although no problem includes the six sizes.
From each combination of parameters, four instances are generated. Experiments were run on an Intel(R) Core(TM) i7-10700K CPU @ 3.80 GH with 16 Gb RAM. The optimization library GUROBI Optimizer 10.0.1 was used.
All instances for all problems will be executed during a number of seconds equal to the number of jobs times the number of machines in the corresponding problem, which is a significant difference compared to the analysis of previous scheduling models, where the execution time is usually one hour.
5.1. Linear Programming Relaxation
The linear programming relaxation has been analyzed for 1||ΣwjCj and 1||ΣwjTj. This relaxation of the integer models produces optimal solutions for the formulations TI, LO, OPH, and ODH in the case of problem 1||ΣwjCj. However, the SP formulation obtains solutions with an integrality gap of 97.5%.
Regarding problem 1||Σ
wjTj,
Table 1 shows the lower bound gap. In this case, the TI formulation presents the best bounds. ODH also shows a better gap than the rest of the formulations. LO and OPH have the same lower bounds.
5.2. Results for 1||ΣwjCj
Regarding the computational results of the integer resolution of the problem,
Table 2 shows the average resolution times for the four instances of each size and
pi interval. The behaviors of the LO, TI, OPH, and ODH models can be observed. However, results from the sequencing-position model (SP) were not included because, with just 20 jobs and after an hour, no instance had finished, and the best solution found so far was not optimal.
Since the building time of a model is equivalent in all scheduling problems that are to be analyzed, the analysis of the TI formulation for the interval pj ∈ [1, 100] is discarded. For any scheduling problem, preliminary tests for the interval pj ∈ [1, 100] conclude that the TI models do not yield good feasible solutions for most of the cases.
As can be seen in
Table 2, OPH and ODH are clearly the fastest formulations, equivalent to the resolution time of the LP relaxation. Although TI is slightly better than LO for
pj ∈ [1, 10], the growth of the TI model may make its resolution unfeasible for the
pj ∈ [1, 100] distribution interval.
To evaluate the growth of the models,
Table 3 shows how to calculate the size of the integer formulations for all four alternatives. In the TI formulation,
T corresponds to the number of time periods.
Table 4 displays the size for
N = {20, 40, 60, 100} and shows that TI contains the smallest number of constraints for
pj∈ [1, 10], whereas the OPH formulation contains the smallest number of constraints for
pj∈ [1, 100]. LO and ODH contain the smallest number of variables, but ODH considerably reduces the number of constraints.
5.3. Results for 1||LMAX
It was proved that the best formulation to solve the 1||L
MAX problem was the SP formulation [
9], where the maximum execution time was one hour. In our study, the maximum time is set to
n seconds.
Table 5 shows the number of instances where the branch and bound execution finished
(A), the number of instances where the optimal solution was reached
(B), and the number of instances where each model obtained optimal or best feasible solutions compared to other formulations
(C). Please note that we have considered the values of
n = {40, 60, 80, 100}.
Each row of
Table 5 comprises 36 instances, i.e., 4 instances for each combination of [R, L]. Therefore, 288 problems have been solved. The ODH formulation achieves the best solutions, and
pj ∈ [1, 100] reaches optimal values for all instances. It can also be seen that the
SP formulation is the one that most often completes the resolution, thus guaranteeing the optimal solution. Although the TI formulation shows very good behavior, it significantly decreases convergence with 100 jobs. LO never provides optimal solutions when it does not finish. Finally, OPH presents the worst performance here.
5.4. Results for 1||ΣUj
Although ODH yields very good results for problem 1||ΣU
j, the SP model provides the best overall performance (
Table 6). However, if the results are analyzed based on parameter R (
Table 7), ODH turns out to be the best model with R equal to 1 and 1.4, whereas its performance decreases with R equal to 0.6. At that value, SP works very well. In other words, SP is the best formulation for problems where delivery dates are very similar, but if we increase the range of delivery date values, ODH becomes the best formulation.
Table 6 also shows that TI loses convergence as the number of jobs increases, while the ODH formulation presents an overall better behavior for a greater interval of
pj.
5.5. Results for 1||ΣTj
Henceforth, formulations are studied for the NP problems. Results are analyzed for n = {40, 60, 80} and are shown in
Table 8. Since 4 instances for each combination [L, R] are considered, there is a total of 216 instances. The TI formulation presents the best overall results, provided that the formulation is feasible. If all ranges are considered, ODH presents the best results by a considerable margin. Although the LO formulation comes in second place, it loses efficiency when the number of jobs increases, due to its constraints to avoid loops in the precedents of order O(n
3).
As in the previous models, SP continues to have a direct formulation of the objective function with the use of positions. It is not necessary to calculate the delay of the job, but the delay of the position. However, SP formulation does not show good results, and there are no important differences regarding the length of the process times.
5.6. Results for 1||ΣwjTj
The problem of minimizing the total weighted tardiness in single-machine scheduling is a well-known, strongly NP-hard problem [
16]. Therefore, the resolution of instances until reaching an optimal solution becomes harder with size. We have decided to maintain the same instances as in the previous problem 1||Σ
Tj.
In this problem, SP needs to calculate the delay of each job from the position delay, so this formulation will lose efficiency. Results are displayed in
Table 9 and follow a similar trend as problem 1||Σ
Tj.
The TI formulation yields the best results. Regarding the rest of the formulations, ODH is the one presenting the best convergence. LO and OPH improve the results of the SP formulation, with LO being slightly better. There are no notable differences with respect to the length of the process times.
5.7. Results for Pm||ΣwjCj
To minimize completion times with parallel machines, we have considered the number of jobs to be n = {20, 40, 60} and the number of machines m = {2, 3, 5}. Four instances for each combination of the processing time interval, {pj ∈ [1, 100], pj ∈ [1, 10]}, n, and m, have been generated, leading to a total of 72 instances.
The TI formulation reaches the optimum in all 36 problems with
pj ∈ [1, 10]. If any range of
pj is considered, ODH outperforms the rest of the formulations, displaying the best performance in 45 problems. The OPH and LO formulations show similar results, with OPH being slightly better. Like the single-machine problem, SP exhibits very poor convergence. Since there were no differences regarding the length of the process times, results of both intervals have been integrated in
Table 10. Therefore, each row contains eight problems, except for the TI formulation, which has only executed four problems.
5.8. Results for Pm||ΣwjTj
In the case of problem
Pm||Σw
jT
j, 648 problems are tested. As in the P
m||Σw
jC
j problem, there were no differences regarding the interval of the process times, so the results of both intervals have been integrated in
Table 11. Therefore, each row contains 72 problems, except for the TI formulation, which has only executed 36 problems in each row.
Like the previous problem, TI achieves optimal solutions for all problems. Although the results of the SP and ODH formulations are roughly similar, ODH provides a greater number of optimal solutions. Finally, OPH improves the LO formulation.
5.9. Summary and Applications
Scheduling models are used in almost all production systems and environments. Their formulations are needed in many applications as a tool for the experimental study of heuristic allocation techniques and even as a way to achieve exact optimal solutions in environments with a restrained number of jobs.
As a summary of results, the total number of instances where each formulation obtained the optimal or the best feasible solution in each problem is shown in
Figure 1. In other words,
Figure 1 depicts the summation of the
C column from
Table 5,
Table 6,
Table 7,
Table 8,
Table 9,
Table 10 and
Table 11. It can be seen that ODH outperforms the other formulations in most of the problems. Regarding the
Pm||Σ
wjTj problem, the TI formulation achieves a great performance only in scenarios with a small number of jobs.
6. Conclusions
In this paper, two novel modeling approaches for scheduling problems have been presented by extending already existing modeling strategies. The strategy for the analysis has been to use short execution times: a number of seconds equal to the number of jobs times the number of machines. Afterward, we collected the number of completions reached and the number of optimal solutions, and instead of the average error, we have computed the number of times that each formulation achieved the best result amongst the five approaches.
The first formulation, OPH, based on the combination of the linear order and sequence position formulations, shows very good results in weighted completion time objectives. The second formulation, ODH, based on the combination of the linear order formulation and the disjunctive formulation, yields the best results in our experimental study. Therefore, ODH should be considered the main formulation for further work and other study scenarios.
If only processing times belonging to the interval [1, 10] are assessed, the time index formulation produces the best results. However, when the processing time is extended to interval [1, 100], the model drastically increases its size. For instance, it takes more than 15 min to generate a problem with 40 jobs and one machine. In addition, its results are very poor compared to the processing time.
Future lines of research involve extending the two novel formulations to flowshop environments. Concerning limitations of the research methodology, mathematical models can be inefficient in complex scenarios with a large number of jobs. However, as a tool for studying heuristic techniques, ODH achieves optimal solutions while showing a better behavior than previous formulations.