1. Introduction
Scheduling is a decision-making process that deals with optimizing resource allocation to perform a collection of tasks in production or manufacturing processes. Scheduling is involved in many real-life applications, such as jobs in a manufacturing plant, customers waiting for services in front of a teller’s window, or airplanes waiting for clearance to land or take off at an airport. Many studies have been carried out by researchers in several scheduling environments, such as single machine scheduling and parallel machine scheduling, to fill some gaps in these problems.
Single machine scheduling requires only the sequencing of jobs. However, scheduling problems with more than one machine, such as parallel machines, involve both resource allocation and sequencing, rather than simple sequencing [
1]. The importance of parallel machine scheduling may be viewed from both theoretical and a practical perspectives [
2]. From a theoretical perspective, it is a general case of the single machine and a special case of the flexible flow shop. A practical perspective is important since parallel operations are frequent in the real world. The makespan objective in parallel machines is of considerable interest, since it balances the load between the parallel machines.
In the parallel machine scheduling problem, each job is processed on only one machine and each machine can process one job at a time. The identical parallel machine is a variant of the machine scheduling problem that is characterized by a set of n independent jobs J = J1, J2, …, Jn to be scheduled in m identical parallel machines M = M1, M2, …, Mm, where (m < n). Each job J can be processed in only one machine with a processing time pj that is identical on all the machines. Preemption is not allowed. Therefore, once a machine is processing a job, it must complete its processing without interruption. The objective is to minimize the total completion time of the last processed job in the schedule (makespan).
The identical parallel machine scheduling problem to minimize the makespan is described by Graham et al. [
3] in terms of three fields:
as
, where
Pm indicates the identical parallel machine environment with
m machines, the second empty field, which indicates no preemption or precedence constraints on the jobs, and
Cmax in the third field, which indicates the makespan minimization objective. This problem is proven to be NP-hard [
4]. Therefore, efficient exact algorithms were needed to be able to solve large-size instances to optimality in less computational time.
The remainder of this paper is organized as follows:
Section 2 presents a literature review of the identical parallel machine scheduling problem and the arc flow model.
Section 3 details the methodology used in the problem. The computational results and discussion are provided in
Section 4. Finally, the conclusion and future work are shown in
Section 5.
2. Literature Review
The first known heuristic applied to the identical parallel machine was the longest processing time (
LPT) rule proposed by Ronald L. Graham [
5].
LPT works by sorting jobs in a non-increasing order of processing times and assigning jobs to the least-loaded machine one by one until assigning all the jobs. The worst-case approximation ratio for the
LPT rule is 4/3–1/(3
m), where
m is the number of machines. Recently, Della Croce and Scatamacchia [
6] revisited the
LPT rule and improved the ratio to 4/3–1/(3
m-1). Other popular approximation heuristics combine (
Pm||
Cmax) with bin-packing techniques:
MULTIFIT [
7],
COMBINE [
8], and
LISTFIT [
9]. These heuristics provide a better worst-case performance, but with higher computation times.
Metaheuristics also play an important role in obtaining good solutions for parallel machine scheduling problems. Among these metaheuristics are the genetic algorithm [
10], simulating annealing [
11], the variable neighborhood search [
12], and other metaheuristics in the literature.
Over the past years, researchers have increased their interest in finding efficient exact methods capable of solving large and hard instances to optimality in less computational time. Mokotoff [
1] conducted a survey on parallel machine scheduling. Dell’Amico and Martello [
13] proposed a branch-and-bound algorithm based on sophisticated lower and upper bounds and some dominance rules for (
Pm||
Cmax). Mokotoff [
14] proposed an exact method based on the cutting plane method for (
Pm||
Cmax). Then, Dell’Amico and Martello [
15] noted that their previous work [
13], which had been published before the work of Mokotoff [
14], obtained better results in terms of the computation time and solved all the studied instances to optimality. Dell’Amico et al. [
16] presented a scatter search algorithm and exact algorithm based on branch-and-price algorithms, which make use of the duality between
P||
Cmax and the bin-packing problem. Haouari et al. [
17] proposed lower bounds based on the lifting procedure. The authors also proposed two heuristics that required iteratively solving a subset sum problem. The previous work was extended by Haouari and Jemmali [
18], enhanced lower bounds were obtained, a new heuristic was proposed based on solving a sequence of 0–1 knapsack problem, and these bounds were embedded in a branch-and-bound algorithm. To test the performance of their algorithm, the authors identified a set of hard instances for which the ratio between the number of jobs and the number of machines was equal to 2.5. The proposed branch-and-bound solved only about 68% of a total of 240 generated instances with different numbers of jobs and machines.
The arc flow approach has been used recently in classical optimization problems, and allows modeling with a pseudo-polynomial number of variables and constraints. For a cutting-stock problem, de Carvalho [
19] proposed a branch-and-price approach for an arc-flow formulation. Next, it was extended for the bin-packing problem in De Carvalho [
20]. An alternative arc-flow formulation for the cutting-stock problem was proposed in [
21,
22], which used a graph compression technique that reduced the size of the constructed graph without affecting the optimal solution. These formulations were recently tested and compared in [
23] against several other models and problem-specific algorithms on one-dimensional bin-packing and cutting-stock problems. J. Martinovic et al. [
24] compared the arc-flow model with a one-cut model for the one-dimensional cutting-stock problem and presented reduction techniques for both models. M. Mrad et al. [
25] proposed a graph compression method to an arc flow formulation for a two-dimensional strip-cutting problem. Other applications of arc flow include berth allocation problems [
26], vehicle-routing problems [
27], and facility location problems [
28].
In the area of scheduling, Mrad and Souayah [
29] proposed an arc flow formulation for the parallel machine scheduling problem to minimize the makespan. The proposed mathematical model outperformed other proposed methods from the literature, since it solved to optimality most of the hard instances from the literature in a few seconds on average. On the other hand, some hard instances were still unsolved within a predefined elimination time, mainly because the number of jobs was relatively large and the ratio of the number of jobs to the number of machines was greater than or equal to 2.25.
A. Kramer et al. [
30] studied the identical parallel machine scheduling problem to minimize the total weighted completion time. An enhanced arc flow formulation with reduction techniques was proposed, which reduced the number of variables and constraints. As a consequence, large instances with up to 400 jobs were solved to optimality and instances with up to 1000 jobs provided a low optimal gap. Then, A. Kramer et al. [
31] extended the previous work of A. Kramer et al. [
30] by adding a release time constraint for each job. A mixed-integer linear program and a branch-and-price algorithm that relied on the decomposition of an arc-flow formulation and the use of exact and heuristic methods were proposed for solving pricing subproblems. S. Wang et al. [
32] studied deterministic and parallel machine scheduling location problems and proposed a network flow-based formulation, two formulation-based heuristics, and one polynomial time algorithm. Trindade et al. [
33,
34] proposed an arc flow formulation for parallel and single batch processing machine scheduling with non-identical job sizes and a machine capacity. A. Kramer et al. [
35] proposed five different formulations for identical parallel machine scheduling with family setup times to minimize the total weighted completion time. The formulations were one commodity formulation, three arc flow formulations, and a set covering formulation. The results showed that one of the arc flow formulations and a set covering formulation yielded a better performance. de Lima et al. [
36] conducted a survey on the foundation of the arc flow formulation and showed the relation between their network and dynamic programming. The survey also discussed the main solution methods for solving large-scale arc flow models and their main applications. de Lima et al. [
37] proposed a network flow framework to address huge network issues in arc flow model solutions.
To summarize, we can say that so far, there are two kinds of proposed approaches for makespan minimization on parallel machines: heuristics and exact methods. On the one hand, heuristics have the potential to find good solutions in a reasonable amount of time. However, they do not guarantee the optimality of the provided solutions. For instance, one of the most recent heuristics for the studied problem [
6] can still be outperformed by older methods for some benchmark instances. On the other hand, exact algorithms deliver optimal schedules, but are limited by the size of the instances (number of jobs/machines) and the relatively high computation time. To illustrate, the state-of-the-art exact method [
29] still fails to solve some instances with as much as 154 jobs. Moreover, it requires thousands of seconds to solve problems with less than 200 jobs.
The main aim of this paper was to move towards an efficient implementation of the arc flow model for makespan minimization in an identical parallel machine scheduling problem. Hence, the arc flow model proposed by Mrad and Souayah [
29] was considered. The improvements were made by proposing enhanced bounds and graph compression techniques to reduce the number of variables and constraints in the constructed arc flow model. A variable neighborhood search (
VNS) algorithm was proposed with five neighborhood structures and an initial solution obtained from the schedule of the longest processing time (
LPT) rule as an upper bound. Three lower bounds from the literature were considered for the improved arc flow model. A better upper bound was considered as a graph compression technique since it reduced the size of the graph. As a consequence, the number of variables was reduced. Furthermore, another proposed graph compression technique eliminated scheduling some jobs on the same machine, which made the resulting lower bound of the problem exceed the current upper bound obtained by the
VNS algorithm. It is worth noting that devising an improved arc flow formulation has an important application in reducing the computation time for solving hard optimization problems. This can be attested by our experimental results as well as those of previous successful implementations of such techniques [
30,
31].