Allocating constraint tasks schedules to promote on-demand cloud services by utilizing the Hungarian Algorithm

Nesreen Alsharman; Ismail Hababeh; Mohammad Alqudah; Kholoud Nairokh; Deefallah Alshorman

doi:10.7717/peerj-cs.3385

Allocating constraint tasks schedules to promote on-demand cloud services by utilizing the Hungarian Algorithm

Nesreen Alsharman ¹, Ismail Hababeh ¹, Mohammad Alqudah², Kholoud Nairokh¹, Deefallah Alshorman³

1Computer Science Department, German Jordanian University, Amman, Jordan

2Basic Sciences Department, German Jordanian University, Amman, Jordan

3Department of Elementary Teacher Education, Al-Zaytoonah University of Jordan, Amman, Jordan

DOI: 10.7717/peerj-cs.3385

Published: 2025-12-16
Accepted: 2025-10-23
Received: 2025-02-13

Academic Editor: Davide Chicco

Subject Areas: Algorithms and Analysis of Algorithms, Computer Networks and Communications, Data Mining and Machine Learning, Mobile and Ubiquitous Computing, Optimization Theory and Computation
Keywords: Hungarian algorithm, Makespan, Fittest task population, Quality of service, Assignment problem

Copyright: © 2025 Alsharman et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

Cite this article: Alsharman N, Hababeh I, Alqudah M, Nairokh K, Alshorman D. 2025. Allocating constraint tasks schedules to promote on-demand cloud services by utilizing the Hungarian Algorithm. PeerJ Computer Science 11:e3385 https://doi.org/10.7717/peerj-cs.3385

The authors have chosen to make the review history of this article public.

Abstract

Cloud computing offers numerous benefits to its users, but it also presents significant performance challenges. The nondeterministic polynomial time (NP)-complete nature of cloud workflow scheduling makes it a significantly challenging task. Scheduling cloud tasks become considerably more complex when operations involve varying quality-of-service (QoS) requirements. Constrained workflow scheduling, however, has the potential to boost cloud system performance and consequently improve quality of service. Although numerous approaches have been developed for workflow scheduling, most focus exclusively on single QoS constraints. This study presents a method for utilizing the Hungarian Algorithm (HA) to address multiple workflow scheduling constraints and promote on-demand cloud services. The Fittest Task Population algorithm (FTPA) was developed to generate the fittest task population set that matches the customer tasks’ constraints. The HA is utilized to assign each task in the generated fittest task population set to the fittest cloud virtual machine (VM). The proposed approach is validated and compared with state-of-the-art workflow scheduling methods using different multiple constraint scheduling scenarios. The comparative analysis validates the effectiveness of the proposed integrated FTPA-HA algorithm, demonstrating its superiority over existing scheduling approaches.

Introduction and background

Cloud computing has become a cornerstone of modern computing infrastructure, offering users dynamically scalable, flexible, and cost-efficient resources through Infrastructure-as-a-Service (IaaS) and related models (Burak & Bharadiya, 2023; Noman, Fernand & Peter, 2022; Patel & Kansara, 2021). The ability to provision resources on demand has attracted a wide range of applications, from large-scale data analytics and business intelligence to scientific workflows and real-time interactive services (Albtoush et al., 2023). Despite these benefits, scheduling workflow tasks in cloud environments remains a critical research challenge. This challenge arises from the NP-complete nature of workflow scheduling problems, the heterogeneity of virtual machines (VMs), and the diversity of user-defined quality-of-service (QoS) constraints such as deadlines, cost, reliability, and energy efficiency. Addressing these challenges is essential for ensuring that cloud services meet user expectations while also maintaining provider efficiency and sustainability.

A considerable body of research has explored workflow scheduling in cloud computing, producing a wide range of strategies. These approaches can be broadly grouped into heuristic-based, metaheuristic-based, machine learning–based, and hybrid or multi-objective frameworks. Heuristic approaches rely on deterministic or rule-based techniques to allocate tasks to resources (Hu, Wu & Dong, 2023). Examples include list scheduling, earliest-finish-time methods, and variations such as backfilling (Hussain et al., 2023). These methods are computationally lightweight, making them attractive for real-time scenarios and systems where scheduling decisions must be made rapidly. However, heuristics tend to focus on local optimization and often ignore global workload distribution. This results in inefficiencies such as load imbalance, violation of QoS constraints, and poor scalability when workflows are large or highly interdependent.

To overcome the limitations of heuristics, numerous metaheuristic algorithms have been applied to workflow scheduling. Genetic Algorithms (GA) (Kumar & Karthikeyan, 2024), Particle Swarm Optimization (PSO) (Fu et al., 2023), Ant Colony Optimization (ACO) (He et al., 2022), and other evolutionary strategies (Khiat, Haddadi & Bahnes, 2024; Tekawade & Banerjee, 2023) are among the most widely used. These methods explore large solution spaces effectively and can optimize multiple objectives such as makespan, energy consumption, and cost (Jingwei et al., 2023). However, they are computationally expensive, suffer from slow convergence in large search spaces, and are highly sensitive to parameter tuning. These drawbacks make them less suitable for on-demand and real-time cloud environments where scheduling must be both accurate and fast.

More recently, machine learning (ML) and probabilistic approaches have been introduced to enhance adaptability in dynamic and uncertain environments. For example, reinforcement learning and deep learning models (Mishra & Majhi, 2021; Shen et al., 2025; Zhou et al., 2022) can predict task execution times, resource consumption, and make adaptive scheduling decisions. Similarly, probabilistic models (Russo et al., 2024; Ye et al., 2022) attempt to manage uncertainty by modeling task execution times and costs as distributions rather than fixed values. These approaches improve adaptability (Singh, 2022) and predictive accuracy, but they introduce high training costs, increased scheduling delays, and difficulties in guaranteeing deterministic compliance with strict user-defined deadlines or budgets (Naqin et al., 2020). As a result, their applicability is often limited to offline or batch scheduling scenarios rather than real-time cloud service provisioning.

Hybrid frameworks attempt to combine the strengths of multiple paradigms (Sita, Bhambri & Kataria, 2023). Examples include multi-objective optimization frameworks (Hegde et al., 2024; Mohammadzadeh & Masdari, 2021) that jointly consider makespan, energy consumption, and execution costs, as well as deadline and cost constrained evolutionary algorithms (Tekawade & Banerjee, 2023). While these frameworks achieve better trade-offs among competing objectives, they often struggle when multiple QoS constraints must be satisfied simultaneously. Many assume simplified workflow structures (Ara et al., 2020), single global deadlines, or independent tasks, which reduce their applicability to real-world scientific workflows that are large, dynamic, and constraint rich.

The Hungarian Algorithm (HA) has attracted attention as an efficient method for solving assignment problems in polynomial time (Alam et al., 2022; Juliet & Brindha, 2023; Lee, 2022). Cloud scheduling (Khiat, Haddadi & Bahnes, 2024) has been applied to optimize resource allocation and minimize costs. HA-based methods are particularly appealing because they guarantee optimal task-to-resource assignments without the overhead of metaheuristic searches. However, most existing HA applications in cloud scheduling focus on pure cost or resource utilization optimization and fail to incorporate user-defined constraints such as deadlines and budgets. This significantly limits their use in real-world cloud workflow scheduling scenarios, where multi-constraint optimization is essential.

Several critical research gaps can be identified, many methods optimize one metric (e.g., makespan, energy, or cost) at the expense of others, leading to QoS violations under realistic workloads. Metaheuristic and ML-based approaches often incur prohibitive overheads, limiting their practicality in deadline-sensitive and on-demand contexts. Existing Hungarian Algorithm applications do not adequately incorporate user-defined constraints such as deadlines and cost budgets. Few methods combine lightweight heuristic or evolutionary mechanisms with exact assignment algorithms (Vásconez et al., 2024), a combination that is critical for achieving scalability (Ahmed, Choudhary & Al-Dayel, 2024) while satisfying constraints.

Furthermore, numerous unresolved challenges remain in terms of security. Existing cloud environments remain vulnerable to cyberattacks, as client data stored within vendor-controlled infrastructures may not fully comply with security standards, exposing users to breaches and malicious assaults (Elsayed, Almustafa & Gebali, 2022; Singh, Jeong & Park, 2016). Moreover, classical cryptographic techniques are insufficient to resist emerging quantum attacks, which creates an urgent need for post-quantum cryptography (PQC) solutions to ensure future-proof protection of cloud systems (Ukwuoma et al., 2022). Although PQC is rapidly evolving, its integration into real-time workflow scheduling and resource allocation strategies remains underexplored (Rajkumar et al., 2024).

In addition, machine learning-based cloud management systems face a growing threat from poisoning attacks, including domain name system (DNS) cache poisoning and data manipulation. These adversarial strategies can compromise the decision-making process of scheduling and resource allocation by injecting malicious inputs into training datasets (Mangalampalli & Karri, 2023). While solutions have been proposed for anomaly detection (Yang et al., 2024) and lightweight cryptographic fault tolerance in advanced encryption standard (AES) (Marisargunam, 2024), they primarily address data integrity and cryptographic resilience (Varnita et al., 2024) but do not fully consider the multi-constraint optimization problem of workflow scheduling under strict deadlines and cost requirements.

To address these research gaps and challenges, this paper makes the following contributions:

Proposes an integrated cloud task scheduling and allocating approach (FTPA-HA) that satisfies the user’s deadline and budget constraints.
Bridges performance and security perspectives: Unlike prior studies that address security or scheduling in isolation, our approach complements ongoing advancements in post-quantum cryptography and anomaly detection by introducing a scalable and constraint-aware workflow scheduling strategy that enhances resilience against malicious manipulations and resource misallocations.
Improves scalability and adaptability: Through genetic-based crossover and mutation operators in FTPA, the proposed system explores a wider solution space while ensuring near-optimal allocation efficiency, outperforming state-of-the-art scheduling techniques across diverse workflow sizes and varying user constraints.

While prior research has identified the security vulnerabilities and computational inefficiencies of cloud systems, this study fills the gap by presenting a constraint-driven, security-conscious, and computationally efficient scheduling mechanism that ensures deadline-and budget-compliant task allocation in heterogeneous cloud environments.

Methodology and materials

Allocating cloud tasks to heterogeneous VMs can result in varying performance outcomes. Therefore, many different constraints are introduced in workflow scheduling computations to generate near optimal solutions, such as deadlines, cost, load balancing, and energy efficiency. However, the constraints that are considered in workflow scheduling depend on the nature or size of the job, VMs availability, and running environment. In this paper, we define the workflow scheduling boundaries that affect the deadline and cost constraints as follows:

Makespan: the total time required to execute the entire workflow task. It represents the time duration of the application task completion.
Financial cost: the value which incurred while running a workflow. The financial cost is defined as follows: Let $E T_{t_{i}}^{V M_{q}}$ represents task t_i execution time on the virtual machine VM_q, $T T_{i j}$ represents the time of transfer data between task t_i and task t_j, $c o s t_{v m_{q}}$ represents the cost of task t_i running on a VM_q, and $c o s t_{t r a n s f e r_{i j}}$ represents the transfer cost between the parent task t_i and the child task t_j; then the total cost $C o s t_{W}^{V M_{s e t}}$ of the workflow schedule is computed in Eq. (1)

(1) $C o s t_{W}^{V M_{s e t}} = \sum_{i = 1}^{n} [(c o s t_{v m_{q}} \times (E T_{t_{i}}^{V M_{q}})) + (c o s t_{t r a n s f e r_{i j}} \times T T_{i j})] .$

VM lease time LT(VM_p): the time when the virtual machine VM_p becomes idle.
Start time: the time at which task t_i starts execution on its assigned virtual machine VM_q. Let t_root represents the root, i.e., entry task, Parents(t_i) represents the parents of task t_i, ParentsFinishTime(t_i) represents the parents finish time task t_i; then the start time $S T_{t_{i}}^{V M_{q}}$ is computed in Eq. (2)

(2) $S T_{t_{i}}^{V M_{q}} = {\begin{matrix} 0, & i f t_{i} = t_{r o o t}, & o t h e r w i s e \\ \underset{t_{v} \in P a r e n t (t_{i})}{m a x} & (L R (V M_{j}), & m a x (P a r e n t s F i n i s h T i m e (t_{i}) + T T_{i j})) . \end{matrix}$

Execution time: the time required for a task to be executed on the assigned virtual machine, which includes the running time and data transmission times. Let Size(t_i) represents the size of task t_i in millions of instructions, Speed(VM_p) represents the processing power of the virtual machine VM_p in millions of instructions per second MIPS, and VM_(p-PE) represents the number of virtual machines VM_p processing elements or cores; then the execution time is computed in Eq. (3)

(3) $E T_{t_{i}}^{V M_{p}} = \frac{S i z e (t_{i})}{S p e e d (V M_{j}) * V M (_{p_P E})} .$

Finish time: the time at which task t_i ends its execution on the assigned virtual machine. The finish time equals to the sum of task start time and its execution time.
Depth: the topological level of the task t_i in the workflow. The depth of task t_i Depth_ti is computed in Eq. (4).

(4) $D e p t h_{t i} = {\begin{matrix} 0, & i f t_{i} = t_{r o o t} \\ 1 + max_{t_{v} \in p r e d e c e s s o r (t_{i})} & D e p t h (t_{v}) + 1 \end{matrix} .$

Deadline: represents the user defined deadline and calculated in Eq. (5)

(5) $D e a d l i n e = T i m e_{H E F T}^{W} \times (1 + α),$ where Time^W_HEFT is the scheduling time of the tasks of the workflow W that is computed using the HEFT scheduling algorithm (Gobichettipalayam, Sandhiya & Sruthi, 2023). The parameter α is a randomly generated number that controls the deadline constraint. The task scheduling output must satisfy the constraint: Schedule_ti ≤ user defined Deadline.

Budget: denotes the user defined budget and computed in Eq. (6)

(6) $B u d g e t = C o s t_{c h e a p e s t}^{W} \times (1 + ω),$ where Cost^W_cheapest is the cost of scheduling the tasks of the workflow W, on the cheapest VM. The parameter ω is a randomly generated number that controls the budget constraint. The task scheduling output must satisfy the constraint: Schedule_cost ≤ User defined budget.

The objective of our proposed FTPA-HA scheduling approach is to minimize the makespan, which refers to the total execution time required to complete all workflow tasks on the allocated virtual machines. The makespan is primarily affected by the following key decision variables, task–VM assignment, task start times, and task execution durations. The task–VM assignment variable ( $x_{i, q}$ ) determines which virtual machine executes each task. Selecting faster or less-loaded VMs reduces execution time and, consequently, the overall makespan. The start time variable ( $S T_{i}$ ) specifies when each task begins execution, constrained by the completion of its parent tasks and the associated data transfer times. The execution time variable ( $E T_{i, q}$ ) depends on both the task size and VM computational capacity, while the finish time FT_i defines each task’s completion, with the largest finish time across all tasks determining the makespan.

Several constraints further influence this measure. The precedence constraint defined in Eq. (2) enforces task dependencies, ensuring that no task can start before its predecessors complete, directly shaping task scheduling order and duration. The execution time constraint defined in Eq. (3) links task size to VM performance, meaning that allocating tasks to slower VMs extends the makespan. The deadline constraint defined in Eq. (5) restricts the total execution time to remain within a defined limit, while the budget constraint defined in Eq. (6) caps the total cost, which may restrict the use of high-performance VMs and thus indirectly increase makespan. Additionally, the task depth constraint defined in Eq. (4) reflects workflow hierarchy; deeper dependency levels naturally prolong the overall completion time.

Generally, the makespan is determined by how effectively the proposed model balances task allocation, dependency handling, and resource utilization within the imposed cost and deadline limitations. Algorithm 1 describes the scheduling and allocating processes of the proposed approach.

Algorithm 1:

Scheduling and allocating workflow tasks in cloud computing.

Input: Cloud workflow tasks, user deadline constraint, user cost constraint

Output: The fittest tasks population set

Begin

Step 1: Input the initial set of tasks population.

Step 2: Select a task from the initial set of tasks population.

Step 3: Perform the three-point crossover genetic operator on the selected task.

Step 4: Apply the mutation genetic operator on the generated three-point crossover task.

Step 5: Add the newly generated tasks to the new feasible task population.

Step 6: Replace the initial tasks population with the new task feasible population.

Step 7: Evaluate the new feasible population. If the fittest task allocation is not met, repeat steps 2–7.

Step 8: If the fittest task allocation is met, add it to the fittest tasks population (n × n) matrix and determine the minimum element in each row and deduce it from each element in that row.

Step 9: Determine the minimum element in each column and deduct it from each element in that column.

Step 10: Determine the minimum number of lines to cover all zero elements in the matrix.

Step 11: Determine the smallest element (k) that is not covered by a line. For each element that is covered twice, add k and subtract k from the elements that are left uncovered.

Step 12: Allocate the smallest element (k) to the fittest Virtual Machine.

End

DOI: 10.7717/peerj-cs.3385/table-5

In this proposed scheduling and allocating algorithm, the three-point crossover (Singh, 2022) genetic operator is used to create a new near-optimal task population solution that expands the task allocating search space. The mutation genetic operator (Ahmed, Choudhary & Al-Dayel, 2024) supports the allocation process with random probability that gives the tasks a low fitness value and the chance to produce new feasible solutions. This demonstrates the ability of the proposed approach to enhance task scheduling performance in cloud computing systems.

The proposed approach consists of two-stage scheduling algorithms, FTPA and HA. First, FTPA generates the most feasible set of workflow tasks under strict constraints, mitigating weaknesses of heuristic-only methods. Then, HA is applied to optimally assign these tasks to the fittest VMs in polynomial time. The reflection of both FTPA and HA on Algorithm 1 is described as follows:

The FTPA primary role is to generate the fittest workflow task population set under user-defined deadlines and cost constraints. Reaching the fittest task allocation in (Step 7) refers to the evaluation stage where the newly generated task population is assessed against the user-defined deadline and cost constraints. At this point, the algorithm determines whether the evolved set of candidate solutions includes a feasible population that satisfies both constraints while maintaining high fitness. If such population is not found, the algorithm returns to (Steps 2–6), continuing the crossover and mutation process until a suitable solution is produced. This implies that (Step 7) serves as a checkpoint ensuring that only constraint-compliant and near-optimal task populations progress to the assignment stage. This iterative process (Steps 1–7), ensures that infeasible or low-fitness allocations are eliminated while retaining solutions that balance execution time, financial cost, and deadline adherence.

The second stage of the model applies the HA to optimally assign the refined FTPA-generated tasks to the fittest available virtual machines (VMs). HA operates on the cost matrix by systematically reducing rows and columns, covering zeros with minimal lines, and adjusting uncovered elements until the optimal assignment is achieved in polynomial time. This ensures that each task is allocated to the VM that minimizes total execution cost while still meeting deadline constraints. The HA is integrated explicitly in (Steps 8–12) as the second stage of the proposed scheduling algorithm, where the mapping of tasks to VMs are completed. These steps detail the core phases of the HA: constructing the (n x n) cost matrix from the fittest task population (Step 8), row and column reductions to normalize costs (Step 9), covering all zeros with the minimum number of lines (Step 10), adjusting the uncovered elements to generate additional zeros (Step 11), and finally assigning tasks to the fittest virtual machines by selecting the optimal zero-cost entries (Step 12). Together, the two stages form a complementary process: FTPA filters and prepares the most promising candidate task sets, while HA guarantees cost-effective and deadline-compliant allocations demonstrating the novelty and effectiveness of the proposed scheduling approach.

Experimental results and performance analysis

The proposed approach is simulated in Azure Application Service to deploy the assignment problem using Java 17 with Visual Studio on a laptop machine (Core i7 CPU, 2.40 GHz, and 8 Giga Byte (GB) RAM), and a set of different VMs running on heterogeneous cloud resources. The data set represents the workflow tasks costs and consists of 20 (n × n) matrices. The matrix rows represent agents (e.g., workers, machines, etc.) and the columns represent tasks (e.g., jobs, resources, etc.). Each matrix cell (rowi, colj) value represents the cost of allocating task (j) to the agent (i). The tasks allocating costs are used by the Hungarian Algorithm (the second stage of the proposed approach FTPA-HA) to find the fittest VM that best allocate each task. To assess how well the HA assigns tasks to VMs, correctly and optimally, we define the workflow allocating boundaries that affect the performance of the workflow constraints, namely Precision and Recall that are measured in terms of the following metrics:

True positive (TP): The number of tasks to which the best VM is successfully allocated.
False positive (FP): The number of not correct allocations assigned to VM.
False negative (FN): the number of missed allocations.

Precision and Recall are defined as follows:

Precision is the percentage of tasks that are correctly predicted positive allocations to the total predicted allocations and computed in Eq. (7):

(7) $R e c a l l = T P \div (T P + F N)$

Recall is the percentage of tasks that are correctly predicted positive allocations to all actual allocations and computed in Eq. (8):

(8) $P r e c i s i o n = T P \div (F P + T P) .$

Discussion

The proposed scheduling approach is validated by conducting several experiments that carried out in the same configuration and execution environment in order to guarantee a fair and reasonable comparison with well-known scheduling algorithms such as Optimal Sequence Dynamic Assignment Algorithm (OSDAA) (Kumar, Surachita & Kumar, 2022), Round-Robin (R-R) (Balharith & Alhaidari, 2019), Random (RD) (Manikandan, Gobalakrishnan & Pradeep, 2022), Genetic Algorithm (GA) (Kumar & Karthikeyan, 2024), and Particle Swarm Optimization (PSO) (Fu et al., 2023). The main objective of all approaches in comparison was to reduce the makespan. Simulations were performed using a standardized testbed equipped with an Intel Core i7 processor (2.4 GHz, 8 cores), 8 GB RAM, and running Ubuntu 20.04. Experiments deployed 10 virtual machines (VMs), each configured with 20 vCPUs, 21.4 GB RAM, 13,000 MHz processor speed, 100 GB storage, and 0.000001 transfer cost unit/s. The workload consisted of 200 tasks, either synthetically generated with directed acyclic graph (DAG) dependencies or drawn from standard scientific workflows, with task sizes (1,000–5,000) million instructions. All scheduling methods were tested under non-preemptive execution with no dynamic task arrivals, ensuring consistency across all evaluations. While the OSDAA configuration followed the default parameters, each of GA, PSO, R-R, and RD scheduling approaches need specific settings and tuning for best optimization. The GA was configured with a population size of 50, crossover rate of 0.8, mutation rate of 0.1, and 100 generations. The PSO used a swarm size of 30 with inertia weight 0.7, cognitive weight 1.5. The R-R and RD approaches were assigning tasks randomly without priorities.

The reinforcement learning-based strategy is used in the second stage (HA allocation) of the proposed approach to improve adaptability. After each scheduling decision, the system receives feedback on allocation performance (e.g., whether deadlines and cost constraints were met). This feedback is then used to update the allocation policy, enabling the algorithm to learn from prior scheduling outcomes. Over time, this reinforcement process enhances the Hungarian algorithm’s effectiveness by guiding it toward more efficient task-to-VM assignments under dynamic cloud conditions.

The heuristic-based strategy is realized in the FTPA stage of the proposed approach. In this stage, heuristic-inspired genetic operators, specifically three-point crossover and mutation, are employed to generate feasible task populations that satisfy user-defined deadlines and cost constraints. This heuristic process enables the scheduler to find near-optimal solutions efficiently without exhaustively searching the entire solution space. In addition, the heuristic-based strategy provides computational efficiency, ensures scalability in large-scale on-demand cloud service provisioning, and offers high-quality candidate solutions for the Hungarian Algorithm to refine.

The simulation parameters and the VMs configurations are summarized in Tables 1 and 2. Better cloud computing efficiency is achieved by having the scheduler distribute jobs to earlier VMs based on the task’s information and the resource information server with minimal makespan value.

Table 1:

FTPA simulation parameters.

Workflow type	Scientific Direct Acyclic graph (DAG) workflow
Workflow name	Montage, Epigenomics, Ligo-Inspiral, Sipht, and CyberShake
DAG size	100, 200, 300, 400, 500, 600, 700, 800, 900, 1,000 tasks
Number of edges	456, 4511
Average data tasks size	2.7 MB, 3.1 MB, 3.5 MB, 3.7 MB
Resource pool	10 spaceShared VMs Data Center
Number of VMs	10
Transfer cost unit/s	0.000001
Max. generation	300
Population size	200
Crossover probability	50%
Mutation probability	10%

DOI: 10.7717/peerj-cs.3385/table-1

Table 2:

VMs configuration parameters.

VM type	VM cores	Processor speed (MHz)	Memory (GB)	Cost unit/s
VM_type1	1	2,000	1.8	0.000012
VM_type2	4	4,500	5.3	0.000036
VM_type3	5	8,000	8	0.000049
VM_type4	8	10,000	16	0.00010
VM_type5	20	13,000	21.4	0.00014

DOI: 10.7717/peerj-cs.3385/table-2

Cloud providers offer flexible memory allocation, so memory in the cloud is elastic and scalable, allowing resources to scale up or down in response to demand, preventing memory waste or shortages for applications. The makespan fitness function is computed in Eq. (9):

(9) $M a k e s p a n^{min} = m a x {E T_{t 1}, E T_{t 2}, E T_{t 3}, \dots, E T_{t n}}$ where ET is the execution time needed to execute (tn) task on VM. Makespan reflects the overall efficiency of the scheduling workflows of cloud computing approaches. In the proposed scheduling FTPA-HA approach, the overall competence goal is minimizing the makespan to optimize resource use and task throughput under user constraints, deadline and cost.

The key factors that influence makespan in cloud-based scheduling and considered in our approach are population size, fitness weight, robust weight, crossover rate, mutation rate, input matrix size, input matrix cost. Table 3 summarizes the range and methods of the key factors changes and their impact on the makespan.

Table 3:

The impact of range and methods of key factors on the makespan.

Parameter	Range	Methods of change	Mechanism of impact on makespan
Population size	100–1,000	Linear increments	Larger populations → more balanced loads and shorter makespan.
Fitness weight	0.001–0.1	Multi-objective weighting	Emphasis on load balancing vs. sensitivity shifts optimization bias, shaping the scheduling output that flows into our approach—stage 2.
Robust weight	0.1–10	Log-scale adjustments	Heavily penalizing uncertainty → increases resource under utilization→ increase makespan.
Crossover rate	0.6–0.95	Step changes	Too high → enhances recombination of decent traits but can reduce privileged solutions → affects convergence speed and diversity.
Mutation rate	0.01–0.2	Step changes	Too low → early convergence. Too high noise.
Input Size (n × n matrix)	Derived from the proposed FTPA—stage 1	Indirect via the proposed FTPA—stage 1 outputs	Suboptimal assignments → extended makespan.
Input cost matrix precision	Small–large granularity	Normalized scaling	Minimizes total execution time → reduce makespan.

DOI: 10.7717/peerj-cs.3385/table-3

The proposed scheduling approach FTPA-HA effectively reduces makespan by combining the FTPA (stage 1) that generates decent initial solutions respecting cloud constraints, such as deadline, cost, and virtual machines availability, and HA (stage 2) that refines the solution for balanced parallel execution, minimizing idle virtual machines time and bottlenecks. This two-stage scheduling approach significantly reduces makespan, improving resource utilization and quality of service QoS, thus, ensure both efficient and practical scheduling in dynamic cloud environments. Table 4 shows the makespan results of the scheduling methods.

Table 4:

Cloud workflow scheduling methods comparison in terms of makespan.

Scheduling method	Makespan (s)	Number of scheduling tasks
Proposed method	0.00014 s	200
OSDAA	11,000 s	200
Round-Robin (R-R)	17,000	200
Random (RD)	17,000	200
GA	10,000	200
PSO	20,000	200

DOI: 10.7717/peerj-cs.3385/table-4

The results depicted in Table 4 show that the proposed two-stage FTPA-HA approach significantly outperforms existing approaches in terms of computational time since the first stage of the proposed approach is a computationally efficient lightweight heuristic-based scheduling technique used to optimize resource allocation that may not always provide optimal solutions but aims to find fast and good enough near optimal solutions, making it ideal for large-scale and on-demand cloud service provisioning with resource constraints where decisions should be made in near real-time. The second stage where the HA is utilized to allocate workflow tasks to the fittest VM in polynomial time. This is a reinforcement learning-based scheduling technique that supports feedback after each scheduling decision and updates its plan to improve its future allocations in polynomial time, making it more efficient for learning complex environments which can adapt to changing tasks and constraints over time, thus resulting in better long-term optimization. In addition, we carried out the experiments under the following execution constraints: no task anticipation, no dynamic task arrival, and fixed deadline which is not considered in makespan evaluation.

To evaluate the efficiency of state-of-the-art cloud workflow scheduling approaches, a comparative analysis was conducted between the proposed FTPA-HA scheduling approach and two state-of-the-art scheduling algorithms namely, CEDCES: A Cost-Effective Deadline Constrained Evolutionary Scheduler for Task Graphs in Multi-Cloud System (Mangalampalli & Karri, 2023) and GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy (Shen et al., 2025). This study implements the three scheduling approaches on five real-world scientific applications, Cybershake (Tang et al., 2021), Montage (Singh, Jeong & Park, 2016), Ligo-Inspiral (Elsayed, Almustafa & Gebali, 2022), Epigenomics (Sadhasivam, Balamurugan & Pandi, 2018) and Sipht (Youseff, Butrico & Silva, 2008). Each scientific application is tested on workflow task sizes (100, 200, …, 1,000) with different deadlines and costs constraints. Figures 1 and 2 present an average of 10 experimental results of the proposed FTPA-HA performance efficiency against CEDCES and GATES approaches in terms precision and recall with deadline and cost constraints.

Figure 1: Precision performance of the scheduling approaches.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-1

Figure 2: Recall performance of the scheduling approaches.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-2

Figure 1 shows that the CEDCES precision values under deadline constraint drop from 92.0% with 100 tasks to 87.3% with 1,000 tasks, and under cost constraint drop from 93.0% with 100 tasks to 86.6. This figure shows also that the GATES precision values under deadline constraint drop from 94.0% with 100 tasks to 89.3% with 1,000 tasks, and under cost constraint drops from 93.1% with 100 tasks to 88.7% with 1,000 tasks.

In addition, this figure shows that the proposed FTPA-HA precision values under deadline constraint drop from 97.2% with 100 tasks to 92.8% with 1,000 tasks, and under cost constraint drops from 96.0% with 100 tasks to 91.7% with 1,000 tasks. Figure 2 indicates that the CEDCES recall values under deadline constraint drop from 93.0% with 100 tasks to 88.7% with 1,000 tasks, and under cost constraint drop from 92.0% with 100 tasks to 87.6% with 1,000 tasks. Besides, this figure shows that the GATES precision values under deadline constraint drop from 95.1% with 100 tasks to 90.8% with 1,000 tasks, and under cost constraint drops from 94.0% with 100 tasks to 89.4% with 1,000 tasks. Furthermore, this figure shows that the proposed FTPA-HA precision values under deadline constraint drop from 98.0% with 100 tasks to 93.5% with 1,000 tasks, and under cost constraint drops from 97.2% with 100 tasks to 92.7% with 1,000 tasks.

The experimental results confirm that CEDCES approach struggles to manage increasing task complexity, especially under tight deadline and cost constraints, and the GATES technique improves adaptability but lacks optimal refinement under multi-objective constraints deadline and cost. On contrast, the proposed approach FTPA-HA distinguished by task-resource allocating that leads to optimal resource utilization (generates high precision values) and lower failure rates (generates high recall values) across all task sizes. However, the FTPA-HA performance depends on well-tuned mutation and three crossover rate parameters that degrade the performance slightly under increasing constraints but prove its adaptability even workloads scale high. This makes the proposed FTPA-HA a robust approach for large-scale, on-demand cloud task scheduling in deadline-sensitive and cost-limited situations.

To illustrate the second stage of the proposed approach, we implement the Hungarian algorithm on a web application hosted in a cloud cluster named Hungarian and record the system activity and performance in the period March 31–April 7, 2025.

Figures 3, 4, 5, 6, 7, and 8 represent two sets of performance monitoring data for two different users with different configurations. In the first dataset, the HA is implemented with various performance configuration metrics: Hypertext Transfer Protocol (HTTP) 5xx server-side errors, total amount of data received by the server, total amount of data sent from the server, number of HTTP requests received, and the average server response time in seconds. In the second dataset, the HA is utilized with different performance configuration metrics. This comparison exposes differences in traffic intensity, reliability, and response efficiency. The datasets features are described as follows: HTTP 5xx errors represents server-side HTTP error codes (500–599), indicating issues with the server when processing requests. Data In denotes the amount of incoming data received by the server from users. Data Out was defined as the amount of outgoing data sent by the server to users. Requests represent the number of requests made to the server. Response Time expresses the average time taken by the server to respond to a request.

Figure 3: Results of the assignment problem for the first configuration.
(A) The number of HTTP 5xx server-side errors. (B) The total amount of data received by the server. (C) The total amount of data sent from the server. (D) The total number of HTTP requests received. (E) The average server response time in seconds.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-3

Figure 4: Results of the Http 5xx of the second configuration.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-4

Figure 5: Results of the data in of the second configuration.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-5

Figure 6: Results of the data out of the second configuration.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-6

Figure 7: Results of the requests of the second configuration.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-7

Figure 8: Results of the response time of the second configuration.

Download full-size image

DOI: 10.7717/peerj-cs.3385/fig-8

Figure 3 represents five performance metrics monitoring graphs related to a web application hosted in a cloud cluster named Hungarian. This figure provides experimental records of system activity and performance in the period March 31–April 7, 2025. The figure sub-graphs are described as follows: Figure 3A represents the number of HTTP 5xx server-side errors. There are several noticeable spikes in 5xx errors, peaking at around 30 errors at one point, and 123 errors are spread unevenly over the timeline, indicating intermittent server-side issues and backend server instability due to crashes, timeouts, or server overloads, potentially correlating with request spikes. Figure 3B represents the total amount of data (7.3 MB) received by the server. There is one significant spike, reaching around 5 MB at a single point and the rest of the timeline shows almost negligible data input. A single large spike implies a large request payload during that instance. Figure 3C represents the total amount of data (249 kB) sent from the server, mirroring the request pattern. There are multiple small spikes, maxing at around 80–90 kB. Little data is sent out by the server suggesting lightweight responses, such as text-based or small JavaScript Object Notation (JSON) payloads. Figure 3D represents the total number of HTTP requests received (207). Several spikes in requests, with a peak of around 35–40 requests at once. High traffic contributes to error rates (similar pattern to 5xx errors). The request volume is moderate but shows intermittent behavior that might be causing performance degradation and triggering 5xx errors. Figure 3E represents the average server response time in seconds. Highly variable, with peaks nearing 12 s. A few drops below 2 s, but the average remains high (4.89 s) aligns with request spikes and server errors. It may imply slow database queries, backend bottlenecks, or overloaded infrastructure. All sub-graphs in this figure show that the Hungarian algorithm handles more traffic but struggles with server errors and performance degradation during the implementation period.

Figures 4, 5, 6, 7, and 8 represent another set of performance monitoring charts that show different behaviors for the same web application by changing the performance parameters’ values. It spans the same period (March 31–April 7, 2025). The figure sub-graphs are described as follows: Figure 4 represents zero HTTP 5xx errors that implies the server-side code for this system is running smoothly, without any crashes during the running period. Figure 5 represents the total amount of data (52.6 kB) received by the server. There is one small spike around 50 kB, which likely indicates minimal data processing or low upload activities of the system. Figure 6 represents the total amount of data (59.5 kB) sent from the server. There is a little data out spike, reaching around 60 kB. This implies that the server is processing relatively small responses such as text-based or lightweight data. Figure 7 represents the HTTP requests received (25) during the implementation period. Low requests with a slight increase in traffic, indicating that this system received very few interactions.

This implies that the second configuration of the HA is not frequently used, or it might represent a testing environment with few interactions. Figure 8 represents the average server response time in seconds. The response time is extremely high at one point, peaking at 16.57 s. Despite this spike, the overall behavior seems reasonable and indicating no major variabilities follow this spike that indicate a slow request during the implementation period due to resource contention or a heavy operation. All parts in this figure show that the Hungarian algorithm is underutilized but suffers from slow response times when it does receive requests.

Conclusion and future work

The Hungarian Algorithm works by determining the optimal assignment of tasks to VMs in such a way that the total cost is minimized while meeting deadlines and cost constraints. This was performed by creating a cost matrix that represented the cost of assigning each task to each VM and then finding the assignment that results in the lowest total cost. By incorporating the Fittest Task Population Algorithm into the process, the system can efficiently manage the population of tasks and ensure that only the most suitable tasks are considered for allocation. This helps in improving the overall performance of the cloud task scheduling environment by optimizing the allocation of tasks based on their specific requirements. The combination of the Hungarian Algorithm and the Fittest Task Population Algorithm allow for an efficient and erective task allocation process in cloud task scheduling environments, ultimately leading to better utilization of resources and improved quality of service for users. Furthermore, we conducted a sensitivity analysis to investigate the impact of different factors on the performance of the proposed approach. We varied parameters, such as the task arrival rate, resource availability, and network latency to determine the robustness and scalability of the proposed algorithm. The results show that our approach can adapt to changing conditions and effectively manage workload in dynamic and unpredictable environments. To further validate the effectiveness of our approach, we conducted case studies with actual cloud service providers and validated our algorithm in real-world scenarios. Results of these case studies confirmed the applicability and efficiency of our approach in improving cloud system performance and meeting user requirements. We aim to continue evolving and refining our scheduling approach to meet the growing demands and challenges of cloud computing environments, while providing cost-effective and reliable solutions for cloud customers. In future work, metaheuristic optimization algorithms such as Sea Lion Optimization (SLnO) (Mell & Grance, 2011) and Gray Wolf Optimizer (GWO) (Zhang, Cheng & Boutaba, 2010) will be investigated for more task scheduling constraints that enhance cloud service quality and match cloud balancing.

Supplemental Information

Data set.

DOI: 10.7717/peerj-cs.3385/supp-1

Download

Java implementation of the Hungarian Algorithm, used to handle assignment issues with either maximum profit or minimum overall cost.

DOI: 10.7717/peerj-cs.3385/supp-2

Download

The Hungarian Algorithm accurately determines the best assignment for a given cost matrix.

DOI: 10.7717/peerj-cs.3385/supp-3

Download

[1] Ahmed Z, Choudhary M, Al-Dayel I. 2024. Effects of crossover operator combined with mutation operator in genetic algorithms for the generalized travelling salesman problem. International Journal of Industrial Engineering Computations 15(3):627-644

[2] Alam S, Sagor E, Ahmed T, Haque T, Shoaib M, Ibrahim S, Shahjahan O, Rubaet M. 2022. Assessment of assignment problem using Hungarian method.

[3] Albtoush A, Farizah Y, Almi’ani K, Maizura N. 2023. Structure-aware scheduling methods for scientific workflows in cloud. Applied Sciences 13(3):3

[4] Ara R, Rahim MA, Roy S, Prodhan UK. 2020. Cloud computing: architecture, services, deployment models, storage, benefits and challenges. International Journal of Trend in Scientific Research and Development (IJTSRD) 4(4):837-842

[5] Balharith T, Alhaidari F. 2019. Round robin scheduling algorithm in CPU and cloud computing: a review.

[6] Burak C, Bharadiya J. 2023. Cloud computing forensics; challenges and future perspectives: a review. Asian Journal of Research in Computer Science 16(1):1-14

[7] Elsayed A, Almustafa K, Gebali F. 2022. A survey of security in cloud computing.

[8] Fu X, Sun Y, Wang H, Li H. 2023. Task scheduling of cloud computing based on hybrid particle swarm algorithm and genetic algorithm. Cluster Computing 26(5):2479-2488

[9] Gobichettipalayam K, Sandhiya R, Sruthi K. 2023. BTSAH: batch task scheduling algorithm based on Hungarian algorithm in cloud computing environment.

[10] He X, Shen J, Liu F, Wang B, Zhong G, Jiang J. 2022. A two-stage scheduling method for deadline-constrained task in cloud computing. Cluster Computing 25(5):3265-3281

[11] Hegde S, Srinivas D, Rajan M, Rani S, Kataria A, Min H. 2024. Multi-objective and multi constrained task scheduling framework for computational grids. Scientific Reports 14(1):6521

[12] Hu Q, Wu X, Dong S. 2023. A two-stage multi-objective task scheduling framework based on invasive tumor growth optimization algorithm for cloud computing. Journal of Grid Computing 21(2):31

[13] Hussain M, Luo M, Hussain A, Javed M, Abbas Z, Wei L. 2023. Deadline-constrained cost-aware workflow scheduling in hybrid cloud. Simulation Modelling Practice and Theory 129:102819

[14] Jingwei Z, Long C, Cong L, Zhiming Z, Ying M. 2023. Cost-aware scheduling systems for real-time workflows in cloud: an approach based on genetic algorithm and deep reinforcement learning. Expert Systems with Applications 234(1):120972

[15] Juliet M, Brindha T. 2023. Efficient resource allocation in cloud computing using Hungarian optimization in Aws. Research Square 24(06):15

[16] Khiat A, Haddadi M, Bahnes N. 2024. Genetic-based algorithm for task scheduling in fog-cloud environment. Journal of Network and Systems Management 32(1):3

[17] Kumar P, Karthikeyan S. 2024. Using genetic algorithms to optimize job scheduling in Google cloud platform.

[18] Kumar P, Surachita N, Kumar B. 2022. A pair-based task scheduling algorithm for cloud computing environment. Journal of King Saud University Computer and Information Sciences 34(1):1434-1445

[19] Lee S. 2022. Polynomial time algorithm for worker assignment problem. The Journal of the Institute of Internet, Broadcasting and Communication 22(5):159-164

[20] Mangalampalli S, Karri G. 2023. Cloud environment limitations and challenges.

[21] Manikandan N, Gobalakrishnan N, Pradeep K. 2022. Bee optimization based random double adaptive whale optimization model for task scheduling in cloud computing environment. Computer Communications 187(4):35-44

[22] Marisargunam S. 2024. Cloud service authentication based on advanced encryption standard (AES) for ensure privacy. International Journal of Advanced Trends in Engineering and Management (IJATEM) 3(4):14-30

[23] Mell P, Grance T. 2011. The NIST definition of cloud computing. National Institute of Standards and Technology 53(6):50

[24] Mishra K, Majhi S. 2021. A binary bird swarm optimization based load balancing algorithm for cloud computing environment. Open Computer Science 11(1):146-160

[25] Mohammadzadeh A, Masdari M. 2021. Scientific workflow scheduling in multicloud computing using a hybrid multi-objective optimization algorithm. Journal of Ambient Intelligence and Humanized Computing 14(4):3509-3529

[26] Naqin Z, Weiwei L, Wei F, Fang S, Xiongwen P. 2020. Budgetdeadline constrained approach for scientific workflows scheduling in a cloud environment. Cluster Computing 26(3):1737-1751

[27] Noman J, Fernand G, Peter L. 2022. Simplification of genetic programs: a literature survey. Data Mining and Knowledge Discovery 36(4):1279-1300

[28] Patel H, Kansara N. 2021. Cloud computing deployment models: a comparative study. International Journal of Innovative Research in Computer Science & Technology (IJIRCST) 9(2):45-50

[29] Rajkumar N, Kumar K, Gokul M, Durai S. 2024. Post-quantum cryptography security with cspm for secure data transmission in cloud environments.

[30] Russo GR, Marotta R, Cordari F, Quaglia F, Cardellini V, Sanzo PD. 2024. Efficient probabilistic workflow scheduling for IaaS clouds. ArXiv

[31] Sadhasivam N, Balamurugan R, Pandi M. 2018. Cancer diagnosis epigenomics scientific workflow scheduling in the cloud computing environment using an improved PSO algorithm. Asian Pacific Journal of Cancer Prevention: APJCP 19(1):243

[32] Shen Y, Chen G, Ma H, Zhang M. 2025. Gates: cost-aware dynamic workflow scheduling via graph attention networks and evolution strategy. ArXiv

[33] Singh G. 2022. A study of crossover operators in genetic algorithms. In: Frontiers in Nature Inspired Industrial Optimization. Singapore: Springer Singapore. 17-32

[34] Singh S, Jeong YS, Park JH. 2016. A survey on cloud computing security: issues, threats, and solutions. Journal of Network and Computer Applications 75:200-222

[35] Sita R, Bhambri P, Kataria A. 2023. Integration of IoT, big data, and cloud computing technologies: trend of the era.

[36] Tang X, Cao W, Tang H, Deng T, Mei J, Liu Y, Shi C, Xia M, Zeng Z. 2021. Cost-efficient workflow scheduling algorithm for applications with deadline constraint on heterogeneous clouds. IEEE Transactions on Parallel and Distributed Systems 33(9):2079-2092

[37] Tekawade A, Banerjee S. 2023. Cedces: a cost-effective deadline constrained evolutionary scheduler for task graphs in multi-cloud system.

[38] Ukwuoma H, Arome G, Thompson A, Alese B. 2022. Post-quantum cryptography-driven security framework for cloud computing. Open Computer Science 12(1):142-153

[39] Varnita L, Subramanyam K, Narasimha T, Bhandari H, Angayarkanni SA. 2024. A study on isogeny based cryptography.

[40] Vásconez JP, Schotborgh E, Vásconez IN, Moya V, Pilco A, Menéndez O, Guamán-Rivera R, Guevara L. 2024. Smart delivery assignment through machine learning and the Hungarian algorithm. Smart Cities 7(3):1109-1125

[41] Yang X, Kun Z, Jun X, Bibo T, Chen LC. 2024. Multi-source intrusion detection in cloud environments.