surveys of grid scheduling algorithms are proposed
in (Maruthanayagam, 2010); (Jiang et al., 2007) and
performance of some priority rule scheduling
algorithms is presented in (Azmi, 2011). DIET
platform (Marrow et al., 2003) is a GridRPC
middleware relying on the client/agent/server
paradigm. The scheduling on DIET changes from
FIFO, Round Robin and CPU-based scheduling. But
the operation of DIET platform is different with
pilot-agent platform: DIET use both “push” and
“pull” scheduling. Mandatory requests are pushed
from clients to resources, whereas optional requests
are pulled by resources from clients. Pilot-agent
platform takes most scheduling decisions in a
centralized agent, in contrast, each client and each
server contributes to taking scheduling decisions in
DIET. Therefore, the solutions brought by research
of scheduling problem on DIET platform are not
directly applicable to our problem statement.
In (Berman et al., 1996), author presents a
scheduling solution in application level called
AppLeS. They describe an application specific
approach to scheduling individual parallel
applications on production heterogeneous systems.
They utilize comprehensive information about
application and resource to optimize execution time
of application on grid. Our goal is not to optimize
the execution time of all users but the quality of
service for each user.
Existing pilot agent platforms such as DIANE
(Mościcki, 2003), WPE(Kasam et al., 2009) and
DIRAC (van Herwijnen et al., 2003) have different
scheduling policies: WPE and DIANE platforms
use FIFO while DIRAC uses Round Robin policy.
The VS projects have specific properties such as
divisibility in many docking tasks and no order of
execution constraints. Therefore we need to find a
suitable online scheduling policy for the VS
application on the pilot-agent platform. Fortunately,
in some platform such as DIRAC platform, we can
configure the specific scheduling policy for a user
group sharing the same application. So we can apply
suitable policy in a VS user group to improve
fairness.
2.2 Cloud Scheduling
As mentioned earlier, the limited machine
availability property of the scheduling problem on
pilot agent platform is similar with scheduling on
cloud environment because on cloud environment,
user buys some resources with limited duration.
When a VS project is deployed on an IAAS cloud,
docking task will be executed on a virtual machine
with limited availability.
Some researches on cloud scheduling such as
(Pandey et al., 2010); (Li et al., 2011) have
presented their scheduling algorithms on cloud to
optimize the speed of resources allocation, the price
to pay and the utilization of system resource. But our
object is optimization of the fairness of users when
they share pilot-agent platform together.
In (Luckow et al., 2010), author proposed the
design and implementation of a SAGA-based Pilot-
Job system, which supports a wide range of
application types, and is usable over a broad range
of infrastructures from grids/clusters to cloud
computing. In (Fifield et al., 2011), author showed
also an extension of the pilot agent platform DIRAC
on cloud computing by submitting pilot agent on
Virtual Machine on cloud such as Amazon EC2.
Therefore our research is also relevant to pilot-agent
platforms on Cloud environments.
2.3 Scheduling for Stretch
Optimization with Limited
Machine Availability Constraints
Many groups have conducted research on optimizing
job-centric stretch in the context of dedicated
machines (i.e. always available). In (Muthukrishnan
et al., 1999), S. Muthukrishnan presented the
efficiency of the optimal on-line algorithm SPT on
uniprocessor and multi-processor. Their objective is
optimizing the average of the stretch. In (Legrand et
al., 2006), Legrand has shown that SPT is quite
effective at max-stretch and sum-stretch
optimization in problems with continuous machines.
But compared to these studies, our scheduling
problem uses a user-centric definition of stretch and
adds an additional constraint: machines have limited
availability. With this property, the number of
machines available for platform changes over time
and the complexity of problem increases. In
(Schmidt, 2000), authors have reviewed some
scheduling algorithm in the context of limited
machine availability. LPT is one of the online
scheduling algorithms proposed in this research. But
these researches are done on system-centric metrics
(makespan, sum of completion time…). In our latest
research for scheduling for stretch optimization with
limited machine availability constraints, we
compared two well-known scheduling policies, SPT
and LPT, to the scheduling policies currently used
on the existing platforms (FIFO and Round Robin).
Simulation result and experimentation result on real
platform showed that SPT policy is the best policy in
these 4 policies for optimization of user stretch in
CLOSER2014-4thInternationalConferenceonCloudComputingandServicesScience
202