•
Ranking factor calculation solves the
problem with fresh node consideration for
data transfer.
Thus if we use a predictive framework for data
transfer by allocation jobs on reliable and efficient
participating nodes, it will improve the overall
performance of data transfer in Grid environment.
5 CONCLUSIONS AND FUTURE
WORK
In this paper we investigated the problem for
selecting reliable and efficient nodes by predicting
their performance before allocating parallel jobs in a
computational Grid environment. Allocating jobs to
predicted reliable nodes gives better performance
and is also more cost effective than using hot
pluggable expensive hardware. To solve this we
proposed a predictive framework with different
phases for providing information about efficient and
reliable participating nodes in the computational grid
environment. We discussed, examined and
compared various prediction techniques. We also
solved the problem of considering new nodes with
ranking factor calculations.
Although our proposed framework solves
the performance prediction problem, there may arise
a critical situation that many ranking factors will
become same for a large set of nodes in the vast
internet. In that case priority based factors might be
used. These can be any one of the measured factors
based on the network topology. Moreover, based on
the topology and the availability of the network, we
can choose either centralised or decentralised data
repository mechanism.
The extension of this work will be
improving the ranking policy by considering more
factors in the Grid environment. Improvement will
be possible in our prediction data delivery phase
with XML-based framework for better integration
with web services.
Acknowledgement: We would like to
acknowledge Mr Panu Phinjareonphan and Prof Bill
Appelbe for their valuable comments on the
experimentation.
REFERENCES
Faerman, M., Su, A., Wolski, R., and Berman, F., 1999.
Adaptive Performance Prediction for Distributed Data-
Intensive Applications. In SC’99 ACM/IEEE
conference on Supercomputing.
Globus. Globus Project. http://www.globus.org
Lee, J., Gunter, D., Stoufer, M., Tierney, B., 2002.
Monitoring Data Archives for Grid Environments. In
SC’02, ACM/IEEE conference on Supercomputing.
Liu, C., Yang, L., Foster, I., and Angulo, D., 2002. Design
and Evaluation of a Resource Selection Framework for
Grid Applications. In HPDC’02, 11th IEEE
Symposium on High-Performance Distributed
Computing.
Swany, M., Wolski, R., 2002. Multivariate Resource
Performance Forecasting in the Network Weather
Service. In SC’02, ACM/IEEE conference on
Supercomputing.
Syed, U., Yona, G., 2003. Using a Mixture of Probabilistic
Decision Trees for Direct Prediction of Protein
Function. In RECOMB’03, 7
th
Annual International
Conference on Computational Biology.
Vazhkudai, S. and Schopf, J., 2002. Predicting sporadic
grid data transfers. In HPDC’02, 11
th
IEEE
Symposium on High Performance Distributed
Computing.
Vazhkudai, S., Schopf, J.M., 2003. Using Regression
Techniques to Predict Large Data Transfers. In
IJHPCA’03, The International Journal of High
Performance Computing Application.
Vazhkudai, S., Schopf, J.M. and Foster, I, 2002.
Predicting the Performance of Wide Area Data
Transfers. In IPDPS’02, 16th International Parallel
and Distributed Processing Symposium.
Vazhkudai, S., Tuecke, S. and Foster, I., 2001. Replica
Selection in the Globus Data Grid. In CCGRID’01 1
st
IEEE/ACM International Conference on Cluster
Computing and the Grid). IEEE Press.
Wolski, R., 1997. Forecasting Network Performance to
Support Dynamic Scheduling Using the Network
Weather Service. In HPDC’97, 6th IEEE Symposium
on High Performance Distributed Computing.
Wolski, R., 1998. Dynamically Forecasting Network
Performance Using the Network Weather Service. In
JCC’98, Journal of Cluster Computing.
Wolski, R., 2003. Experiences with Predicting Resource
Performance On-line in Computational Grid Settings.
In ACM SIGMETRICS PER’03, ACM SIGMETRICS
Performance Evaluation Review.
Wolski, R., Spring, N., Hayes, J., 1999. The Network
Weather Service: A Distributed Resource Performance
Forecasting Service for Metacomputing. In JFGCS’99,
Journal of Future Generation Computing Systems.
Trace-Data.
http://www.csm.ornl.gov/~vazhkuda/GridFTP-
Predictions.zip
Zhang, X., Freschl, J., Schopf, J., 2003. , A Performance
Study of Monitoring and Information Services for
Distributed Systems. In HPDC’03, 12th IEEE
International Symposium on High Performance
Distributed Computing.
PREDICTING THE PERFORMANCE OF DATA TRANSFER IN A GRID ENVIRONMENT
181