Documentos de Académico
Documentos de Profesional
Documentos de Cultura
Roberto Camacho B.
September 4, 2015
are clustered into classes based on the execution time, and the average euclidean distance is used to predict the execution time. Various
machine learning algorithms were also implemented to compare the
accuracy of the prediction.
[8] is important to my research because they developed a system that
automatically extracts a large number of features from program execution using Sparse Polynomial regression, which I will might try to use
to perform feature selection. This paper was published in Advances in
Neural Information Processing Systems 23.
[9] presents methods based in time delay neural network and regression
for prediction of workload in Grid and Cloud platforms. This paper
was published in the Proceedings of the International Conference on
Computer and Information Technology.
[10] presents a method which combines analytic benchmarking and
statistical analysis techniques. The benchmarking is similar to the
training phase of a machine learning algorithm. This paper was published in the IEEE Heterogeneous Computing Workshop.
[11] shows a model created using the least-squares method with the
results obtained from the measurement of various configurations. This
article was published in the Journal of Parallel Computing.
[12] presents a framework called Mantis which is used to predict the
resource consumption of Android applications. This framework uses
machine learning and program analysis techniques to obtain accurate
predictions. Several works on computational offloading have used this
framework to obtain a prediction. This paper was published on the
IEEE Transactions on Mobile Computing.
[13] proposes a method to predict execution time for Grid applications
using machine learning models based on multilayer perceptron. The
authors also use a principal feature selection algorithm to select the
most important runtime features for prediction. This paper appears
in the Proceedings of the 8th Workshop on Workflows in Support of
Large-Scale Science.
[14] uses machine learning algorithms to predict resource usage, with
a tree-based approach named PQR2. It also provides several good
references. This article was published in the IEEE/ACM International
Conference on Cluster, Cloud and Grid Computing.
3
[15] is from the same author of [14], however this is a dissertation and
contains a longer explanation of their method. Many more papers are
referenced. The only negative of these two publications is that they
are both from 2010, so it is important to look in the Cited by
[16] presents a method to estimate execution time in parallel devices.
This method is based in code profiling, to estimate the percentage
of parallel regions on a program, and Amdahls Law, to predict the
performance of this application for a different machine with a different
number of cores. This paper was presented in the Proceedings of the
ACM/SPEC international conference on performance engineering.
[17] introduces a method to predict execution time of applications
using an historical records and an instance-based learning technique,
which is one of the techniques I want to implement in my research
since it is benefited from clustering. This article was published by
NASA.
[18] presents a mathematical approach to predict the expectation of the
maximum execution time of all processors on a parallel system, since
this will be the total execution time for a parallel application. This
paper is presented in IEEE Transactions on Parallel and Distributed
Systems.
[19] proposes an approach based on augmented code profiling and
benchmark analysis used to generate Representative Set of Templates (RST) which represent the execution behavior of a task with
different inputs. This paper was presented in the International Conference on Parallel Processing.
[20] uses parametric code profiling and parametric benchmarking techniques. It also incorporates the concept of an architecture independent computation model to estimate the execution times. This is a
very different approach but reading should be useful for the Previous
Work section. This paper was published in the IEEE Heterogeneous
Computing Workshop.
Bibliography
[1] Ali Arshad, Ashiq Anjum, Julian Bunn, Richard Cavanaugh, Frank Van
Lingen, Muhammad Atif Mehmood, Harvey Newman, Conrad Steenberg, and Ian Willers. Predicting the Resource Requirements of a Job
Submission. Computing in High Energy Physics, pages 750753, 2004.
[2] Fabian Brosig, Fabian Gorsler, Nikolaus Huber, and Samuel Kounev.
Evaluating approaches for performance prediction in virtualized environments. Proceedings - IEEE Computer Societys Annual International Symposium on Modeling, Analysis, and Simulation of Computer
and Telecommunications Systems, MASCOTS, pages 404408, 2013.
[3] Maria Chtepen, Filip HA Claeys, Bart Dhoedt, Filip De Turck, Jan
Fostier, Piet Demeester, and Peter A Vanrolleghem. Online execution
time prediction for computationally intensive applications with periodic
progress updates. The Journal of Supercomputing, 62(2):768786, 2012.
[4] Bogdan Florin Cornea and Julien Bourgeois. A framework for efficient
performance prediction of distributed applications in heterogeneous systems. The Journal of Supercomputing, 62:16091634, 2012.
[5] Murthy V Devarakonda and Ravishankar K Iyer. Predictability of process resource usage: A measurement-based study on unix. Software
Engineering, IEEE Transactions on, 15(12):15791586, 1989.
[6] Archana Ganapathi, Yanpei Chen, Armando Fox, Randy Katz, and
David Patterson. Statistics-driven workload modeling for the cloud.
pages 8792, 2010.
[7] Gowthami Manogna Gottipati. Predicting Runtimes for Production
Parallel Jobs. Master, Indian Institute of Science, 2012.
[8] Ling Huang, Jinzhu Jia, Bin Yu, Byung-Gon Chun, Petros Maniatis,
and Mayur Naik. Predicting execution time of computer programs using
sparse polynomial regression. pages 883891, 2010.
[9] Md Toukir Imam, Sheikh Faisal Miskhat, Rashedur M. Rahman, and
M. Ashraful Amin. Neural network and regression based processor
load prediction for efficient scaling of grid and cloud resources. 14th
International Conference on Computer and Information Technology,
ICCIT 2011, (Iccit):333338, 2011.
[10] Michael A Iverson, Fusun Ozguner, and Lee C Potter. Statistical prediction of task execution times through analytic benchmarking for scheduling in a heterogeneous environment. pages 99111, 1999.
[11] Yoshinori Kishimoto and Shuichi Ichikawa. Optimizing the configuration of a heterogeneous cluster with multiprocessing and execution-time
estimation. Parallel Computing, 31(7):691710, 2005.
[12] Yongin Kwon, Sangmin Lee, Hayoon Yi, Donghyun Kwon, Seungjun
Yang, Byung-gon Chun, Ling Huang, Petros Maniatis, Mayur Naik,
and Yunheung Paek. Mantis: Efficient Predictions of Execution Time,
Energy Usage, Memory Usage and Network Usage on Smart Mobile
Devices. IEEE Transactions on Mobile Computing, 6(1):11, 2014.
[13] Muhammad Junaid Malik, Thomas Fahringer, and Radu Prodan. Execution time prediction for grid infrastructures based on runtime provenance data. In Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science - WORKS 13, pages 4857, 2013.
[14] Andrea Matsunaga and Jose AB Fortes. On the use of machine learning
to predict the time and resources consumed by applications. pages 495
504, 2010.
[15] Andrea Matsunaga. Automatic enablement, coordination and resource
usage prediction of unmodified applications on clouds. PhD thesis, University of Florida, 2010.
[16] Achille Peternier, Walter Binder, Akira Yokokawa, and Lydia Chen.
Parallelism profiling and wall-time prediction for multi-threaded applications. pages 211216, 2013.
[17] Warren Smith and Parkson Wong. Resource selection using execution
and queue wait time predictions. NASA Ames Research Center TR
NAS-02-003, 2002.
6