Estimating GPU Speedups for Programs Without Writing a Single Line of GPU Code
Loading...
Date
Authors
Ardalani, Newsha
Sankaralingam, Karthikeyan
Zhu, Xiaojin
Advisors
License
DOI
Type
Technical Report
Journal Title
Journal ISSN
Volume Title
Publisher
Grantor
Abstract
Heterogeneous processing using GPUs is here to stay and today spans
mobile devices, laptops, and supercomputers. Although modern
software development frameworks like OpenCL and CUDA serve as a high
productivity environment, software development for GPUs is time
consuming. First, much work needs to be done to restructure software
and data organization to match the GPU's many-threaded programming
model. Second, code optimization is quite time consuming and
performance analysis tools require significant expertise to use
effectively. Third, until the final optimized code has been derived,
it is almost impossible today to know what performance advantage
will be provided by porting a code to a GPU. This paper focuses on
this last question and seeks to develop an automated ``performance
prediction'' tool that can provide accurate estimate of GPU speedup
when provided a piece of CPU code prior to developing the GPU
code.
Our paper is built on two insights: i) Ultimately speedup
on a GPU for a piece of code is dependent on fundamental
microarchitecture-independent program properties like available
parallelism, branching behavior etc. ii) By examining a vast array
of previously implemented GPU codes along-with their CPU
counterpart, we can use machine learning to learn this correlation
between program properties and GPU speedup. In this paper, we use
linear regression, specifically, a technique inspired by
regularized regression, to build a model for speedup prediction for GPUs.
When applied to a never-seen test data selected randomly
from Rodinia, Parboil, Lonestar and Parsec benchmark suites,
as test data (speedup range of 5.9X to $276X our tool makes accurate predictions with an average
weighted error of 32%. Our technique is also
robust - the errors remain similar across other ``unseen'' GPU
platforms we test on. Essentially, we deliver an automated tool that
programmers can use to estimate potential GPU speedup before writing
any GPU code.
Description
Keywords
Related Material and Data
Citation
TR1811