15-745 Spring 2007 Papers for in-class discussions
Basic Information
The readings will be organized as follows:
You will form pairs and pick a half-lecture slot in one of the
lectures between Feb 22 and March 8. A sign-up sheet is on Seth's
door; it will also be circulated in class during the lecture on
Tuesday Feb 20.
Each pair will pick one of the research areas below, and will
discuss two of the papers from that research area. Some areas have
more than two papers, and you can pick any two you like. However, it
is advisable to pick the first paper as one of your two, since it is
usually the most important (please let us know which ones you pick, so
we can let the rest of the class know in advance). Each pair will have
40 minutes total, of which the first 25-30 minutes will be devoted to
them presenting the papers, and the last 10-15 mins will be for
discussion.
Seth will make the first such presentation, on Tuesday Feb 20, on
"Exotic Architectures", so you know what structure to follow in your
own presentations.
Before each class from Feb 22 onwards, all students will be
responsible for turning in a small summary of one paper each from
the two groups being presented that day (so, you will be required
to summarize a total of two papers in each class). Your summary should
include 3 bullet points about the paper's main contributions and
approach, 3 bullet points about what you think the paper's strengths
are, and 3 bullet points about where you think the weaknesses
lie. Email a pdf to Mahim before class.
Our hope is that reading a collection of new and classic research papers
will give you some ideas for your class projects, so keep your thinking
caps on during the readings, presentations and discussions :)
Papers: vouched for so far
The papers in a light font were part of
the group but will not be discussed in class and should not be
summarized.
Steve Swanson, Ken Michelson, Andrew Schwerin and Mark Oskin.
WaveScalar
In the 36th Annual International Symposium on Microarchitecture
(MICRO-36), December 2003
Optimizing Compiler for the Cell Processor PACT 2005.
(A. Eichenberger, K. O'Brien, K. O'Brien,
P. Wu, T. Chen, P. Oden, D. Prener, J. Shepherd, B. So,
Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind.) (available here)
nVidia Corporation. nVidia
CUDA programming guide, version 0.8, Feb 12, 2007. (Read sections
2 and 3, skim sections 4.2 and 6, and look at the other sections if
you have time.)
Rumi Zahir, Jonathan Ross, Dale Morris and Drew Hess. ``OS and
Compiler Considerations in the Design of the IA-64 Architecture,''
in Proceedings of the 9th International Conference on Architectural
Support for Programming Languages and Operating Systems, pages 212-221,
November 2000. p212-zahir.pdf
Cooper, K. D., Grosul, A., Harvey, T. J., Reeves, S., Subramanian,
D., Torczon, L., and Waterman, T. ACME: adaptive compilation made efficient. In Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools For Embedded Systems (Chicago, Illinois, USA, June 15 - 17, 2005). LCTES '05. ACM Press, New York, NY, 69-77.
Ding, C. and Zhong, Y. Predicting whole-program locality through reuse distance analysis. In Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation (San Diego, California, USA, June 09 - 11, 2003). PLDI '03. ACM Press, New York, NY, 245-257.
Rakesh Ghiya and Laurie J. Hendren. ``Is it a Tree, a DAG, or a
Cyclic Graph? A Shape Analysis for Heap-Directed Pointers in C,''
in Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on
Principles of
Programming Languages, St. Petersburg, Florida, pages 1-15, January
1996. p1-ghiya.pdf
Rastislav Bodik, Rajiv Gupta and Mary Lou Soffa. ``Load-Reuse
Analysis: Design and Evaluation,'' in Proceedings of the ACM SIGPLAN
'99 Conference on Programming Language Design and Implementation, pages
64-76. May 1999. p64-bodik.pdf
Raymond Lo, Fred Chow, Robert Kennedy, Shin-Ming Liu and Peng Tu.
``Register Promotion by Sparse Partial Redundancy Elimination of Loads
and Stores,'', in Proceedings of the ACM SIGPLAN '98 Conference on
Programming Language Design and Implementation, pages 26-37, June 1998.
p26-lo.pdf
Robert Cohn and P. Geoffrey Lowney. ``Hot Cold Optimization of
Large Windows/NT Applications,'' in Proceedings of the 29th Annual
IEEE/ACM International Symposium on Microarchitecture, pages 80-89, December 1996.
p80-cohn.pdf
Aggressive Inlining. Andrew Ayers, Richard Schooler, Robert Gottlieb. PLDI '97
Profiling 2
James R. Larus. ``Whole Program Paths,'' in Proceedings
of the ACM SIGPLAN '99 Conference on Programming Language Design and
Implementation, pages 259-269, May 1999.
p259-larus.pdf
Glenn Ammons and James R. Larus. ``Improving Data-Flow Analysis
with Path Profiles,'' in Proceedings of the ACM SIGPLAN '98
Conference on Programming Language Design and Implementation,pages
72-84, June 1998. p72-ammons.pdf
Matthew Arnold, Stephen Fink, David Grove, Michael Hind and Peter
F. Sweeney. ``Adaptive Optimization in the Jalapeno JVM,'' in
Proceedings of the Conference on Object-Oriented Programming, Systems,
Languages, and Applications, pages 47-65, October 2000.
p47-arnold.pdf
Vasanth Bala, Evelyn Duesterwald and Sanjeev Banerjia.
``Dynamo: a Transparent Dynamic Optimization System,''
in Proceedings
of the ACM SIGPLAN '00 Conference on Programming Language Design and
Implementation, pages 1-12, June 2000.
p1-bala.pdf
Brian Grant, Matthai Philipose, Markus Mock, Craig Chambers and Susan
J. Eggers. ``An Evaluation of Staged Run-Time Optimizations in
DyC,'' in Proceedings of the ACM SIGPLAN '99 Conference on
Programming Language Design and Implementation, pages 293-304, May
1999.
p656-grant.pdf
Alexander Klaiber. ``The Technology Behind Crusoe
Processors,'' Transmeta Corporation.
paper_aklaiber_19jan00.pdf
Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald
D. Barnes, and Wen-mei W. Hwu. ``A Hardware Mechanism for Dynamic
Extraction and Relayout of Program Hot Spots,'' in Proceedings
of the
27th International Symposium on Computer Architecture, pages 59-70,
June 2000.
p59-merten.pdf
Y. Zhu, G. Magklis, M. L. Scott, C. Ding, D. H. Albonesi,
The Energy
Impact of Aggressive Loop Fusion,
Proceedings of the 13th International Conference on Parallel Architectures
and Compilation Techniques (PACT'04), September 2004.
M. Kandemir, N. Vijaykrishnan, M.J. Irwin,
Compiler Optimizations for Low Power Systems, Chapter 10: Power Aware
Computing, Ed. by R.
Graybill, R. Melhem, Kluwer Academic / Plenum Publishers, 2002.
Hazelwood, K. and Grove, D. Adaptive online context-sensitive inlining. In Proceedings of the international Symposium on Code Generation and Optimization: Feedback-Directed and Runtime Optimization (San Francisco, California, March 23 - 26, 2003). ACM International Conference Proceeding Series, vol. 37. IEEE Computer Society, Washington, DC, 253-264.