- Seth Goldstein (Tuesday, 2/10)
Lazy Threads: Implementing a Fast Parallel Call
Seth Copen Goldstein, Klaus Erik Schauser, and David E. Culler. Journal of Parallel and Distributed Computing. 1996.
- Learning a Better Compiler
[Jim Cipar] (Tuesday, 2/10)
- Mark Stephenson and Saman Amarasinghe. Predicting Unroll Factors Using Supervised Classification. In CGO 2005.
- AbouGhazaleh, N., Ferreira, A., Rusu, C., Xu, R., Liberato, F., Childers, B., Mosse, D., and Melhem, R.
Integrated CPU and l2 cache voltage scaling using machine learning. In LCTES 2007.
- Calder, B., Grunwald, D., Jones, M., Lindsay, D., Martin, J., Mozer, M., and Zorn, B. Evidence-based static branch prediction using machine learning. ACM Trans. Program. Lang. Syst. 19, 1 (Jan. 1997)
- Compiling for Multicore
Jeehyung Lee, 2-12
- Dynamic Compilation/Optimization Rob Arnold, 2-18
-
Michael D. Smith.
"Overcoming the Challenges to
Feedback-Directed Optimization,"
Proc. ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and
Optimization (Dynamo'00), invited lecture, Boston, MA, January 18, 2000.
-
Matthew Arnold, Stephen Fink, David Grove, Michael Hind and Peter
F. Sweeney. ``Adaptive Optimization in the Jalapeno JVM,'' in
Proceedings of the Conference on Object-Oriented Programming, Systems,
Languages, and Applications, pages 47-65, October 2000.
p47-arnold.pdf
-
Steve S.W. Liao, Perry H. Wang, Hong Wang, Gerolf Hoflehner,
Daniel Lavery, John P. Shen.
Post-pass binary adaptation for
software-based speculative precomputation
PLDI 2002.
-
Vasanth Bala, Evelyn Duesterwald and Sanjeev Banerjia.
``Dynamo: a Transparent Dynamic Optimization System,''
in Proceedings
of the ACM SIGPLAN '00 Conference on Programming Language Design and
Implementation, pages 1-12, June 2000.
p1-bala.pdf
-
Brian Grant, Matthai Philipose, Markus Mock, Craig Chambers and Susan
J. Eggers. ``An Evaluation of Staged Run-Time Optimizations in
DyC,'' in Proceedings of the ACM SIGPLAN '99 Conference on
Programming Language Design and Implementation, pages 293-304, May
1999.
p656-grant.pdf
-
Alexander Klaiber. ``The Technology Behind Crusoe
Processors,'' Transmeta Corporation.
paper_aklaiber_19jan00.pdf
-
Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald
D. Barnes, and Wen-mei W. Hwu. ``A Hardware Mechanism for Dynamic
Extraction and Relayout of Program Hot Spots,'' in Proceedings
of the
27th International Symposium on Computer Architecture, pages 59-70,
June 2000.
p59-merten.pdf
-
cliff-asplos94.ps
-
Trishul M. Chilimbi and Martin Hirzel,
``Dynamic hot data stream prefetching
for general-purpose programs,''
Proceedings of the ACM SIGPLAN 2002 Conference on Programming language
design and implementation, pages 199-209, 2002.
- Pointer Analysis Will Klieber, 2-18
-
Ghiya, Lavery, and Sehr, "On the importance of
points-to analysis and other memory
disambiguation methods for C programs" PLDI 2001
- Hardekopf, B. and Lin, C. The ant and the grasshopper: fast and accurate pointer analysis for millions of lines of code. In PLDI 2007.
-
Nevin Heintze and Olivier Tardieu, "Ultra-fast aliasing
analysis using CLA: a million lines of C code in a second" PLDI 2001
-
Rakesh Ghiya and Laurie J. Hendren. ``Is it a Tree, a DAG, or a
Cyclic Graph? A Shape Analysis for Heap-Directed Pointers in C,''
in Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on
Principles of
Programming Languages, St. Petersburg, Florida, pages 1-15, January
1996. p1-ghiya.pdf
-
Lattner and Adve.
Automatic pool allocation: improving performance by controlling data structure layout in the heap.
PLDI'05.
- GPUs and memory hierarchy Matt Stanton, Thurs 2-18
-
Optimizing Compiler for the Cell Processor PACT 2005.
(A. Eichenberger, K. O'Brien, K. O'Brien,
P. Wu, T. Chen, P. Oden, D. Prener, J. Shepherd, B. So,
Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind.)
- Brook for GPUs: stream computing on graphics hardware.
(Buck, I., Foley, T., Horn, D., Sugerman, J., Fatahalian, K., Houston, M., and Hanrahan, P.)
In ACM SIGGRAPH 2004 Papers
-
nVidia Corporation. nVidia
CUDA programming guide, version 1.1 (Read sections
2 and 3, skim sections 4.2 and 6, and look at the other sections if
you have time.)
- Object-Oriented Optimizations Keith Bare, 2-19
- Hauswirth, M., Sweeney, P. F., Diwan, A., and Hind, M. Vertical profiling: understanding the behavior of object-priented applications. In Proceedings of the 19th Annual ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Applications (Vancouver, BC, Canada, October 24 - 28, 2004).
- Hazelwood, K. and Grove, D. Adaptive online context-sensitive inlining. In Proceedings of the international Symposium on Code Generation and Optimization: Feedback-Directed and Runtime Optimization (San Francisco, California, March 23 - 26, 2003). ACM International Conference Proceeding Series, vol. 37. IEEE Computer Society, Washington, DC, 253-264.
- T Yasue, T Suganuma, H Komatsu, and T Nakatani. An efficient online path profiling framework for Java just-in-time compilers
In PACT 2003. Page(s):148 - 158
- Proving Compilers Correct Q Youn Hong, 2-25
- Software PipeliningVinod Chandrasekaran, 2-25
-
Mikhail Smelyanskiy, Scott Mahlke, Edward Davidson,
Probabilistic
Predicate-Aware Modulo Scheduling
in CGO 2004
-
Hongbo Rong, Zhizhong Tang,
R. Govindarajan, Alban Douillet, Guang Gao,
Single-Dimension Software Pipelining for Multi-Dimensional
Loops in CGO 2004
-
Hongbo Rong, Alban Douillet, R. Govindarajan, Guang Gao
Code Generation for Single-Dimension
Software Pipelining of Multi-Dimensional Loops
in CGO 2004
- Transactional Memory Iulian Moraru, 2-26
- Adl-Tabatabai, A., Lewis, B. T., Menon, V., Murphy, B. R., Saha,
B., and Shpeisman, T. Compiler and runtime support for efficient software transactional memory. SIGPLAN Not. 41, 6 (Jun. 2006), 26-37.
- Tim Harris, Mark Plesko, Avraham Shinnar, and David Tarditi. Optimizing Memory Transactions. In PLDI 2006.
- SIMD / Vector
John Bauman, 2-26
-
Vectorization for SIMD architectures with alignment constraints
PLDI 2004. (A. Eichenberger, P. Wu, K. O'Brien)
-
Efficient SIMD Code Generation for Runtime Alignment and Length Conversion
CGO 2005. (P. Wu, A. Eichenbreger, A. Wang)
-
Integrated Simdization Framework with Virtual Vectors
ICS 2005. (P. Wu, A. Eichenberger, A. Wang, P. Zhao)
- All are at http://domino.research.ibm.com/comm/research_projects.nsf/pages/cellcompiler.refs.html
- Code Verification Miguel Silva, 2-26