Reading
- Day 1
- Recommended:
- Stephen M. Trimberger. Three
Ages of FPGAs: A Retrospective on the First Thirty Years of FPGA
Technology. In Proceedings of the IEEE, Volume 103, Number 3,
p. 318--331, March 2015.
- Louise H. Crockett, Ross A. Elliot, Martin A. Enderwitz, and
Robert W. Stewart. Chapters 1, 2, 6, In Zynq Book.
- Supplemental:
- Xilinx Zynq Overview
- Day 2
- Day 3
- Recommended:
- Ulrich Drepper, What
Every Programmer Should Know about the Memory, 2007 Chapters 1--3.2
- Supplemental:
- Brent Keeth and Jacob Baker, DRAM Circuit Design: A Tutorial,
IEEE Press, 2001, pp. 108--116.
- Ashok K. Sharma, Semiconductor Memories, IEEE Press, 1997,
pp. 230--248.
- Day 4
- Recommended:
- André DeHon. Chapter 5,
In
Reconfigurable Computing: The Theory and Practice of FPGA-Based
Computation, Elsevier, 2008. See Canvas site.
- Supplemental:
- TBD
- Day 5
- Recommended:
- Gilles Kahn. The Semantics of
a Simple Language for Parallel Processing. In Information
Processing, pages 471--475, 1974.
(modern
reformat; more readable ... also link above broken)
- Supplemental:
- Edward A. Lee and David G. Messerschmitt. Synchronous
Data Flow. In Proceedings of the IEEE, Volume 75, Number
9, pp. 1235--1245, September, 1987.
- André DeHon, Yury Markovsky, Eylon Caspi, Michael Chu,
Randy Huang, Stylianos Perissakis, Laura Pozzi, Joseph Yeh, and John
Wawrzynek.
Stream Computations Organized for Reconfigurable Execution,
In Microprocessors and Microsystems, Volume 30, Number 6,
Pages 334--354, September 2006. See Canvas site.
- Edward A. Lee. and Albert Sangiovanni-Vincintelli.
A Framework
for Comparing Models of Computation. In IEEE Transactions on
Computer-Aided Design, Volume 17, Number 12, pp. 1217--1229, December, 1998.
- Day 6
- Recommended:
- W. Daniel Hillis and Guy L. Steele Data Parallel
Algorithms. In CACM 29(12)1170--1183, 1986.
- John Wawrzynek, Krste Asanovic, Brian Kingsbury, David
Johnson, James Beck, and Nelson Morgan.
Spert-II: A Vector Microprocessor System, IEEE
Computer, 1996.
- Supplemental:
- R.M. Russell, The CRAY-1
Computer System. In CACM
21(31)63--72, 1978.
- Day 7
- Day 8
- Day 9
- Recommended:
- Ben Lickly, Isaac Liu, Sungjun Kim, Hiren Patel, Stephen
A. Edwards, and Edward A. Lee. Predictable
programming on a precision timed architecture, In
Proceedings of the International Conference on Compilers,
Architectures and Synthesis for Embedded Systems (CASES), pp. 137--146, 2008.
- Supplemental:
- TBD
- Day 10
- Recommended:
- Timothy J. Callahan and André DeHon. Chapter 7.
In
Reconfigurable Computing: The Theory and Practice of FPGA-Based
Computation, Elsevier, 2008. See Canvas site.
- Louise H. Crockett, Ross A. Elliot, Martin A. Enderwitz, and
Robert W. Stewart. Chapters 14, 15, In Zynq Book.
- Supplemental:
- HLSynth TRCAD 2011
- Day 11
- Day 12
- Recommended:
- Haiqian Yu and Miriam Leeser. Automatic Sliding Window
Operation Optimization for FPGA-Based Computing Boards. In
Proceedings of the International Symposium on Field-Programmable
Custom Computing Machines, pp. 76--88, 2006.
- Louise H. Crockett, Ross A. Elliot, Martin A. Enderwitz, and
Robert W. Stewart. Chapter 9.3, 10 In Zynq Book.
- Supplemental:
- Louise H. Crockett, Ross A. Elliot, Martin A. Enderwitz, and
Robert W. Stewart. Chapter 19 In Zynq Book.
- Eric S. Chung, James C. Hoe and Ken Mai. CoRAM: an
in-fabric memory architecture for FPGA-based computing, in
Proceedings of the International Symposium on Field-Programmable
Gate Arrays, pp. 97--106, 2011.
- Andrew S. Huang and John P. Shen. ``A Limit Study of Local Memory
Requirements Using Value Reuse Profiles'', in Proceedings of
MICRO-28, pp. 71--81, December, 1995.
- Edward A. Lee. The
Problem with Threads. In IEEE Computer, Volume 39, Issue
5, pp. 33--42, 2006.
- Day 13
- Day 14
- Recommended:
- Joseph A. Fisher. Very Long Instruction Word Architectures and the ELI-512. In Proceedings of the International Symposium on Computer
Architecture, 1983.
- Ilian Tili, Kalin Ovtcharov, and J. Gregory Steffan.
Reducing the
Performance Gap between Soft Scalar CPUs and Custom Hardware with
TILT. In ACM Transactions on Reconfigurable Technology and
Systems, Vol. 10, No. 3., July 2017, Article No. 22.
- Supplemental:
-
Joseph A. Fisher. ``Retrospective: Very Long Instruction Word Architectures
and the ELI-512,'' In 25 Years of the International Symposia on
Computer Architecture: Selected Papers, pp. 34--36, 1998.
- PICO (Program In, Chip Out): Automatically Designing Custom Computers.
In IEEE Computer, vol. 35, no. 9, pp. 39-47, September 2002.
- Day 15
- Day 16
- Recommended:
- See project handout.
- Supplemental:
- TBD
- Day 17
- Recommended:
- Udit Dhawan and André DeHon.
Area-Efficient Near-Associative Memories on FPGAs.
In ACM Transactions on Reconfigurable Technology and Systems
(TRETS),
Volume 7, Number 4, DOI: 10.1145/2629471, January, 2015.
- Supplemental:
- Adam Kirsch and Michael Mitzenmacher. 2010. The power of one move: Hashing schemes for hardware. IEEE/ACM Trans. Networking 18, 6 (2010), 1752–1765.
- Day 18
- Recommended:
- Matteo Frigo and Steven G. Johnson, The Design
and Implementation of FFTW3. In Proceedings of the IEEE,
vol. 93, no. 2, pp. 216--231, 2005. (This focusses on the design space.)
- Supplemental:
- Steven W. Smith, The Scientist and Engineer's Guide to
Digital Signal Processing,
(This is for background on the Fourier Transform and FFT.
Chapter 12 is FFT. May
want to go back to Chapter 8
for DFT introduction.)
- Ryan Kastner, Janarbek Matai, and Stephen Neuendorffer. Chapter
4 and 5. In
Parallel
Programming for FPGAs (This looks specifically at writing
FFTs in C for HLS.)
- R. Clint Whaley, Antoine Petitet, and Jack Dongarra. Automated
Empirical Optimization of Software and the ATLAS Project. In
Parallel Computing, Volume 27, pp. 3--35, February, 2001. (This
is automated desgin-space exploration for matrix multiplication.)
- Day 19
- Day 21
- Recommended:
- Edin Kadric, David Lakata, André DeHon, Impact of Parallelism and
Memory Architecture on FPGA Communication Energy. In ACM
Trans. on Reconfigurable Technology and Systems, 2016.
- Supplemental:
- Rajesh Kumar and K. Bernstein. Scaling, power, and the future of
CMOS. In Technical Digest of the IEEE International Electron Device
Meeting, December 2005, pp. 7--15.
-
Hadi Esmaeilzadeh, Emily Blem, Rene St. Amant, Karthikeyan Sankaralingam, and Doug Burger
Dark silicon and the end of multicore scaling'' in
Proceedings of the International Symposium on Computer
Architecture, 2011, pp. 365--376.
- Borivoje Nikolic. Design
in the Power-limited Scaling Regime. In IEEE Transaction on
Electron Devices, Volume 55, Number 1, pp. 71--83, 2008.
- David J. Frank. ``Power-constrained CMOS Scaling Limits,''
IBM Journal of Research and Development, Volume 46, Number 2/3,
March/May, 2002. See Canvas site.
- S. Hanson, B. Zhai, K. Bernstein, D. Blaauw, A. Bryant,
L. Chang, K. K. Das, W. Haensch, E. J. Nowak, and D. M. Sylvester.
``Ultralow-voltage, minimum-energy CMOS,''
IBM Journal of Research and Development, Volume 50, Number 4/5,
pp. 471--490, July/September 2006.
- David Bol, Renaud Ambroise, Denis Flandre, and Jean-Didier Legat.
``Interests and Limitations of Technology Scaling for Subthreshold
Logic,'' In IEEE Trans. on VLSI Systems, Vol. 17, No. 10,
pp. 1508--1519, 2009.
- Anantha P. Chandrakasan, Samuel Sheng, and Robert W. Brodersen,
``Low-Power CMOS Digital Design,''
in IEEE Journal of Solid State Circuits, 1992, Vol. 27, No. 4,
pp. 473--484.
- Richard P. Feynman. Feynman Lectures on Computation.
Westview Press, 1999. Chapter 5: Reversible Computation and
the Thermodynamics of Computation, pp. 137--184.
- Day 22
- S. Devadas, H-K. T. Ma., and A. R. Newton.
On the verification of
sequential machines at differing levels of abstraction,
IEEE Transactions on Computer-Aided Design of Integrated Circuits,
Volume 7, Number 6, pp. 713--722, 1988.
- E. M. Clarke, E. A. Emerson, and A. P. Sistia.
Automatic
verification of finite-state concurrent systems using temporal logic
specifications, ACM Transactions on Programming Languages and
Systems, Volume 8, Number 2, pp. 244--263, 1986.
- Day 23
- Day 24
- Day 25
- Recommended:
- William J. Dally and Brian Towles,
Route Packets, Not Wires: On-Chip Interconnection Networks.
In Proceedings of the Design Automation Conference, pp. 684-689,
2001.
- Supplemental:
- Nachiket Kapre and Jan Gray, Hoplite:
Building austere overlay NoCs for FPGAs. In International
Conference on Field-Programmable Logic and Applications, 2015.
- Nachiket Kapre, Marathon:
Statically-Scheduled Conflict-Free Routing on FPGA Overlay
NoCs.
In International Symposium on Field-Programmable Custom
Computing Machines, 2016.
- Charles Seitz, Let's Route Packets Instead of Wires.
In Proceedings of the 6th MIT Conference on Advanced Research in VLSI,
pp. 133--137, 1990. [online link?]
- Jose Duato, Sudhakar Yalamanchili, and Lionel Ni.
Interconnection Networks, Morgan Kaufmann, 2002.
- William Dally and Brian Towles, Principles and Practices of Interconnection
Networks,
Morgan Kaufmann, 2004.
- Day 26
- Recommended:
- André DeHon. Chapter 37: Defect and Fault Tolerance. In
Reconfigurable Computing: The Theory and Practice of FPGA-Based
Computation, Elsevier, 2008. See Canvas site.
- Supplemental:
- Avinash Sodani, Roger Gramunt, Jesus Corbal, Ho-Seop Kim, Krishna
Vinod, Sundaram Chinthamani, Steven Hutsell, Rajat Agarwal, and
Yen-Chen Liu.
Knights
Landing: Second-Generation Intel Xeon Phi Product, IEEE Micro,
volume 36, issue 2, pp. 34--46 ,March 2016.
- David Patterson, Garth Gibson, and Randy Katz, A Case for
Redundant Arrays of Inexpensive Disks (RAID). In Proceedings of
the ACM SIGMOD Conference, p. 109--116, June 1988.
- W. Bruce Culbertson and Rick Amerson and Richard Carter and
Phil Kuekes and Greg Snider, Defect Tolerance on the
TERAMAC Custom Computer.
In International Symposium on Field-Programmable Custom
Computing Machines, pp. 116--123, 1997.
- Day 27
- Recommended:
-
Michael D. Linderman, Matthew Ho, David L. Dill, Teresa
H. Meng, and Garry P. Nolan.
Towards
program optimization through automated analysis of numerical
precision, Proceedings of the IEEE/ACM international symposium
on Code generation and optimization, pp. 230-237, 2010.
[distribution]
- Supplemental:
- Xilinx
Floating-Point unit sizes
- David Goldberg, What every computer
scientist should know about floating-point arithmetic, ACM
Computing Surveys, Vol. 23, Issue 1, pages 5--48, 1991.
[also here]
Course Calendar
ESE532: System-on-a-Chip Architecture
Last modified: Tue Nov 13 19:43:08 EST 2018