EE 7700-2 - What's New
12 May 2011, 15:22:23 CDT
Final Exam Question Hint: The answer to Problem 2 applies
to CC 1.X as well as CC 2.X GPUs. In other words, just think
about CC 1.X GPUs when answering the question.
10 May 2011, 9:38:28 CDT
Final exam assigned. Please start working
on the take-home final. As mentioned in class it is based on Lee 2010,
the "Debunking" paper. Please feel free to send me questions. Answers
to some questions will be posted here so check back. A completed
exam is due Saturday morning 14 May. Good luck! Note that I'm
out of town now and I won't be back until Thursday.
5 May 2011, 18:17:46 CDT
Linked the radix sort
notes to the lecture notes page.
4 May 2011, 15:53:16 CDT
The paper to read for the next exam
has been linked to the references
page. The paper should be freely accessible within LSU, elsewhere use
the user id "ee4720" and the password given in class. (A good warning
to kids so they don't stick their hands in the cage near basketball
In class a student asked about
comparisons between Knights Ferry (called Larrabee by you-know-who)
and NVIDIA GPUs. The answer was yes, an
does a detailed comparison of the performance of a radix sort
on GPU and Knights Ferry. In fact, some of the authors
also wrote the
assigned in class.
27 April 2011, 14:36:37 CDT
Updated the NVIDIA GPU Microarchitecture notes
to include additional material on scheduling and Fermi (CC 2.X).
25 April 2011, 13:59:58 CDT
Updated the NVIDIA GPU Microarchitecture notes
to include the material on branch divergence.
13 April 2011, 18:01:42 CDT
Here is a draft of the radix sort analysis
shown in class today (with a few additions). A more complete
version will be posted tomorrow or Friday.
30 March 2011, 17:17:42 CDT
Here is a reference for
basic parallel sum and prefix sum. The parallel sum is on page 1172
(3) and is headed "Sum of an Array of Numbers". The principal
author of the paper, Daniel Hillis, is a co-founder of Thinking
Machines, maker of the Connection Machine, an early parallel
computer design. Remember that these basic parallel techniques
are only the starting point for efficient GPU implementation.
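As a rough illustration of the two basic techniques mentioned above, here is a minimal sequential Python sketch. It is not taken from the referenced paper and it is not GPU code: the function names `parallel_sum` and `inclusive_prefix_sum` are made up for this example, and each inner loop stands in for work that a parallel machine would perform in a single step.

```python
def parallel_sum(values):
    """Tree-style sum: about log2(n) steps; additions within a step are independent."""
    data = list(values)
    n = len(data)
    stride = 1
    while stride < n:
        # Every iteration of this inner loop could run at the same time,
        # because no two iterations touch the same element.
        for i in range(0, n - stride, 2 * stride):
            data[i] += data[i + stride]
        stride *= 2
    return data[0] if data else 0


def inclusive_prefix_sum(values):
    """Step-doubling prefix sum: element i ends up holding values[0] + ... + values[i]."""
    data = list(values)
    n = len(data)
    offset = 1
    while offset < n:
        # Read from a snapshot so that every update in this step sees the
        # values from the previous step (the sequential analog of double buffering).
        prev = data[:]
        for i in range(offset, n):
            data[i] = prev[i - offset] + prev[i]
        offset *= 2
    return data


if __name__ == "__main__":
    print(parallel_sum([1, 2, 3, 4, 5]))
    print(inclusive_prefix_sum([1, 2, 3, 4, 5]))
```

The sketch shows only the step structure. An efficient GPU version would also have to consider shared memory use, memory access patterns, and synchronization, none of which are modeled here.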
19 March 2011, 18:34:43 CDT
Updated the solution to Homework 2, including
the version linked here.
Also posted the Homework 3 assignment,
but expect an E-mail message providing details.
18 March 2011, 16:28:11 CDT
Checked in a solution to Homework 2 Problem 3a to the svn
repository. For your convenience, here is the kernel.
Extensive comments have been added to the stencil_shared
routine. The solution appears in stencil_shared_2, however it is not
unrolled and it does not access memory efficiently. Unroll-friendly code
will be put in another routine. For Homework 3 fix the global
and shared memory problems with stencil_shared_2. I'll post more
detailed instructions by Monday.
16 March 2011, 16:42:43 CDT
Linked the NVIDIA GPU Microarchitecture notes
used in today's class to the lecture notes
page. They are under
construction; please report errors or request clarifications.
10 March 2011, 15:50:17 CST
Homework 2 deadline will be extended to Monday
if at least two students ask non-trivial questions about
the assignment in Friday's class or earlier.
4 March 2011, 11:36:36 CST
Assigned Homework 2, due Friday 11 March 2011.
2 March 2011, 11:16:56 CST
Linked Homework 1 solution
to the previous work
page. Also checked in solution code to the svn repository;
do an "svn update" in your homework directory to get it. The
solution is in new files with names starting "solution-".
27 February 2011, 16:12:35 CST
Those getting errors about missing files on Homework 1 when issuing the
"make" command should update their homework code:
This will modify the makefile but won't change other
22 February 2011, 18:20:52 CST
Homework 1 assigned, due Monday, 28 February.
ECE computer accounts are needed for this assignment;
they will be given out in class, or E-mail me for an account.
16 February 2011, 17:48:53 CST
account setup instructions
I still haven't assigned anything but students
are free to follow them. E-mail me if you need a class account.
24 January 2011, 12:39:19 CST
Please look at the description of the NVIDIA 3-series
GPUs in this
paper. (Other GPU descriptions can be found
in the GPU descriptions
page.) Don't worry if the paper seems difficult,
we'll go over the important points in class.
15 October 2010, 17:09:14 CDT
Updated Web pages for the Spring 2011 Semester.