data:image/s3,"s3://crabby-images/2cd86/2cd86060402fb03dfc4ba34553561b325f454435" alt="" |
Compiling for ASH |
data:image/s3,"s3://crabby-images/e2d63/e2d63778c360f1b92674a9afd274103f7459205f" alt="" |
|
|
We have developed the CASH (Compiler for ASH) compiler which translates programs written in C into hardware implementations. The compiler applies several code transformations:
- It performs traditional software optimizations.
- We have developed a width analysis algorithm which can discover narrow width computations in C programs. 30% of the bits computed in average C programs are simply useless; in a reconfigurable hardware implementation we can optimize these away.
- The program is decomposed into parts which are run on the reconfigurable fabric and on the supporting CPU; the CPU will handle tasks which do not map well on the reconfigurable hardware, like virtual memory management, operating system activity and I/O.
- The program is analyzed and the memory used is partitioned into blocks which can be accessed in parallel. If a block is accessed only from a small portion of the computation, it will be laid out close to the circuits accessing it (we haven't implemented this part yet).
- The program is decomposed in SAMs.
- Each SAM is synthesized in hardware.
|
data:image/s3,"s3://crabby-images/0eba7/0eba7c58615ceab3164ef4ea55ee4e69148fd8db" alt="" |
|
|
data:image/s3,"s3://crabby-images/34762/347625003e8d190dd18dfc5bad4f4f651d73d824" alt="" |
|
data:image/s3,"s3://crabby-images/3bc0d/3bc0d7a723f0421431365040679b1cda51cf825c" alt="" |
|
data:image/s3,"s3://crabby-images/298f7/298f7f05d8aaa6fe3266370d4ac7c78c520c4523" alt="" |
data:image/s3,"s3://crabby-images/2cd86/2cd86060402fb03dfc4ba34553561b325f454435" alt="" |
Energy Efficiency |
data:image/s3,"s3://crabby-images/e2d63/e2d63778c360f1b92674a9afd274103f7459205f" alt="" |
|
|
When implemented directly in hardware, ASH is up to
three orders of magnitude more power-efficient
than superscalar processors, one to two orders of
magnitude better than low-power DSP processors, one order
of magnitude better than asynchronous processors; it is
roughly comparable to compared to custom hand-designed
hardware.
data:image/s3,"s3://crabby-images/f0e47/f0e473f995a5229e045292b06f2b844abea86679" alt="energy efficiency" |
|
data:image/s3,"s3://crabby-images/34762/347625003e8d190dd18dfc5bad4f4f651d73d824" alt="" |
|
data:image/s3,"s3://crabby-images/3bc0d/3bc0d7a723f0421431365040679b1cda51cf825c" alt="" |
|
data:image/s3,"s3://crabby-images/298f7/298f7f05d8aaa6fe3266370d4ac7c78c520c4523" alt="" |
|