TY - GEN
T1 - Finding the best compromise in compiling compound loops to Verilog
AU - Ben-Asher, Yosi
AU - Shochat, Eddie
PY - 2008
Y1 - 2008
N2 - We consider a special optimization problem involved with compiling compound loops (combining nested and consecutive sub-loops) with array references to Verilog. Each sub-loop of the compound loop may require a different optimized hardware configuration (OHC) for optimized execution times. For example, one loop requires at least two memory ports and one multiplier for an optimized execution time while another loop may require only one memory port but two multipliers, yet one OHC should be selected for both loops. The goal is to compute a minimal OHC which, based on the different execution frequencies of the sub-loops, is a good compromise between all the conflicting requirements of each sub-loop. Though synthesis of nested loop has been implemented in quite a few systems this aspect has not been considered so far. We avoid the use of Integer Linear Programming (ILP) techniques and use instead a fast space exploration technique that is combined with an efficient variant of List scheduling. Another novel aspect of the proposed system is the observation that the real latencies of the hardware units should be considered as variables of the OHC rather than fixed real values as is usually done in highlevel synthesis systems. Experimental results show a significant improvement of the OHC and power consumption without a significant increase in the execution time due to use of this search procedure.
AB - We consider a special optimization problem involved with compiling compound loops (combining nested and consecutive sub-loops) with array references to Verilog. Each sub-loop of the compound loop may require a different optimized hardware configuration (OHC) for optimized execution times. For example, one loop requires at least two memory ports and one multiplier for an optimized execution time while another loop may require only one memory port but two multipliers, yet one OHC should be selected for both loops. The goal is to compute a minimal OHC which, based on the different execution frequencies of the sub-loops, is a good compromise between all the conflicting requirements of each sub-loop. Though synthesis of nested loop has been implemented in quite a few systems this aspect has not been considered so far. We avoid the use of Integer Linear Programming (ILP) techniques and use instead a fast space exploration technique that is combined with an efficient variant of List scheduling. Another novel aspect of the proposed system is the observation that the real latencies of the hardware units should be considered as variables of the OHC rather than fixed real values as is usually done in highlevel synthesis systems. Experimental results show a significant improvement of the OHC and power consumption without a significant increase in the execution time due to use of this search procedure.
UR - http://www.scopus.com/inward/record.url?scp=51849152824&partnerID=8YFLogxK
U2 - 10.1109/ISVLSI.2008.10
DO - 10.1109/ISVLSI.2008.10
M3 - Conference contribution
AN - SCOPUS:51849152824
SN - 9780769531700
T3 - Proceedings - IEEE Computer Society Annual Symposium on VLSI: Trends in VLSI Technology and Design, ISVLSI 2008
SP - 495
EP - 498
BT - Proceedings - IEEE Computer Society Annual Symposium on VLSI
T2 - IEEE Computer Society Annual Symposium on VLSI: Trends in VLSI Technology and Design, ISVLSI 2008
Y2 - 7 April 2008 through 9 April 2008
ER -