Abstract
Intel has recently announced a new set of processor instructions, dubbed AVX512IFMA, that carry out Integer Fused Multiply Accumulate operations. These instructions operate on 512-bit registers and compute eight independent 52-bit unsigned integer multiplications, to generate eight 104-bit products, and accumulate their low/high halves into 64-bit containers. Using these instructions requires that inputs are converted to (redundant form) radix 252, and outputs are converted to the desired representation. This paper demonstrates several techniques for leveraging the AVX512IFMA instructions in order to speed up big-integer multiplications. Although processors that support AVX512IFMA are not yet available at the time this paper is written, we show how currently available public tools can be used for estimating their potential performance benefits. For example, based on these tools, we expect a 2x speedup for 1024-bit integer multiplication, over the best currently available method.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 IEEE 23rd Symposium on Computer Arithmetic, ARITH 2016 |
Editors | Javier Hormigo, Nathalie Revol, Paolo Montuschi, Stuart Oberman, Michael Schulte |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 32-38 |
Number of pages | 7 |
ISBN (Electronic) | 9781509016150 |
DOIs | |
State | Published - 7 Sep 2016 |
Event | 23rd IEEE Symposium on Computer Arithmetic, ARITH 2016 - Santa Clara, United States Duration: 10 Jul 2016 → 13 Jul 2016 |
Publication series
Name | Proceedings - Symposium on Computer Arithmetic |
---|---|
Volume | 2016-September |
Conference
Conference | 23rd IEEE Symposium on Computer Arithmetic, ARITH 2016 |
---|---|
Country/Territory | United States |
City | Santa Clara |
Period | 10/07/16 → 13/07/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- AVX
- AVX512
- IFMA
- SIMD
- big integer arithmetic
- processor instructions
ASJC Scopus subject areas
- Hardware and Architecture
- Software
- Theoretical Computer Science