lois Field (MIGF) multiplier as an accelerator for the
well-known Repeated Multiplication Method of re-
duction. In section 3 we present the improvement in
performance achieved through the use of MIGF mul-
tiplier. We compute the increase in hardware com-
plexity of the said multiplier which is offset by the
improvement in performance of the reduction opera-
tion. In section 4 we present the synthesis results of
a 32-bit MIGF multiplier and evaluate the improve-
ment in performance of reduction operation over five
NIST recommended irreducible polynomials. Finally
we conclude the paper with a short summary.
2 REDUCTION OVER BINARY
FIELDS: THE BASIC
OPERATIONS INVOLVED AND
THEIR REALIZATION
The reduction operation is a modulo operation of a
polynomial with an irreducible polynomial that gen-
erates the finite field under consideration. Section 2.1
presents a brief mathematical background of reduc-
tion operation, various ways of implementing it and
the associated implications. In section 2.2 we com-
pare two algorithms for reduction operation and iden-
tify polynomial multiplications as the core computa-
tions in them. In section 2.3 we analyze the multipli-
cation operations involved in reduction. In section 2.4
we present the design of a MIGF multiplier that can
be used for efficient implementation of the aforemen-
tioned polynomial multiplications.
2.1 Mathematical Background of
Reduction Operation
Elements of a binary field are usually represented as
polynomials over the base field GF(2) i.e. the degree
of the polynomials is determined by the order of the
field and the coefficients belong to GF(2). Multipli-
cation of such elements is governed by the addition
and multiplication rules over GF(2). For instance,
let us consider two elements A(x) and B(x) belonging
to the binary field GF(2
m
). These polynomials can
be represented as a string of m symbols, where each
symbol is 0 or 1. Therefore they are equivalent to two
m-bit long binary strings. Equation 1 shows the two
polynomials and their product C(x).
A(x) = Σ
m−1
i=0
a
i
x
i
B(x) = Σ
m−1
i=0
b
i
x
i
C(x) = A(x) × B(x) (1)
= Σ
2m−2
i=0
c
i
x
i
;wherec
k
= Σ
i+ j=k
a
i
b
j
As is apparent from equation 1, the result C(x) is al-
most twice as long as the input polynomials. C(x) has
a unique equivalent canonical representation among
the set of polynomials of degree m−1. Though, math-
ematically both the representations are equivalent, ef-
ficient utilization of computation resources necessi-
tates conversion from the 2m−1-bit representation to
the m-bit representation. This conversion often re-
ferred to as reduction operation is based on an irre-
ducible polynomial that generates the binary field of
interest. The reduction operation is based on the fact
that a polynomial C(x) belonging to a finite field is
equivalent to the polynomial modulo an irreducible
polynomial P(x) that generates the finite field.
C(x) ≡ C(x) mod P(x) (2)
From equation 2 it is clear that the reduced polyno-
mial can be computed by traditional long division
technique for polynomials. But this method is iter-
ative in nature and requires up to m− 1 iterations.
At this point let us digress a little and consider
the aspect of flexibility regarding reduction opera-
tion over finite fields. There are two major factors
that govern flexibility of a reduction method: the or-
der of the finite field and the irreducible polynomial
that generates the finite field. A flexible reduction
method/system should be capable of operating over
finite fields of arbitrarily large order. Such a solution
should also be versatile enough to handle all possible
irreducible polynomial for any given field order. A
purely hardware approach (Peter et al., 2007; Saqib
et al., 2004) to support arbitrarily flexible reduction
cannot be employed since a hardware solution cannot
be used for finite field beyond a certain range. More-
over supporting all possible irreducible polynomials
even upto a specified field order will immensely in-
crease the complexity of the hardware. A purely soft-
ware implementation is capable of delivering the de-
sired flexibility, but poor performance of such an im-
plementation may make it highly inefficient over very
large fields. In order to cope with this, it is neces-
sary to develop hybrid solutions. In a hybrid solution
the data-path of the core computations are realized as
fast hardware kernels and the control-path to invoke
and cascade the hardware kernels is realized using a
thin layer of software. Such coexistence of hardware
and software necessitates some kind of a protocol to
govern the communication between the two domains.
One of the most important aspects of such a proto-
col is the data-granularity of the hardware kernels.
Data-granularity determines the amount of data that
can be processed by the individual hardware kernels
at any time. In architecture terminology, this granu-
larity translates to word-length. Transport latency of
data and metadata in such hybrid systems is strongly
A METHOD FOR FLEXIBLE REDUCTION OVER BINARY FIELDS USING A FIELD MULTIPLIER
51