In linear algebra, the Hermite normal form is an analogue of reduced echelon form for matrices over the integersZ. Just as reduced echelon form can be used to solve problems about the solution to the linear system Ax=b where x is in Rn, the Hermite normal form can solve problems about the solution to the linear system Ax=b where this time x is restricted to have integer coordinates only. Other applications of the Hermite normal form include integer programming,[1]cryptography,[2] and abstract algebra.[3]
Definition
Various authors may prefer to talk about Hermite normal form in either row-style or column-style. They are essentially the same up to transposition.
Row-style Hermite normal form
An m by n matrix A with integer entries has a (row) Hermite normal form H if there is a square unimodular matrixU where H=UA and H has the following restrictions:[4][5][6]
H is upper triangular (that is, hij = 0 for i > j), and any rows of zeros are located below any other row.
The leading coefficient (the first nonzero entry from the left, also called the pivot) of a nonzero row is always strictly to the right of the leading coefficient of the row above it; moreover, it is positive.
The elements below pivots are zero and elements above pivots are nonnegative and strictly smaller than the pivot.
The third condition is not standard among authors, for example some sources force non-pivots to be nonpositive[7][8] or place no sign restriction on them.[9] However, these definitions are equivalent by using a different unimodular matrix U. A unimodular matrix is a square invertible integer matrix whose determinant is 1 or −1.
Column-style Hermite normal form
A m-by-n matrix A with integer entries has a (column) Hermite normal form H if there is a square unimodular matrixU where H=AU and H has the following restrictions:[8][10]
H is lower triangular, hij = 0 for i < j, and any columns of zeros are located on the right.
The leading coefficient (the first nonzero entry from the top, also called the pivot) of a nonzero column is always strictly below of the leading coefficient of the column before it; moreover, it is positive.
The elements to the right of pivots are zero and elements to the left of pivots are nonnegative and strictly smaller than the pivot.
Note that the row-style definition has a unimodular matrix U multiplying A on the left (meaning U is acting on the rows of A), while the column-style definition has the unimodular matrix action on the columns of A. The two definitions of Hermite normal forms are simply transposes of each other.
Existence and uniqueness of the Hermite normal form
Every full row rank m-by-n matrix A with integer entries has a unique m-by-n matrix H in Hermite normal form, such that H=UA for some square unimodular matrix U.[5][11][12]
Examples
In the examples below, H is the Hermite normal form of the matrix A, and U is a unimodular matrix such that UA = H.
If A has only one row then either H = A or H = −A, depending on whether the single row of A has a positive or negative leading coefficient.
Algorithms
There are many algorithms for computing the Hermite normal form, dating back to 1851. One such algorithm is described in.[13]: 43--45 But only in 1979 an algorithm for computing the Hermite normal form that ran in strongly polynomial time was first developed;[14] that is, the number of steps to compute the Hermite normal form is bounded above by a polynomial in the dimensions of the input matrix, and the space used by the algorithm (intermediate numbers) is bounded by a polynomial in the binary encoding size of the numbers in the input matrix.
One class of algorithms is based on Gaussian elimination in that special elementary matrices are repeatedly used.[11][15][16] The LLL algorithm can also be used to efficiently compute the Hermite normal form.[17][18]
Applications
Lattice calculations
A typical lattice in Rn has the form where the ai are in Rn. If the columns of a matrix A are the ai, the lattice can be associated with the columns of a matrix, and A is said to be a basis of L. Because the Hermite normal form is unique, it can be used to answer many questions about two lattice descriptions. For what follows, denotes the lattice generated by the columns of A. Because the basis is in the columns of the matrix A, the column-style Hermite normal form must be used. Given two bases for a lattice, A and A', the equivalence problem is to decide if This can be done by checking if the column-style Hermite normal form of A and A' are the same up to the addition of zero columns. This strategy is also useful for deciding if a lattice is a subset ( if and only if ), deciding if a vector v is in a lattice ( if and only if ), and for other calculations.[19]
Integer solutions to linear systems
The linear system Ax = b has an integer solution x if and only if the system Hy = b has an integer solution y where y = U−1x and H is the column-style Hermite normal form of A. Checking that Hy = b has an integer solution is easier than Ax = b because the matrix H is triangular.[11]: 55
Implementations
Many mathematical software packages can compute the Hermite normal form:
Hermite normal form can be defined when we replace Z by an arbitrary Dedekind domain.[20] (for instance, any principal-ideal domain). For instance, in control theory it can be useful to consider Hermite normal form for the polynomials F[x] over a given field F.