For evaluating the univariate polynomial the most naive method would use multiplications to compute , use multiplications to compute and so on for a total of multiplications and additions.
Using better methods, such as Horner's rule, this can be reduced to multiplications and additions. If some preprocessing is allowed, even more savings are possible.
In the former case, polynomials are evaluated using floating-point arithmetic, which is not exact. Thus different schemes for the evaluation will, in general, give slightly different answers. In the latter case, the polynomials are usually evaluated in a finite field, in which case the answers are always exact.
Horner's method evaluates a polynomial using repeated bracketing:
This method reduces the number of multiplications and additions to just
Horner's method is so common that a computer instruction "multiply–accumulate operation" has been added to many computer processors, which allow doing the addition and multiplication operations in one combined step.
Multivariate
If the polynomial is multivariate, Horner's rule can be applied recursively over some ordering of the variables.
E.g.
can be written as
An efficient version of this approach was described by Carnicer and Gasca.[1]
While it's not possible to do less computation than Horner's rule (without preprocessing), on modern computers the order of evaluation can matter a lot for the computational efficiency.
A method known as Estrin's scheme computes a (single variate) polynomial in a tree like pattern:
Arbitrary polynomials can be evaluated with fewer
operations than Horner's rule requires if we first "preprocess"
the coefficients .
An example was first given by Motzkin[2] who noted that
can be written as
where the values are computed in advanced, based on .
Motzkin's method uses just 3 multiplications compared to Horner's 4.
The values for each can be easily computed by expanding and equating the coefficients:
Example
To compute the Taylor expansion,
we can upscale by a factor 24, apply the above steps, and scale back down.
That gives us the three multiplication computation
Improving over the equivalent Horner form (that is ) by 1 multiplication.
Evaluation of a degree-n polynomial at multiple points can be done with multiplications by using Horner's method times. Using the above preprocessing approach, this can be reduced by a factor of two; that is, to multiplications.
However, it is possible to do better and reduce the time requirement to just .[4]
The idea is to define two polynomials that are zero in respectively the first and second half of the points: and .
We then compute and using the Polynomial remainder theorem, which can be done in time using a fast Fourier transform.
This means and by construction, where and are polynomials of degree at most .
Because of how and were defined, we have
Thus to compute on all of the , it suffices to compute the smaller polynomials and on each half of the points.
This gives us a divide-and-conquer algorithm with , which implies by the master theorem.
In the case where the points in which we wish to evaluate the polynomials have some structure, simpler methods exist.
For example, Knuth[5] section 4.6.4
gives a method for tabulating polynomial values of the type
Dynamic evaluation
In the case where are not known in advance,
Kedlaya and Umans[6] gave a data structure for evaluating polynomials over a finite field of size in time per evaluation after some initial preprocessing.
This was shown by Larsen[7] to be essentially optimal.
The idea is to transform of degree into a multivariate polynomial, such that and the individual degrees of is at most .
Since this is over , the largest value can take (over ) is .
Using the Chinese remainder theorem, it suffices to evaluate modulo different primes with a product at least .
Each prime can be taken to be roughly , and the number of primes needed, , is roughly the same.
Doing this process recursively, we can get the primes as small as .
That means we can compute and store on all the possible values in time and space.
If we take , we get , so the time/space requirement is just
Kedlaya and Umans further show how to combine this preprocessing with fast (FFT) multipoint evaluation.
This allows optimal algorithms for many important algebraic problems, such as polynomial modular composition.
Specific polynomials
While general polynomials require operations to evaluate, some polynomials can be computed much faster.
For example, the polynomial can be computed using just one multiplication and one addition since
A particularly interesting type of polynomial is powers like .
Such polynomials can always be computed in operations.
Suppose, for example, that we need to compute ; we could simply start with and multiply by to get .
We can then multiply that by itself to get and so on to get and in just four multiplications.
Other powers like can similarly be computed efficiently by first computing by 2 multiplications and then multiplying by .
The most efficient way to compute a given power is provided by addition-chain exponentiation. However, this requires designing a specific algorithm for each exponent, and the computation needed for designing these algorithms are difficult (NP-complete[8]), so exponentiation by squaring is generally preferred for effective computations.
The fact that some polynomials can be computed significantly faster than "general polynomials" suggests the question: Can we give an example of a simple polynomial that cannot be computed in time much smaller than its degree?
Volker Strassen has shown[9] that the polynomial
cannot be evaluated with less than multiplications and additions.
At least this bound holds if only operations of those types are allowed, giving rise to a so-called "polynomial chain of length ".
The polynomial given by Strassen has very large coefficients, but by probabilistic methods, one can show there must exist even polynomials with coefficients just 0's and 1's such that the evaluation requires at least multiplications.[10]
For other simple polynomials, the complexity is unknown.
The polynomial is conjectured to not be computable in time for any .
This is supported by the fact that, if it can be computed fast, then integer factorization can be computed in polynomial time, breaking the RSA cryptosystem.[11]
Matrix polynomials
Sometimes the computational cost of scalar multiplications (like ) is less than the computational cost of "non scalar" multiplications (like ).
The typical example of this is matrices.
If is an matrix, a scalar multiplication takes about arithmetic operations, while computing takes about (or using fast matrix multiplication).
Paterson and Stockmeyer[12]
showed how to compute a degree polynomial using only non scalar multiplications and scalar multiplications.
Thus a matrix polynomial of degree n can be evaluated in time. If this is , as fast as one matrix multiplication with the standard algorithm.
This method works as follows: For a polynomial
let k be the least integer not smaller than
The powers are computed with matrix multiplications, and are then computed by repeated multiplication by
Now,
,
where for i ≥ n.
This requires just more non-scalar multiplications.
The direct application of this method uses non-scalar multiplications, but combining it with Evaluation with preprocessing, Paterson and Stockmeyer show you can reduce this to .
Methods based on matrix polynomial multiplications and additions have been proposed allowing to save nonscalar matrix multiplications with respect to the Paterson-Stockmeyer method.[13]
See also
Estrin's scheme to facilitate parallelization on modern computer architectures
^Rabin, Michael O.; Winograd, Shmuel (July 1972). "Fast evaluation of polynomials by rational preparation". Communications on Pure and Applied Mathematics. 25 (4): 433–458. doi:10.1002/cpa.3160250405.