The Subjectivity of Computers

Jean-François COLONNA
[Contact me]

www.lactamme.polytechnique.fr

CMAP (Centre de Mathématiques APpliquées) UMR CNRS 7641, École polytechnique, Institut Polytechnique de Paris, CNRS, France
france telecom, France Telecom R&D

[Site Map, Help and Search [Plan du Site, Aide et Recherche]]
[The Y2K Bug [Le bug de l'an 2000]]
[Real Numbers don't exist in Computers and Floating Point Computations aren't safe. [Les Nombres Réels n'existent pas dans les Ordinateurs et les Calculs Flottants ne sont pas sûrs.]]
[Please, visit A Virtual Machine for Exploring Space-Time and Beyond, the place where you can find more than 10.000 pictures and animations between Art and Science]
(CMAP28 WWW site: this page was created on 07/11/1995 and last updated on 10/03/2024 17:20:39 -CEST-)

(published in Communications of the ACM, volume 36, number 8, 08/1993)

Abstract: Real numbers do not exist in a computer. Ignoring this fact can lead to erroneous results when using floating point computations.

Keywords: Rounding-off Errors, Real Numbers, Floating Point Computations.

Contents:

1 - Sensitivity to the accuracy of numerical values:

1.1 - Rounded figure errors:
1.2 - Computer-based formulation of the model:
1.3 - Heterogeneous parallelism:

2 - Elementary method of detecting sensitivity to the accuracy of numerical values by re-sequencing arithmetic expressions:
3 - Contribution of K language with regard to the test for problems subject to the accuracy of numerical values:
4 - Conclusion:

Introduction:

1-the infinite Real number field is mapped on many different finite floating-point number sets and these sets are not them'selves fields (see for example the KAM theorem that says important things about the consequences of the approximation of Real numbers with rational numbers).
2-round off errors are not always negligible: when they occur, they imply in particular the loss of following properties: associativity of the multiplication and distributivity of the multiplication over the addition; mathematical equivalent formalisms do not give then equivalent programs when implemented. Moreover, when changing the compiler used (for example when updating the operating system of the computer used) one may discover that old programs recompiled do not give anymore the results expected (because of a different ordering of the elementary operations). The situation could become worse with the sophisticated new microprocessors where very complex scheduling algorithm are implemented...
3-the widespread usage of heterogeneous cooperations must take these facts into account (where 'heterogeneous' means "different hardwares" and/or "different compilers").
4-simple methods can be implemented to detect the sensitivity to round off errors in complex existing programs (for example in fluid dynamics).
5-most programs ask for reproductibility; when using dynamical systems (for random number generation or cryptographic applications,...) it could become impossible to have reproductibility, for example, when changing the computing environment.

1 - Sensitivity to the accuracy of numerical values:

The Lorenz attractor -sensitivity to initial conditions (displayed as the central point of each frame)-

1.1 - Rounded figure errors:

paragraph 1.2

The Lorenz attractor

Euler of the first order

Runge-Kutta of the second order

Runge-Kutta of the fourth order

Computation of the Lorenz attractor on three different computers (the Red one, the Green one and the Blue one: sensitivity to rounding-off errors)

The Lorenz attractor -sensitivity to integration methods used (Red=Euler, Green=Runge-Kutta/2nd order, Blue=Runge-Kutta/4th order)-

more visualizations

1.2 - Computer-based formulation of the model:

                                       2
                    X  = (R+1)X    - RX
                     n         n-1     n-1

C program

                    IBM ES9000:

                              (R+1)X-R(XX)   (R+1)X-(RX)X   ((R+1)-(RX))X  RX+(1-(RX))X   X+R(X-(XX))

                    X(00)   = 0.500000       0.500000       0.500000       0.500000       0.500000
                    X(10)   = 0.384631       0.384631       0.384631       0.384631       0.384631
                    X(20)   = 0.418895       0.418895       0.418895       0.418895       0.418895
                    X(30)   = 0.046399       0.046399       0.046399       0.046399       0.046399
                    X(40)   = 0.320185       0.320183       0.320188       0.320182       0.320189
                    X(50)   = 0.063406       0.064521       0.061895       0.064941       0.061244
                    X(60)   = 1.040381       0.846041       0.529794       1.319900       1.214070
                    X(70)   = 0.004104       1.199452       0.873553       0.573637       0.000009
                    X(80)   = 0.108044       0.121414       1.260726       0.395871       0.280590
                    X(90)   = 0.096374       0.089244       0.582157       0.344503       1.023735

                    IBM RS6000:

                              (R+1)X-R(XX)   (R+1)X-(RX)X   ((R+1)-(RX))X  RX+(1-(RX))X   X+R(X-(XX))

                    X(00)   = 0.500000       0.500000       0.500000       0.500000       0.500000
                    X(10)   = 0.384631       0.384631       0.384631       0.384631       0.384631
                    X(20)   = 0.418895       0.418895       0.418895       0.418895       0.418895
                    X(30)   = 0.046399       0.046399       0.046399       0.046399       0.046399
                    X(40)   = 0.320177       0.320184       0.320188       0.320190       0.320189
                    X(50)   = 0.067567       0.063747       0.061859       0.060822       0.061486
                    X(60)   = 0.001145       0.271115       0.616781       0.298613       1.307350
                    X(70)   = 1.296775       1.328462       0.486629       0.938605       1.054669
                    X(80)   = 0.553038       0.817163       1.277151       1.325437       0.617058
                    X(90)   = 0.094852       0.154184       1.174162       0.148151       0.237355

                    X = (R+1)*X - R*X*X;

                    Power-Challenge M Silicon Graphics (R8000, IRIX 6.2, cc 7.0):

                              option '-O2'   option '-O3'

                    X(00)   = 0.500000       0.500000
                    X(10)   = 0.384631       0.384631
                    X(20)   = 0.418895       0.418895
                    X(30)   = 0.046399       0.046399
                    X(40)   = 0.320184       0.320188
                    X(50)   = 0.063747       0.061859
                    X(60)   = 0.271115       0.616781
                    X(70)   = 1.328462       0.486629
                    X(80)   = 0.817163       1.277151

N-body problem integration (N=4: one star, one heavy planet and one light planet with a satellite) computed with 2 different optimization options on the same computer (sensitivity to rounding-off errors)

scale

                    scale   = 010  020  030  040  050  060  070  080  090  100
                    n       = 035  070  105  136  169  199  230  265  303  335

1.3 - Heterogeneous parallelism:

problems linked to rounding errors that are indigenous to each machine (i.e. the errors listed below),

problems of rounding errors incoherence during cooperations of machines consisting of different hardwares and compilers.

paragraph 2

Rotation about the X axis of the Lorenz attractor (1000 iterations), computed simultaneously on two different computers

Rotation about the X axis of the Lorenz attractor (5000 iterations), computed simultaneously on two different computers

2 - Elementary method of detecting sensitivity to the accuracy of numerical values by re-sequencing arithmetic expressions:

Stage One:

if, at a fixed iteration number, the five columns of values are different on a given machine, it is because the compiler used bases the generated code on the computer formulation of the problem.

if, by carrying out the same calculation on two different machines, the two sets of results differ, it is either because the two arithmetical units do not operate in a similar way , or because the two compilers analyse the expressions in a different way (these two conditions may exist simultaneously).

Stage Two:

locating the "sensitive" areas in the program (i.e. the code sequences, usually very localised, in which the iterative and non-linear processes are implemented),

re-writing these sequences in two or three different ways (e.g. by re-sequencing multiplications with several factors or implementing the distributive law of multiplication as compared to addition),

executing each version of the program with the same parameters and initial conditions, and comparing the results obtained.

3 - Contribution of K language with regard to the test for problems subject to the accuracy of numerical values:

                    AxBxC

                    MUL3(A,B,C) ==> MUL2(A,MUL2(B,C))

                    Ax(B + C)

                    DIS2(A,B,C) ==> MUL2(A,ADD2(B,C))

                    MUL2(B,MUL2(C,A))

                    ADD2(MUL2(A,B),MUL2(A,C))

4 - Conclusion:

N-body problem integration (N=4: one star, one heavy planet and one light planet with a satellite) computed on three different computers (the Red one, the Green one and the Blue one: sensitivity to rounding-off errors)