?r??t à ??s s?????t???s? ??? ??ss? ?ét?r????r ??s ???t??rs ????tré? q?? ??????s??t s?sté??t?q?????t ??? ?rr??rs ??s ???s ????rt??t?s ?st ?? ?r???è?? très ???????? ??r??r ??????t ??s ?rr??rs ???rr???? ?st? ?? ?é?ér??? ?ss?? ????????? ???ér??t?s ?ét????s ??t été ?r???sé?s ???r ?? ?????? ??tt??t ????? ???r?t??ét?q?? st????st?q?? ???? ?? ???????s? st?t?q?? ????? ??s tr????? r???rq?????s ré???ts ???s ??r??t ??s é?é???ts ?? s???t??? à ??s ?r???è??s ???t???s?t??? ?t ??????t????? ???s s? t?ès??ss???? ??r?? ????tr?s ??? ???s ?rès? ??rs ±?? ?t??? ?? ???rr??r r????r??é?? ?? ? ?? s?rt?? ?? ?????r????t ??? s??s ?? ???r?t??ét?q?? ????t?r?????s? ?? ??tt? ?rr??r? ??s ??r??s ???rr??r r?t??r?é?s ??r ????? s??t ???? ???s ???s q?? ?? q?? ?? t???ér?t?r?? ??s ??r??t???s ?? ?? t??s??? ????????t?t???? ??s ??rt?r? ??t???s é???tr?????ét?q??s ??t??r ?? ????? ??s ?r??ts ???s ?? s??str?t ?? ??r???t? ?t?? ???s ?? ?? ??r???t ??té?ré? ??s ????s ?r????s ?? ???? ??????t ????????r ?? ?réq????? ??s ?s?????t??rs ?? ??????? ?q?? ???r????t êtr? t?t??????t ???r?s??s??t? ??é?t??r? ?? ???? ???t ?? ???s êtr? s??s??t? ?? ?ê?? ??????t????? ??s ?tt?q??s s?é???q??s ??????t êtr? ????s??é?s ???r ?é?r???r ?? q????té ?? ????é? ?t ???s? ?r?????s?r ??rt???s s?stè??s ,
??s???rs r??s??s? ?? tr????? ????str? très ???? ??s tr????? s?r ??s ???ér??t?s ????tt?s ?? ???r?t??ét?q?? ??s ?r????t??rs ?rés??té?s????rs r?????s? ?? s????rt ??tér??? ????ér?t???s é????é?s ?st ?é??ss??r? ???r ??rt????s ???????t???s s?é???q??s? ??s ???????t???s ?? tr??t????t ?? s????? ?t ??s ?????s ????t???t ??s ????s???s? ????rs?s? r?????s ??rré?s ?? r?????s ??rré?s ????rs?s? ????tr?s ???????t???s ?t???s??t ??s s???s? ??s???s? ????? ???ér??ts t???s ?? ?ét????s s??t ?r???sés ???s ?? ??ttér?t?r? ???r ??é?????t??? ?? ????t???s? ?? tr???? ???s ???? ??s ??s?s ???r?t??ét?q?? ??s ?r????t??rs ?t ?? ??rt??????r ??s ??ér?t???s ????? ?? ????s???? ??????rs?? ?? r????? ??rré? ?t ?? r????? ??rré? ????rs?? ???s ?????? ??s ?r????????s ?ét????s ??é?????t??? ??s ????t???s é?é???t??r?s ?sin? cos? exp? log? ?t??? s??t ?rés??té?s? P?r?? t??t?s ??s ?ét????s ??ss????s? ??s ???r?????t???s ???????????s s??t s?????t ?t???sé?s? ????s ??r??tt??t ??é?????r ???s ?? tr?????? ???s ???? tr??s ??????t ??t???r ??s ??ér?t??rs s?é???q??s ???r ??é?????t??? ?? ????t???s ??r ??s ,
Digit-serial processing techniques, IEEE Transactions on Circuits and Systems, vol.37, issue.6, pp.707-719, 1990. ,
DOI : 10.1109/31.55029
Advanced Serial-Data Computation, J. Parallel Distributed Comput., (Special Issue on Parallelism in Computer Arithmetic), vol.5, issue.3, pp.228-249, 1988. ,
On-Line arithmetic: A design methodology and applications in digital signal processing, VLSI Signal Processing, pp.252-263, 1988. ,
Signed digit arithmetic on FPGA's, " presented at the Int, FPGA: Logic and Applications, 1993. ,
Bases discretes et calcul des fonctions elementaires par materiel, 1994. ,
Some Operators for On-Line Radix-2 Computations, Journal of Parallel and Distributed Computing, vol.22, issue.2, pp.336-345, 1994. ,
DOI : 10.1006/jpdc.1994.1093
On-line arithmetic: An overview, Proc. SPIE Conf. Real-Time Signal Processing, pp.86-93, 1984. ,
Signed-digit number representations for fast parallel arithmetic, IRE Trans. Electron. Computers, vol.10, pp.389-400, 1961. ,
DOI : 10.1109/tec.1961.5219227
On-line algorithms for division and multiplication, IEEE Trans. Computers, issue.26, pp.681-687, 1977. ,
On-line arithmetic algorithms for efficient implementation, Comput. Sci. Dep, 1990. ,
On-the-Fly Conversion of Redundant into Conventional Representations, IEEE Transactions on Computers, vol.36, issue.7, pp.895-897, 1987. ,
DOI : 10.1109/TC.1987.1676986
Motion control with field programmable gate arrays Tutorial: Tools for Real-Time Motion Control, MOVIC Conf, pp.1-20, 1998. ,
Arithmetique des Ordinateurs, 1989. ,
URL : https://hal.archives-ouvertes.fr/ensl-00086707
A new model for control of systems with friction, IEEE Transactions on Automatic Control, vol.40, issue.3, pp.419-425, 1995. ,
DOI : 10.1109/9.376053
Arithmetique on-line en base 4 pour les controlleurs digitaux en automatique, Ecole Polytechnique de Lausanne, 1997. ,
Division and Square Root: Digit- Recurrence Algorithms and Implementations, Boston: Kluwer Academic, 1994. ,
ªHigh Bandwidth Evaluation of Elementary Functions, Proc. Fifth IEEE Symp, 1981. ,
ªFunction Evaluation by Table Look- Up and Addition, Proc. 12th IEEE Symp, pp.10-16, 1995. ,
DOI : 10.1109/arith.1995.465382
Matula, ªOn Infinitely Precise Roundings for Division, Square-Root, Reciprocal and Square-Root Reciprocal, Proc. 14th IEEE Symp. Computer Arithmetic, pp.233-240, 1999. ,
ªEfficient Initial Approximation and Fast Converging Methods for Division and Square Root, Proc. 12th IEEE Symp, pp.2-9, 1995. ,
ªHigh-Speed Double Precision Computation of Nonlinear Functions, Proc. 12th IEEE Symp, pp.107-114, 1995. ,
Elementary Functions, Algorithms and Implementation, 1997. ,
URL : https://hal.archives-ouvertes.fr/ensl-00000008
Division algorithms and implementations, IEEE Transactions on Computers, vol.46, issue.8, pp.833-854, 1997. ,
DOI : 10.1109/12.609274
ªSymmetric Bipartite Tables for Accurate Function Approximation, Proc. 13th IEEE Symp, 1997. ,
DOI : 10.1109/arith.1997.614893
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.36.6368
ªTable Lookup Algorithms for Elementary Functions and Their Error Analysis, Proc. 10th IEEE Symp, pp.232-236, 1991. ,
Fast hardware-based algorithms for elementary function computations using rectangular multipliers, IEEE Transactions on Computers, vol.43, issue.3, pp.278-294, 1994. ,
DOI : 10.1109/12.272429
Fast evaluation of the elementary functions in single precision, IEEE Transactions on Computers, vol.44, issue.3, pp.453-457, 1995. ,
DOI : 10.1109/12.372037
Very high radix square root with prescaling and rounding and a combined division/square root unit, IEEE Transactions on Computers, vol.48, issue.8, pp.827-841, 1999. ,
DOI : 10.1109/12.795124
Computation of ???(x/d) in a very high radix combined division/square-root unit with scaling and selection by rounding, IEEE Transactions on Computers, vol.47, issue.2, pp.152-161, 1998. ,
DOI : 10.1109/12.663761
Faithful Bipartite ROM Reciprocal Tables, Proc. 12th IEEE Symp, pp.17-28, 1995. ,
Function evaluation by table look-up and addition, Proceedings of the 12th Symposium on Computer Arithmetic, pp.10-16, 1995. ,
DOI : 10.1109/ARITH.1995.465382
Approximating elementary functions with symmetric bipartite tables, IEEE Transactions on Computers, vol.48, issue.8, pp.842-847, 1999. ,
DOI : 10.1109/12.795125
The Symmetric Table Addition Method for Accurate Function Approximation, The Journal of VLSI Signal Processing, vol.21, issue.2, pp.167-177, 1999. ,
DOI : 10.1023/A:1008004523235
A Few Results on Table-Based Methods, Reliable Computing, vol.5, issue.3, pp.279-288, 1999. ,
DOI : 10.1007/978-94-017-1247-7_22
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.39.7830
Elementary Functions, Algorithms and Implementation, 1997. ,
URL : https://hal.archives-ouvertes.fr/ensl-00000008
Analysis of Reciprocal and Square Root Reciprocal Instructions in the AMD K6-2 Implementation of 3DNow!, Electronic Notes in Theoretical Computer Science, vol.24, 1999. ,
DOI : 10.1016/S1571-0661(05)80621-8
A VHDL library of LNS operators, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003, 2003. ,
DOI : 10.1109/ACSSC.2003.1292376
Some improvements on multipartite table methods, Proceedings 15th IEEE Symposium on Computer Arithmetic. ARITH-15 2001, pp.128-135, 2001. ,
DOI : 10.1109/ARITH.2001.930112
URL : https://hal.archives-ouvertes.fr/inria-00072577
Improved table lookup algorithms for postscaled division, Proceedings 15th IEEE Symposium on Computer Arithmetic. ARITH-15 2001, pp.101-108, 2001. ,
DOI : 10.1109/ARITH.2001.930109
A 20 bit logarithmic number system processor, IEEE Transactions on Computers, vol.37, issue.2, 1988. ,
DOI : 10.1109/12.2148
An architecture for addition and subtraction of long word length numbers in the logarithmic number system, IEEE Transactions on Computers, vol.39, issue.11, 1990. ,
DOI : 10.1109/12.61042
Arithmetic on the European logarithmic microprocessor, IEEE Transactions on Computers, vol.49, issue.7, pp.702-715, 2000. ,
DOI : 10.1109/12.863040
A Dual-Path Logarithmic Number System Addition/Subtraction Scheme for FPGA, Proc. Field-Programmable Logic and Applications, 2003. ,
DOI : 10.1007/978-3-540-45234-8_78
Fast evaluation of the elementary functions in single precision, Proc. 36th Asilomar Conf. Signals, Systems, and Computers, pp.453-457, 1995. ,
DOI : 10.1109/12.372037
Faithful Powering Computation Using Table Look-Up and a Fused Accumulation Tree, Proc. 15th IEEE Symp. Computer Arithmetic, N. Burgess and L. Ciminiera, pp.40-47, 2001. ,
Multipartite Tables in Jbits for the Evaluation of Functions on FPGAs, Proc. IEEE Reconfigurable Architecture Workshop, Int'l Parallel and Distributed Symp, 2002. ,
Integer multiplication and division on the HP Precision Architecture, IEEE Transactions on Computers, vol.37, issue.8, pp.980-990, 1988. ,
DOI : 10.1109/12.2248
A SIGNED BINARY MULTIPLICATION TECHNIQUE, The Quarterly Journal of Mechanics and Applied Mathematics, vol.4, issue.2, pp.236-240, 1951. ,
DOI : 10.1093/qjmam/4.2.236
Multiplication by integer constants, Software: Practice and Experience, vol.6, issue.7, pp.641-652, 1986. ,
DOI : 10.1002/spe.4380160704
Some Optimizations of Hardware Multiplication by Constant Matrices, Proc. 16th IEEE Symp. Computer Arithmetic (ARITH 16, pp.20-27, 2003. ,
URL : https://hal.archives-ouvertes.fr/lirmm-00113092
Multiple constant multiplications: efficient and versatile framework and algorithms for exploring common subexpression elimination, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.15, issue.2, pp.151-165, 1996. ,
DOI : 10.1109/43.486662
Advanced Computer Arithmetic Design, 2001. ,
Subexpression sharing in filters using canonic signed digit multipliers, IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol.43, issue.10, pp.677-688, 1996. ,
DOI : 10.1109/82.539000
Fast Integer Multipliers Fit in FPGAs, EDN Magazine, 1994. ,
DCT Implementation with Distributed Arithmetic, IEEE Trans. Computers, vol.50, issue.9, pp.985-991, 2001. ,
FIR Filters with Punctured Radix-8 Symmetric Coefficients: Design and Multiplier-Free Realizations, Circuits, Systems & Signal Processing, vol.21, issue.4, pp.345-367, 2002. ,
DOI : 10.1007/s00034-002-0716-z
On the design and implementation of FIR and IIR digital filters with variable frequency characteristics, IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol.49, issue.11, pp.689-703, 2002. ,
DOI : 10.1109/TCSII.2002.807574
Multiplierless perfect reconstruction modulated filter banks with sum-of-powers-of-two coefficients, IEEE Signal Processing Letters, vol.8, issue.6, pp.163-166, 2001. ,
DOI : 10.1109/97.923040
Theory and applications of the double-base number system, IEEE Transactions on Computers, vol.48, issue.10, pp.1098-1106, 1999. ,
DOI : 10.1109/12.805158
Multiplier-free realizations for FIR multirate converters based on mixed-radix number representation, IEEE Transactions on Signal Processing, vol.45, issue.4, pp.880-890, 1997. ,
DOI : 10.1109/78.564176
Evolutionary graph generation system with transmigration capability and its application to arithmetic circuit synthesis, IEE Proceedings - Circuits, Devices and Systems, vol.149, issue.2, pp.97-104, 2002. ,
DOI : 10.1049/ip-cds:20020261
Multiplication by Integer Constants, technical report, 1994. ,
Multiplier Optimization for Small Sets Of Coefficients, Proc. Int'l Workshop Logic and Architecture Synthesis, pp.13-22, 1997. ,
Number-splitting with shift-and-add decomposition for power and hardware optimization in linear DSP synthesis, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.8, issue.4, pp.419-424, 2000. ,
DOI : 10.1109/92.863621
FIR Filter Synthesis Algorithms for Minimizing the Delay and the Number of Adders, IEEE Trans. Circuits and Systems II: Analog and Digital Signal Processing, vol.48, issue.8, pp.770-777, 2001. ,
Design of high-speed multiplierless filters using a nonrecursive signed common subexpression algorithm, IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol.49, issue.3, pp.196-203, 2002. ,
DOI : 10.1109/TCSII.2002.1013866
FIR filter implementation by efficient sharing of horizontal and vertical common subexpressions, Electronics Letters, vol.39, issue.2, pp.251-253, 2003. ,
DOI : 10.1049/el:20030150
Fast and Efficient Algorithm for the Multiplierless Realisation of Linear DSP Transforms A Hierarchical Clustering Method for the Multiple Constant Multiplication Problem, IEE Proc. Circuits, Devices, and Systems, pp.20-211, 1997. ,
Multiplication par une Constante, Re´seaux et Systèmes Re´partis, CalculateursParallèles, vol.13, pp.4-5, 2001. ,
A new algorithm for elimination of common subexpressions, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.18, issue.1, pp.58-68, 1999. ,
DOI : 10.1109/43.739059
Towards an Algorithm for Matrix Multiplier Blocks, Proc. European Conf. Circuit Theory Design, 2003. ,
An improved search algorithm for the design of multiplierless FIR filters with powers-of-two coefficients, IEEE Transactions on Circuits and Systems, vol.36, issue.7, pp.1044-1047, 1989. ,
DOI : 10.1109/31.31347
Polynomial approximations are almost always used when implementing functions on a computing system In most cases, the polynomial that best approximates (for a given distance and in a given interval) a function has coefficients that are not exactly representable with a finite number of bits. And yet, the polynomial approximations that are actually implemented do have coefficients that are represented with a finite?and sometimes small?number of bits. This is due to the finiteness of the floating-point representations (for software implementations), and to the need to have small, hence fast and/or inexpensive, multipliers (for hardware implementations) We then have to consider polynomial approximations for which the degree-i coefficient has at most m i fractional bits; in other words, it is a rational number with denominator 2 m i . We provide a general and efficient method for finding the best polynomial approximation under this constraint. Moreover, our method also applies if some other constraints (such as requiring some coefficients to be equal to some predefined constants or minimizing relative error instead of absolute error) are required ,
F-69364 Lyon Cedex 07 France; email: Nicolas.Brisebarre@ens-lyon.fr, J.-M ,
46 Allée d'Italie, F-69364 Lyon Cedex 07 France 46 Allée d'Italie, F-69364 Lyon Cedex 07 France; email: Arnaud.Tisserand@ens-lyon.fr. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or direct commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific permission and/or a fee, p.1515 ,
Scanning polyhedra with do loops, Proceedings of the 3rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'91), 1991. ,
Polynomials and Polynomials Inequalities, Graduate Texts in Mathematics, vol.161, 1995. ,
Construction of do loops from systems of affine constraints, Parall. Process. Lett, vol.5, pp.421-436, 1995. ,
Scientific Computing on Itanium-Based Systems, 2002. ,
Chebyshev Polynomials in Numerical Analysis, 1972. ,
On integer Chebyshev polynomials, Math. Computat, vol.66, issue.218, pp.763-770, 1997. ,
Computer Approximations, 1968. ,
Loop nest synthesis using the polyhedral library, 1994. ,
URL : https://hal.archives-ouvertes.fr/inria-00074384
Faithful powering computation using table look-up and a fused accumulation tree, Proceedings of the 15th IEEE Symposium on Computer Arithmetic (Arith-15). Burgess and Ciminiera Eds, pp.40-58, 2001. ,
New algorithms for improved transcendental functions on IA- 64, Proceedings of the 14th IEEE Symposium on Computer, pp.4-11, 1999. ,
Polylib, a library of polyhedral functions, version 5.20.0. http://icps. u-strasbg.fr/polylib/. THE SPACES PROJECT MPFR, the multiple precision floating point reliable library, 2004. ,
High-performance architectures for elementary function generation, Proceedings of the 15th IEEE Symposium on Computer Arithmetic, 2001. ,