, Table 13.1: Syntheses and accuracy measurements of a FIR filter generated using Matlab and the proposed method, vol.224
, XC2000 Logic Cell Array Families. Xilinx Corporation, 1985.
, FIR suite, 2009.
, 7 Series DSP48E1 Slice User Guide (UG479). Xilinx Corporation
, 7 Series FPGAs Configurable Logic Block User Guide (UG474). Xilinx Corporation
, 7 Series FPGAs Memory Resources User Guide (UG473). Xilinx Corporation
, Stratix 10 Embedded Memory User Guide (ug-s10-memory), 2016.
, Stratix 10 Logic Array Blocks and Adaptive Logic Modules User Guide (ug-s10-lab), 2016.
, Stratix 10 Variable Precision DSP Blocks User Guide (ug-s10-dsp), 2016.
Signal Processing Algorithms and Architectures, 1982. ,
Exact and approximate algorithms for the optimization of area and delay in multiple constant multiplications, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.27, issue.6, pp.1013-1026, 2008. ,
A survey of CORDIC algorithms for FPGA based computers, Proceedings of the 1998 ACM/SIGDA Sixth International Symposium on Field Programmable Gate Arrays, FPGA '98, pp.191-200, 1998. ,
, Digital Signal Processing: Signals, Systems, and Filters, 2005.
On computing the worst-case peak gain of linear systems, Systems & Control Letters, vol.19, pp.265-269, 1992. ,
Multipliers for floating-point double precision and beyond on FPGAs, ACM SIGARCH Computer Architecture News, vol.38, issue.4, pp.73-79, 2011. ,
URL : https://hal.archives-ouvertes.fr/ensl-00475781
A two's complement parallel array multiplication algorithm, IEEE Transactions on Computers, vol.22, issue.12, pp.1045-1047, 1973. ,
Partitioning a square into rectangles: NP-completeness and approximation algorithms, Algorithmica, vol.34, issue.3, pp.217-239, 2002. ,
URL : https://hal.archives-ouvertes.fr/hal-00807407
Pi and the AGM: A Study in the Analytic Number Theory and Computational Complexity, 1987. ,
Some optimizations of hardware multiplication by constant matrices, IEEE Transactions on Computers, vol.54, issue.10, pp.1271-1282, 2005. ,
URL : https://hal.archives-ouvertes.fr/lirmm-00113092
Comparison of peak and RMS gains for discrete-time systems, Systems & Control Letters, vol.9, issue.1, pp.1-6, 1987. ,
An area estimation methodology for FPGA based designs at SystemC-level, pp.129-132, 2004. ,
A lattice basis reduction approach for the design of quantized FIR filters, 2016. ,
Computer-aided design of VLSI FIR filters, Proceedings of the IEEE, vol.75, issue.9, pp.1260-1271, 1987. ,
A methodology for automatic hardware synthesis of multiplier-less digital filters with prescribed output accuracy, APCCAS 2006 - 2006 IEEE Asia Pacific Conference on Circuits and Systems, pp.61-64, 2006. ,
Fast integer multipliers fit in FPGAs, 1994. ,
Fixed-point CORDIC-based QR decomposition by Givens rotations on FPGA, 2011 International Conference on Reconfigurable Computing and FPGAs, pp.327-332, 2011. ,
Tools and techniques for efficient high-level system design on FPGAs, 2014. ,
, Tiling rectangles with rectangles. Mathematics Magazine, vol.55, pp.286-291, 1982.
Some schemes for parallel multipliers, Alta frequenza, vol.34, issue.5, pp.349-356, 1965. ,
Faithful bipartite ROM reciprocal tables, ARITH'12, pp.17-28, 1995. ,
Characterizing tileorders, Order, vol.10, issue.2, pp.111-128, 1993. ,
When FPGAs are better at floating-point than microprocessors, 2007. ,
URL : https://hal.archives-ouvertes.fr/ensl-00174627
Table-based division by small integer constants, International Symposium on Applied Reconfigurable Computing, pp.53-63, 2012. ,
URL : https://hal.archives-ouvertes.fr/ensl-00642145
Automatic generation of polynomial-based hardware architectures for function evaluation, ASAP 21st IEEE International Conference on Application-specific Systems, Architectures and Processors, pp.216-222, 2010. ,
URL : https://hal.archives-ouvertes.fr/ensl-00470506
Large multipliers with fewer DSP blocks, International Conference on Field Programmable Logic and Applications, pp.250-255, 2009. ,
Designing custom arithmetic data paths with FloPoCo, IEEE Design & Test of Computers, vol.28, issue.4, pp.18-27, 2011. ,
URL : https://hal.archives-ouvertes.fr/ensl-00646282
A 128-tap complex FIR filter processing 20 giga-samples/s in a single FPGA, 44th Asilomar Conference on Signals, Systems & Computers, 2010. ,
URL : https://hal.archives-ouvertes.fr/ensl-00542950
Some improvements on multipartite table methods, Proceedings 15th IEEE Symposium on Computer Arithmetic. ARITH-15, pp.128-135, 2001. ,
URL : https://hal.archives-ouvertes.fr/inria-00072577
Floating-point trigonometric functions for FPGAs, International Conference on Field Programmable Logic and Applications, pp.29-34, 2007. ,
Reciprocation, square root, inverse square root, and some elementary functions using small multipliers. Computers, IEEE Transactions on, vol.49, issue.7, pp.628-637, 2000. ,
URL : https://hal.archives-ouvertes.fr/hal-02101940
Digital arithmetic, p.247, 2004. ,
URL : https://hal.archives-ouvertes.fr/ensl-00542215
CNP: An FPGA-based processor for convolutional networks, 2009 International Conference on Field Programmable Logic and Applications, pp.32-37, 2009. ,
A robust and scalable implementation of the Parks-McClellan algorithm for designing FIR filters, ACM Transactions on Mathematical Software (TOMS), vol.43, issue.1, p.7, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01136005
On division by functional iteration. Computers, IEEE Transactions, issue.19, pp.702-706, 1970. ,
Methods of pipelining a data path in an integrated circuit, US Patent, vol.8, p.71, 2014. ,
Computing elementary functions: A new approach for achieving high accuracy and good performance, Accurate Scientific Computations, pp.1-16, 1986. ,
An accurate elementary mathematical library for the IEEE floating point standard, ACM Transactions on Mathematical Software, p.17, 1991. ,
Automated extra pipeline analysis of applications mapped to Xilinx UltraScale+ FPGAs, Field Programmable Logic and Applications, 2016. ,
FPGA-implementation of atan(Y/X) based on logarithmic transformation and LUT-based techniques, Journal of Systems Architecture, p.56, 2010. ,
Low-Power FPGA-Implementation of atan(Y/X) Using Look-Up Table Methods for Communication Applications, Journal of Signal Processing Systems, p.56, 2008. ,
Reconfigurable Computing: The Theory and Practice of FPGA-Based Computation, 2007. ,
Reliable implementation of linear filters with fixed-point arithmetic, SiPS, Workshop on Signal Processing Systems, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-01076048
Table size reduction methods for faithfully rounded lookup-table-based multiplierless function evaluation. Circuits and Systems II: Express Briefs, IEEE Transactions on, vol.62, issue.5, pp.466-470, 2015. ,
A 400-MHz processor for the conversion of rectangular to polar coordinates in 0.25-µm CMOS, IEEE Journal of Solid-State Circuits, vol.38, p.248, 2003. ,
Building custom FIR filters using system generator, Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications, FPL '02, pp.1101-1104, 2002. ,
, IEEE standard for information technology-local and metropolitan area networks-specific requirements-part 15.4: Wireless medium access control (MAC) and physical layer (PHY) specifications for low rate wireless personal area networks (WPANs), IEEE Std, vol.802, pp.1-320, 2006.
Automatic layout synthesis for FIR filters using a silicon compiler, IEEE International Symposium on, vol.4, pp.2588-2591, 1990. ,
Efficient initial approximation for multiplicative division and square root by a multiplication with operand modification. Computers, IEEE Transactions on, vol.46, issue.4, pp.495-498, 1997. ,
An efficient universal addition scheme for all hybrid-redundant representations with weighted bit-set encoding, vol.42, pp.149-158, 2006. ,
FIRGEN: a computer-aided design system for high performance FIR filter integrated circuits, IEEE Transactions on Signal Processing, vol.39, issue.7, pp.1655-1668, 1991. ,
A Fast Area-Delay Estimation Technique for RTL Component Generators, 1992. ,
Linear Systems, 1980. ,
Nonrecursive digital filter design using the I 0-sinh window function, Proc. 1974 IEEE International Symposium on Circuits & Systems, pp.20-23, 1974. ,
Multiplication of multidigit numbers on automata, Soviet Physics Doklady, vol.7, p.595, 1963. ,
FPGA implementation of matrix inversion using QRD-RLS algorithm, Asilomar Conference on Signals, Systems, and Computers, 2005. ,
DOI : 10.1109/acssc.2005.1600043
URL : https://scholarship.rice.edu/bitstream/1911/20002/1/Kar2005Oct5FPGAImplem.PDF
Advanced FPGA Design: Architecture, Implementation, and Optimization, 2007. ,
Implications of historical trends in the electrical efficiency of computing, IEEE Annals of the History of Computing, vol.33, issue.3, pp.46-54, 2011. ,
Dynamically reconfigurable FIR filter architectures with fast reconfiguration, 8th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC), pp.1-8, 2013. ,
DOI : 10.1109/recosoc.2013.6581517
Efficient high speed compression trees on Xilinx FPGAs, Methoden und Beschreibungssprachen zur Modellierung und Verifikation von Schaltungen und Systemen (MBMV), pp.171-182, 2014. ,
Pipelined compressor tree optimization using integer linear programming, 24th International Conference on Field Programmable Logic and Applications (FPL), pp.1-8, 2014. ,
DOI : 10.1109/fpl.2014.6927468
Retiming synchronous circuitry, Algorithmica, vol.6, issue.1, pp.5-35, 1991. ,
DOI : 10.1007/bf01759032
URL : http://www.cs.columbia.edu/~cs6861/handouts/leiserson-algorithmica-88.pdf
Factoring polynomials with rational coefficients, Mathematische Annalen, vol.261, issue.4, pp.515-534, 1982. ,
Problems, complete in “average” instance, Proceedings of the Sixteenth Annual ACM Symposium on Theory of Computing, STOC '84, p.465, 1984. ,
Average case complete problems, SIAM Journal on Computing, vol.15, issue.1, pp.285-286, 1986. ,
DOI : 10.1007/978-1-4612-4808-8_26
Formatting bits to better implement signal processing algorithms, Pervasive and Embedded Computing and Communication Systems, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01076049
, An FPGA-based cognitive radio framework. IET Conference Proceedings, pp.138-143, 2008.
DOI : 10.1049/cp:20080652
A memory-efficient tables-and-additions method for accurate computation of elementary functions. Computers, IEEE Transactions on, vol.62, issue.5, pp.858-872, 2013. ,
DOI : 10.1109/tc.2012.43
An enhanced and embedded GNU Radio flow, 24th International Conference on Field Programmable Logic and Applications (FPL), pp.1-4, 2014. ,
DOI : 10.1109/fpl.2014.6927427
An exact approach for GPC-based compressor tree synthesis, IEICE Transactions on Fundamentals of Electronics, vol.96, issue.12, pp.2553-2560, 2013. ,
Automated design of DSP array processor chips, Proceedings. International Conference on, pp.33-44, 1994. ,
A computer program for designing optimum FIR linear phase digital filters, IEEE Transactions on Audio and Electroacoustics, vol.21, issue.6, pp.506-526, 1973. ,
Synthesis of multiplier-less FIR filters with minimum number of additions, IEEE/ACM International Conference on Computer-Aided Design, pp.668-671, 1995. ,
DOI : 10.1109/iccad.1995.480201
50 years of CORDIC: Algorithms, architectures, and applications, IEEE Transactions on Circuits and Systems I: Regular Papers, vol.56, issue.9, pp.1893-1907, 2009. ,
Journal of VLSI signal processing systems for signal, image and video technology, vol.24, pp.211-221, 2000. ,
Fast and accurate resource estimation of automatically generated custom DFT IP cores, Proceedings of the international symposium on Field programmable gate arrays-FPGA'06, p.211, 2006. ,
Automatic implementation of FIR filters on field programmable gate arrays, IEEE Signal Processing Letters, vol.2, issue.3, pp.51-53, 1995. ,
Five, six, and seven-term Karatsuba-like formulae, IEEE Transactions on Computers, vol.54, issue.3, pp.362-369, 2005. ,
Hard tiling problems with simple tiles. Discrete & Computational Geometry, vol.26, pp.573-590, 2001. ,
Cramming more components onto integrated circuits, reprinted from Electronics, vol.38, pp.33-35, 1965. ,
A few results on table-based methods, Reliable Computing, vol.5, issue.3, pp.279-288, 1999. ,
URL : https://hal.archives-ouvertes.fr/hal-02101969
Elementary functions, 2006. ,
URL : https://hal.archives-ouvertes.fr/ensl-00989001
Tiling large rectangles. Mathematics Magazine, vol.75, pp.372-380, 2002. ,
Accurate area and delay estimators for FPGAs, Proceedings 2002 Design, Automation and Test in Europe Conference and Exhibition, pp.862-869, 2002. ,
FPGA-specific arithmetic optimizations of short-latency adders, 21st International Conference on Field Programmable Logic and Applications, pp.232-237, 2011. ,
URL : https://hal.archives-ouvertes.fr/ensl-00542389
A method for speed optimized partial product reduction and generation of fast parallel multipliers using an algorithmic approach, IEEE Transactions on Computers, vol.45, issue.3, pp.294-306, 1996. ,
Discrete-Time Signal Processing, 2010. ,
Compressor tree synthesis on commercial high-performance FPGAs, ACM Transactions on Reconfigurable Technology and Systems (TRETS), vol.4, issue.4, p.39, 2011. ,
Chebyshev Approximation for Nonrecursive Digital Filters with Linear Phase, IEEE Transactions on Circuit Theory, vol.19, issue.2, pp.189-194, 1972. ,
Correctly rounded floating-point division for DSP-enabled FPGAs, 22nd International Conference on Field Programmable Logic and Applications (FPL), pp.249-254, 2012. ,
White-Rabbit Based Revolution Frequency Program for the Longitudinal Beam Control of the CERN PS, 2015. ,
Efficient substitution of multiple constant multiplications by shifts and additions using iterative pairwise matching, ACM IEEE Design Automation Conference, pp.189-194, 1994. ,
Multiple-precision division, Commun. ACM, vol.4, issue.2, p.98, 1961. ,
Efficient Approximations for the Four-Quadrant Arctangent Function, Canadian Conference on Electrical and Computer Engineering, 2006. ,
Sur le calcul effectif des polynomes d'approximation de Tchebichef. Comptes rendus hebdomadaires des séances de l'Académie des Sciences, vol.199, pp.337-340, 1934. ,
A low-power implementation of arctangent function for communication applications using FPGA, Signal Design and its Applications in Communications, 2009. ,
VLSI Physical Design Automation: Theory and Practice, 1994. ,
Hardware/software co-design of an FPGA-based embedded tracking system, Conference on Computer Vision and Pattern Recognition Workshop, pp.123-123, 2006. ,
Symmetric bipartite tables for accurate function approximation, ARITH'13, pp.175-183, 1997. ,
Fast and accurate resource estimation of RTL-based designs targeting FPGAs, Field Programmable Logic and Applications, International Conference on, 2008. ,
The asymptotics of optimal (equiripple) filters, IEEE Transactions on Signal Processing, vol.47, issue.4, pp.1087-1098, 1999. ,
FIReWORK: FIR filters hardware structures auto-generator, Journal of Applied Computer Science, vol.21, issue.1, pp.135-149, 2013. ,
Low-complexity FPGA implementation of compressive sensing reconstruction, Computing, Networking and Communications (ICNC), 2013 International Conference on, pp.671-675, 2013. ,
Optimal circuits for parallel multipliers, IEEE Transactions on Computers, vol.47, issue.3, pp.273-285, 1998. ,
The symmetric table addition method for accurate function approximation, vol.21, pp.167-177, 1999. ,
New algorithms for improved transcendental functions on IA-64, 14th IEEE Symposium on Computer Arithmetic, 1999. ,
Merged arithmetic, IEEE Transactions on Computers, vol.29, issue.10, pp.946-950, 1980. ,
Cutting and packing problems: a categorized, application-orientated research bibliography, Journal of the Operational Research Society, vol.43, issue.7, pp.691-706, 1992. ,
Generating a power of an operand by a table look-up and a multiplication, ARITH'13, pp.126-131, 1997. ,
Modified CORDIC algorithm with reduced iterations, Electronics Letters, vol.15, issue.25, pp.950-951, 1989. ,
Journal of VLSI signal processing systems for signal, image and video technology, vol.32, pp.207-222, 2002. ,
Bounded tiling, an alternative to satisfiability, Proceedings of the 2nd Frege Memorial Conference, pp.401-407, 1984. ,
System-level retiming and pipelining, Field-Programmable Custom Computing Machines (FCCM), International Symposium on, pp.80-87, 2014. ,
Data-flow transformations to maximize the use of carry-save representation in arithmetic circuits, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol.27, issue.10, pp.1761-1774, 2008. ,
The CORDIC trigonometric computing technique, IRE Transactions on Electronic Computers, EC, vol.8, issue.3, pp.330-334, 1959. ,
Reliable evaluation of the Worst-Case Peak Gain matrix in multiple precision, IEEE Symposium on Computer Arithmetic, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01083879
Determining fixed-point formats for a digital filter implementation using the worst-case peak-gain measure, Asilomar Conference on Signals, Systems and Computers, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01308403
Multiplierless multiple constant multiplication, ACM Trans. Algorithms, issue.2, p.3, 2007. ,
A suggestion for a fast multiplier, IEEE Transactions on Electronic Computers, EC, vol.13, issue.1, pp.14-17, 1964. ,
A unified algorithm for elementary functions, Spring Joint Computer Conference, AFIPS '71 (Spring), pp.379-385, 1971. ,
Complex function approximation using two-dimensional interpolation, IEEE Transactions on Computers, vol.63, issue.12, pp.2948-2960, 2014. ,
Applications of distributed arithmetic to digital signal processing: a tutorial review, IEEE ASSP Magazine, vol.6, issue.3, pp.4-19, 1989. ,
Practical analysis: graphical and numerical methods, 1948. ,
Journal of VLSI signal processing systems for signal, image and video technology, vol.36, pp.7-15, 2004. ,
Fast evaluation of the elementary functions in single precision. Computers, IEEE Transactions on, vol.44, issue.3, pp.453-457, 1995. ,
A functional silicon compiler for high speed FIR digital filters, Acoustics, Speech, and Signal Processing, pp.1329-1332, 1990. ,