CN103138714A - Hardware implementation of least means square (LMS) self-adapting filter with high performance - Google Patents

Hardware implementation of least means square (LMS) self-adapting filter with high performance Download PDF

Info

Publication number
CN103138714A
CN103138714A CN2013100871756A CN201310087175A CN103138714A CN 103138714 A CN103138714 A CN 103138714A CN 2013100871756 A CN2013100871756 A CN 2013100871756A CN 201310087175 A CN201310087175 A CN 201310087175A CN 103138714 A CN103138714 A CN 103138714A
Authority
CN
China
Prior art keywords
fir
lms
self
adapting filter
filtration module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013100871756A
Other languages
Chinese (zh)
Inventor
李奚鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SUZHOU LANGKUAN ELECTRONIC TECHNOLOGY Co Ltd
Original Assignee
SUZHOU LANGKUAN ELECTRONIC TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SUZHOU LANGKUAN ELECTRONIC TECHNOLOGY Co Ltd filed Critical SUZHOU LANGKUAN ELECTRONIC TECHNOLOGY Co Ltd
Priority to CN2013100871756A priority Critical patent/CN103138714A/en
Publication of CN103138714A publication Critical patent/CN103138714A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to the technical field of hardware implementation of a self-adaptive filtering algorithm based on LMS (least means square). A self-adapting filter with high performance is designed. The hardware implementation of the LMS self-adapting filter with high performance mainly gives priority to data operation and is based on a delay LMS (DLMS) algorithm. A whole system comprises two modules and two recursive loops, wherein the modules comprise a finite impulse response (FIR) filtration module and a coefficient updating module, and the recursive loops comprise a weight updating loop and an error feedback loop. The FIR filtration module comprises a multiply-accumulate unit unique to the invention, the multiply-accumulate unit is achieved by means of a stream line and a parallel technique, and the multiply-accumulate unit does not vary with the variation of an order when delay occurs. The multiply-accumulate unit optimizes multiplication and addition as a whole, and the multiplication reduces the number of partial products and addition series of an accumulator by utilizing an improved Booth code and 4-2 compressors. A sign function is adopted in the coefficient updating module, and hardware expense required by the module is reduced. According to the hardware implementation of the LMS self-adapting filter with high performance, the self-adapting filter system which is high in speed, small in area and remarkable in filtering effect is achieved.

Description

A kind of hardware of high performance LMS sef-adapting filter is realized
Technical field
The present invention relates to the realization technology field based on the sef-adapting filter of LMS algorithm.
Technical background
In high-speed digital communication, multipath fading and channel distortion may cause serious intersymbol interference (ISI), have become one of main difficulty that digital communication faces.Adaptive equalizer is widely used in eliminating ISI owing to having the performance of following the tracks of time varying signal in digital communication.A core component of adaptive equalizer is exactly sef-adapting filter.Sef-adapting filter has a lot of adaptive algorithms, and LMS is exactly wherein a kind of.Because the LMS hardware algorithm is simple in structure, respond well, therefore use very extensive.
The mathematical formulae of the adaptive principle of LMS algorithm is as follows:
Y k = X k TW k …………………………………… (1)
e k=d k-Y k=d k- X k TW k ………………………… (2)
W k+1=W k+2μe kX k …………………………… (3)
Y in formula (1) kBe filter output; D in formula (2) kBe desired signal.e kBe error signal, be used for adjusting the filter weights coefficient.W kBe k weighted vector constantly, W k=[w 0, w 1, w 2W L-1]; X kBe k input data vector constantly, X k=[x 0, x 1, x 2X L-1].Wherein L is the exponent number of filter, i.e. the number of weights coefficient.Formula (3) is the weights iteration expression formula of LMS algorithm, and wherein μ is step-length, affects convergence rate and the steady-state error of LMS.In the span that guarantees weight convergence, the μ value is larger, and convergence rate is faster, but steady-state error is also larger.Therefore to reasonably select the size of μ value, to satisfy the requirement of system.
In traditional LMS algorithm, FIR filtering and weights coefficient update are completed in the interval at the same time.This has just limited the application of this algorithm under the high speed real time environment.For this limitation, the people such as G Long have proposed time-delay LMS(DLMS) algorithm.In this algorithm, FIR filtration module and right value update module can walk abreast and carry out, and this just uses for the streamline of FIR filtration module possibility is provided.
The mathematical formulae of DLMS algorithm and weights iterative formula are respectively:
Y k-D = X k-D TW k-D-1 ………………………… (4)
e k-D=d k-D-Y k-D …………………………… (5)
W k+1=W k+2μe k-DX k-D …………………………(6)
Summary of the invention
The structure that the present invention proposes can greatly reduce the hardware spending of sef-adapting filter, and by adopting streamline and concurrent technique, has greatly improved the speed of service of system.
The present invention has adopted following technical scheme in order to realize above-mentioned requirements.
By claim 3, adopted DLMS(Delay LMS in the present invention) algorithm.FIR filtration module and coefficient updating module can adopt pipelining to realize like this.For example the FIR filtration module adopts 3 level production lines to realize, inputs 3 the timeticks inputs of also delaying time of data and expected data.
By claim 4 and 5, a kind of special multiply accumulating unit has been proposed in the present invention.Consider that multiply operation can be divided into addition and shifting function, thus can be with multiplication and after multiplying each other cumulatively do as a whole being optimized, the time-delay of this multiply accumulating unit just almost has nothing to do with the exponent number of filter like this.
The requirement 4 of having the right, multiplier in the present invention adopts follow-on Booth to encode to reduce the number of partial product, and introduced 4-2 compression unit and Wallace tree structure and come compression section long-pending, thereby reduced the progression of cumulative operation, reduced the cumulative time-delay of partial product.
The present invention has passed through hardware verification.With Verilog, the modules in the present invention is carried out the RTL coding, after functional simulation, it has been carried out the prototype verification of FPGA under the QuartusII platform.At DC(Design Compiler) carried out logic synthesis under environment, and the emulation after having carried out comprehensively, synthesis result and simulation result all show the present invention designed system can stable operation under the high-frequency clock frequency of 200MHz, and hardware spending is also very little.
Description of drawings
Fig. 1: the system block diagram of the sef-adapting filter of DLMS algorithm
Fig. 2: Direct-type FIR Filter structure chart
Fig. 3: transposition type FIR filter graph architecture
Fig. 4: the quick structure chart of multiply accumulating (MAC) unit
Fig. 5: 4-2 compressor configuration figure
Fig. 6: mimimum adder structure chart
Fig. 7: right value update modular structure figure
Embodiment
In order to reach high performance requirement, the present invention has carried out special optimization to the framework of filter system, and has invented a kind of new multiply accumulating unit, has greatly improved the operating rate of system.
By formula (4), (5), (6) as seen, the DLMS sef-adapting filter has the two-way input, is respectively sampled data input and desired signal input.Therefore have two modules in the DLMS structure: FIR filtration module and coefficient updating module; And two recurrence loops: right value update loop and Error Feedback loop.Its system block diagram as shown in Figure 1.
The FIR filter has two kinds of structures, is respectively Direct-type and transposition type, and their structure chart respectively as shown in Figures 2 and 3.Can find out from Fig. 2 and 3, the maximum delay of Direct-type is Tm+MTa, and the maximum delay of transposition type is Tm+Ta, and wherein Tm is the time-delay of multiplier, and Ta is the time-delay of adder, and M is the exponent number of filter.Seem the time-delay of transposition type FIR and M irrelevant, but along with the increase of M, it is large that input load becomes, must Buffer insertion to increase driving force, M is larger, the time-delay that buffer is introduced is also larger, so that lost speed advantage.Simultaneously the required register number of transpose configuration considerably beyond the needed register number of Direct-type FIR, causes the area waste.Therefore the present invention selects Direct-type FIR, and is optimized FIR from system-level, and introduce pipelining, improve the circuit working clock frequency.
The FIR filter circuit mainly is made of multiplication unit and adder unit, if directly realize according to Fig. 2, the critical path time-delay is too large, is unfavorable for the realization of High Speed of system, therefore should top structure be optimized.In hardware was realized, multiply operation finally showed as shifting function and add operation, is optimized so multiplication and addition in the FIR structure can be used as an integral body.For this reason, the present invention proposes a kind of multiply accumulating (MAC) unit of uniqueness.Using the time-delay of the Direct-type FIR of this MAC unit realization to change with the variation of exponent number M hardly, is probably Tm+Ta.Multiplier in this unit adopts improved B ooth multiplier, and by the 4-2 compressor reducer, partial product is compressed, and selects wallace tree structure optimized circuit progression, adopts at last carry look ahead and carry select method to realize mimimum adder.Whole MAC unit block diagram as shown in Figure 4.
The modified Booth encoding algorithm is extensively to be used in the design of high-speed gear, because it can make partial product quantity reduce half, thereby reduces number of adders and operation time, and multiply operation speed is provided.Improved B ooth algorithm coding rule is shown in Table 1.
Table 1 modified Booth encoding rule
Figure 2013100871756100002DEST_PATH_IMAGE001
Although Booth coding can reduce the partial product of half, but the multiplier bit wide is when very large, the quantity of partial product or a lot.If with the direct addition of these partial products, the time-delay of adder carry chain is too large, the speed of restriction system.Therefore introduced the 4-2 compression unit in the present invention, this unit has very large degree of parallelism, and 5 inputs and 3 outputs are arranged.The 4-2 compressor reducer can equivalence be two full adders, if its structure is optimized, can so that its time-delay less than the time-delay of two full adders.Exactly the great advantage of 4-2 compressor reducer is delayed time irrelevant with carry, when being about to N 4-2 compressor reducer and connecting, total time-delay equal the time-delay of a 4-2 compressor reducer.So the 4-2 compressor reducer is often long-pending for the fast multiplier compression section with the array structure of wallace tree, is made of 3 4-2 compressor reducers exactly as the 8-2 compressor reducer in Fig. 4.In the present invention 4-2 compressor reducer used is through after optimizing, 1.5 times of only delaying time for full adder of its time-delay.Its logic diagram such as Fig. 5.Last operation in the FIR filtration module is quick addition, is used for final two the partial product additions after overcompression.The time-delay of traditional ripple carry adder is directly proportional to the figure place N of addition number, and circuit delay is too large, is not suitable in High Speed System.So the present invention adopts carry look ahead and square root carry select structure to realize mimimum adder, its structure chart as shown in Figure 6.
According to formula (6), the right value update circuit needs two multipliers and an adder, but considers that step size mu is constant, can realize with shifting function with multiplying each other of constant, thereby save a multiplier.Even like this, the right value update circuit still needs 1 multiplier and 1 adder, and hardware spending is still very large, and circuit delay is also larger.For this problem, someone has proposed a kind of sign LMS algorithm, and this algorithm becomes the weights iterative formula: W k+1=W k+ 2 μ e k-DSgn (X k-D).By above formula as seen, the right value update circuit only needs 1 adder and some shifting functions just can realize, thereby has saved hardware spending widely.Certainly this is take sacrificial system performance (convergence rate and steady-state error) as cost.The structure of right value update module such as Fig. 7.

Claims (10)

1. based on the adaptive filter system of LMS, it is characterized in that:
This invention is one and is treated to main filter system with data, and the performance of data channel is crucial.
2. in native system, the FIR filtration module is in data path, therefore the performance of system in its performance decision.
3. this system requirements has at a high speed.
4. therefore according to claim 1, can find to adopt streamline and concurrent technique to improve speed as the FIR filtration module of core data processing module.
5. according to traditional LMS algorithm, the FIR filtration module need to be completed in one-period.
6. and according to claim 2, the FIR filtration module need to adopt streamline to realize reaching requirement at a high speed, so need to seek follow-on LMS algorithm.
7.FIR mainly containing delay unit, multiplier and accumulator, filtration module forms.
8. according to claim 2, multiplier need to adopt combinational multiplier to realize reaching the high speed requirement, and needs to consider special construction; Accumulator should adopt special adder structure.
9. the exponent number of the filter system in the present invention is uncertain, means when exponent number is very large, and the speed of system also should be very high.
10. and the time-delay of traditional FIR filter is relevant with the exponent number of filter, thus should traditional FIR filter structure be improved, to adapt to requirement of the present invention.
CN2013100871756A 2013-03-19 2013-03-19 Hardware implementation of least means square (LMS) self-adapting filter with high performance Pending CN103138714A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013100871756A CN103138714A (en) 2013-03-19 2013-03-19 Hardware implementation of least means square (LMS) self-adapting filter with high performance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013100871756A CN103138714A (en) 2013-03-19 2013-03-19 Hardware implementation of least means square (LMS) self-adapting filter with high performance

Publications (1)

Publication Number Publication Date
CN103138714A true CN103138714A (en) 2013-06-05

Family

ID=48498117

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013100871756A Pending CN103138714A (en) 2013-03-19 2013-03-19 Hardware implementation of least means square (LMS) self-adapting filter with high performance

Country Status (1)

Country Link
CN (1) CN103138714A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107820677A (en) * 2017-09-18 2018-03-20 深圳市汇顶科技股份有限公司 Determine the method and its device, terminal of filter coefficient
CN108616466A (en) * 2018-05-09 2018-10-02 清华大学 Channel parallel equalization methods and device
CN108964635A (en) * 2018-06-15 2018-12-07 西安空间无线电技术研究所 A kind of Flouride-resistani acid phesphatase high-order high speed can parameter reconstruct FIR filter design method and system
CN110649912A (en) * 2018-06-27 2020-01-03 深圳光启尖端技术有限责任公司 Modeling method of spatial filter
CN112199912A (en) * 2020-09-22 2021-01-08 重庆邮电大学 Self-adaptive algorithm modular design method based on FPGA
CN113381730A (en) * 2021-05-19 2021-09-10 浙江传媒学院 Robustness self-adaptive filtering system
CN114580628A (en) * 2022-03-14 2022-06-03 北京宏景智驾科技有限公司 Efficient quantization acceleration method and hardware circuit for neural network convolution layer
CN115438790A (en) * 2021-06-04 2022-12-06 合肥本源量子计算科技有限责任公司 Quantum state information processing system, quantum measurement and control system and quantum computer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006068635A1 (en) * 2004-11-15 2006-06-29 Qualcomm Incorporated Adaptive filter for transmit leakage signal rejection
CN101082812A (en) * 2006-06-01 2007-12-05 模拟装置公司 Self-compensation closed-loop adaptive control system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006068635A1 (en) * 2004-11-15 2006-06-29 Qualcomm Incorporated Adaptive filter for transmit leakage signal rejection
CN101082812A (en) * 2006-06-01 2007-12-05 模拟装置公司 Self-compensation closed-loop adaptive control system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈文博 等: "《高速自适应DLMS算法及其硬件实现》", 《信息技术》, no. 10, 28 December 2007 (2007-12-28) *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107820677A (en) * 2017-09-18 2018-03-20 深圳市汇顶科技股份有限公司 Determine the method and its device, terminal of filter coefficient
CN108616466A (en) * 2018-05-09 2018-10-02 清华大学 Channel parallel equalization methods and device
CN108964635A (en) * 2018-06-15 2018-12-07 西安空间无线电技术研究所 A kind of Flouride-resistani acid phesphatase high-order high speed can parameter reconstruct FIR filter design method and system
CN110649912A (en) * 2018-06-27 2020-01-03 深圳光启尖端技术有限责任公司 Modeling method of spatial filter
CN112199912A (en) * 2020-09-22 2021-01-08 重庆邮电大学 Self-adaptive algorithm modular design method based on FPGA
CN112199912B (en) * 2020-09-22 2022-03-22 重庆邮电大学 Self-adaptive algorithm modular design method based on FPGA
CN113381730A (en) * 2021-05-19 2021-09-10 浙江传媒学院 Robustness self-adaptive filtering system
CN115438790A (en) * 2021-06-04 2022-12-06 合肥本源量子计算科技有限责任公司 Quantum state information processing system, quantum measurement and control system and quantum computer
CN115438790B (en) * 2021-06-04 2024-05-07 本源量子计算科技(合肥)股份有限公司 Quantum state information processing system, quantum measurement and control system and quantum computer
CN114580628A (en) * 2022-03-14 2022-06-03 北京宏景智驾科技有限公司 Efficient quantization acceleration method and hardware circuit for neural network convolution layer

Similar Documents

Publication Publication Date Title
CN103138714A (en) Hardware implementation of least means square (LMS) self-adapting filter with high performance
Sudhashree et al. Analysis of Low Complexity Memory Footprint Reduction for Delay and Area Efficient Realization of 2D FIR Filters
CN103227623B (en) The LMS adaptive filter algorithm of variable step size and filter
Cooklev An efficient architecture for orthogonal wavelet transforms
CN102185587B (en) Low-power-consumption multi-order interpolation half-band filter with two-phase structure
Safarian et al. FPGA implementation of LMS-based FIR adaptive filter for real time digital signal processing applications
CN106936406A (en) A kind of realization of 5 parallel rapid finite impact response filter
Akash et al. An efficient implementation of FIR filter using high speed adders for signal processing applications
CN103853524A (en) Multiplier device and multiplying method
Tsuda et al. An improved NLMS algorithm for channel equalization
CN108897526B (en) Compound finite field inverter based on multiple square operations and inversion method thereof
CN101494448B (en) Serial self-adapting noise elimination module based on FPGA design
Tiwari et al. High throughput adaptive block FIR filter using distributed arithmetic
Mankar et al. Design and Verification of low power DA-Adaptive digital FIR filter
Sreegul et al. An approximation algorithm for reducing the number of non-zero bits in the filter coefficients
Yadav et al. Comparative analysis of digital IIR filter using add and shift method on Xilinx platform
Mullins et al. Alternative least mean square adaptive filter architectures for implementation on field programmable gate arrays
Chandra et al. Differential evolution based design of multiplier-less FIR filter using canonical signed digit representation
Qiu et al. The design of FIR band-pass filter with improved distributed algorithm based on FPGA
Chulet et al. FIR filter designing using wallace multiplier
Maity et al. A comparative study on FPGA based FIR filter using broadcast structure and overlap save method
Meyer-Baese et al. Infinite impulse response (IIR) digital filters
Shahbazi et al. Low power FPGA implementation of a transposed form FIR filter with differential input technique
REDDY et al. Implementation of Area and Power Optimized Adaptive Filter using LMS Algorithm
Durgagopal et al. Power Optimized Low Pass Digital FIR Filter Using Five Modular Redundancy and Parallel Adding Methods

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130605