OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

VLSI Design 2014

Low-Area Wallace Multiplier

DOI: 10.1155/2014/343960

Shahzad Asif,Yinan Kong

Full-Text Cite this paper Add to My Lib

Abstract:

Multiplication is one of the most commonly used operations in the arithmetic. Multipliers based on Wallace reduction tree provide an area-efficient strategy for high speed multiplication. A number of modifications are proposed in the literature to optimize the area of the Wallace multiplier. This paper proposed a reduced-area Wallace multiplier without compromising on the speed of the original Wallace multiplier. Designs are synthesized using Synopsys Design Compiler in 90？nm process technology. Synthesis results show that the proposed multiplier has the lowest area as compared to other tree-based multipliers. The speed of the proposed and reference multipliers is almost the same. 1. Introduction Multiplication is one of the most widely used arithmetic operations. Due to this a wide range of multiplier architectures are reported in the literature providing flexible choices for various applications. Among them the simplest is array multiplier [1] which is also the slowest. Some high performance multipliers are presented in [2–5]. The focus of this paper is Wallace multiplier [6]. Wallace multiplier uses full adders and half adders to reduce the partial product tree to two rows, and then a final adder is used to add these two rows of partial products. We call this design “TW (traditional Wallace) multiplier” in this text. TW multiplier performs its operation in three steps. (1) Generate all the partial products. (2) The partial product tree is reduced using full adders and half adders until it is reduced to two terms. (3) Finally, a fast adder is used to add these two terms. Waters and Swartzlander [7] presented a reduced complexity Wallace multiplier by reducing the number of half adders in the reduction process. We call this design “RCW (reduced complexity Wallace) multiplier” from now on. The speed of the RCW multiplier is expected to be the same as of TW multiplier due to the equal number of reduction stages in both multipliers. The RCW uses a larger final adder as compared to the TW multiplier. A number of strategies are reported in [8–10] to improve the speed of the RCW. However, the focus of their research is to reduce the delay by using a faster final adder while still using the same reduction tree as RCW. As a result, the final adder size for the multipliers in [8–10] is the same as that of RCW. The focus of this paper is to optimize the reduction tree in a way that can reduce the size of the final adder. The reduced size of the final adder resulted in low area of the multiplier without incurring any extra delay. We call our design “PW (Proposed

References

[1]	N. H. E. Weste and D. M. Harris, Integrated Circuit Design, Pearson, 2010.
[2]	J.-Y. Kang and J.-L. Gaudiot, “A fast and well-structured multiplier,” in Proceedings of the EUROMICRO Systems on Digital System Design (DSD '04), pp. 508–515, September 2004.
[3]	C. R. Baugh and B. A. Wooley, “A twos complement parallel array multiplication algorithm,” IEEE Transactions on Computers, vol. C-22, no. 12, pp. 1045–1047, 1973.
[4]	S.-R. Kuang, J.-P. Wang, and C.-Y. Guo, “Modified booth multipliers with a regular partial product array,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 56, no. 5, pp. 404–408, 2009.
[5]	B. C. Paul, S. Fujita, and M. Okajima, “ROM-based logic (RBL) Design: a low-power 16 bit multiplier,” IEEE Journal of Solid-State Circuits, vol. 44, no. 11, pp. 2935–2942, 2009.
[6]	C. S. Wallace, “A suggestion for a fast multiplier,” IEEE Transactions on Electronic Computers, vol. EC-13, no. 1, pp. 14–17, 1964.
[7]	R. S. Waters and E. E. Swartzlander, “A reduced complexity wallace multiplier reduction,” IEEE Transactions on Computers, vol. 59, no. 8, pp. 1134–1137, 2010.
[8]	S. Rajaram and K. Vanithamani, “Improvement of Wallace multipliers using parallel prefix adders,” in Proceedings of the International Conference on Signal Processing, Communication, Computing and Networking Technologies (ICSCCN '11), pp. 781–784, July 2011.
[9]	P. Jagadeesh, S. Ravi, and K. H. Mallikarjun, “Design of high performance 64 bit mac unit,” in Proceedings of the International Conference on Circuits, Power and Computing Technologies (ICCPCT '13), pp. 782–786, March 2013.
[10]	M. Kumaran and M. Kamarajan, “Multicore embedded system using parallel processing technique,” International Journal of Emerging Trands in Electrical and Electronics, vol. 5, no. 3, 2013.
[11]	L. Dadda, “Some schemes for parallel multipliers,” Alta Frequenza, vol. 34, pp. 349–356, 1965.
[12]	P. M. Kogge and H. S. Stone, “A parallel algorithm for the efficient solution of a general class of recurrence equations,” IEEE Transactions on Computers, vol. C-22, no. 8, pp. 786–793, 1973.
[13]	J. Sklansky, “Conditional-sum addition logic,” IRE Transactions on Electronic Computers, vol. EC-9, pp. 226–231, 1960.
[14]	R. P. Brent and H. T. Kung, “A regular layout for parallel adders,” IEEE Transactions on Computers, vol. C-31, no. 3, pp. 260–264, 1982.
[15]	R. Ward and T. Molteno, “Table of linear feedback shift registers,” Datasheet, Department of Physics, University of Otago, 2007.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133