An Optimized Hardware Architecture for the Montgomery Multiplication Algorithm

**seminar ideas** · 07-05-2012, 04:31 PM

An Optimized Hardware Architecture for the Montgomery Multiplication Algorithm

.ppt

mm.ppt (Size: 681.5 KB / Downloads: 35)

One PE is in charge of the computation of one column that corresponds to the updating of S with respect to one single bit Xi.
The delay between two contiguous PEs is 2 clock cycles.
The minimum computation time in terms of clock cycle is 2•n+e given (e+1)/2 PEs are implemented to work in parallel.

Avoid the extra clock cycle delay

One singe PE is responsible to update one fixed word in S
It has two branches corresponding to two possibilities of S(i+1)0
The correct results, the carry and the S(i)w-1, is selected from two sets of possible results by S(i+1)0, both available and registered at the same moment

The overall architecture

e PEs are required to compute the e words in S respectively.
Two shift registers, one providing single bits in X and one providing the parities of S(0)0, parallel these PEs.
(n+e-1) clock cycles are required to process the Montgomery multiplication of two n-bit operands.

Conclusion

An optimized hardware architecture to implement MWR2MM algorithm is proposed
The radix-2 version of this architecture takes (n+e-1) clock cycles to process the Montgomery multiplication of two n-bit operands
Compared to original architecture by Tenca & Koç, the new approach takes half time for processing and introduces less than 10% area penalty
The same optimization technique can be applied onto the original architecture by Tenca & Koç, keeping the scalability while reducing the processing latency to half

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	AN EDGE-BASED FACE DETECTION ALGORITHM ROBUST AGAINST ILLUMINATION, FOCUS, AND SCALE	seminar class	1	712,842	19-09-2017, 10:31 AM Last Post: jaseela123
	Hardware of computer	seminar addict	1	32,357	15-09-2017, 01:33 PM Last Post: jaseela123
	A Change Information Based Fast Algorithm for Video Object Detection and Tracking	seminar ideas	1	7,728	06-09-2017, 01:24 PM Last Post: jaseela123
	Cognitive architecture	computer science crazy	0	8,814,404	25-08-2017, 09:32 PM Last Post: computer science crazy
	RTPS BASED SCHEDULING ALGORITHM USING SWIM IN IEEE 802.16e MOBILE WIMAX NETWORK	dhanabhagya	0	480	13-02-2016, 11:00 AM Last Post: dhanabhagya
	Design of 2-D Filters using a Parallel Processor Architecture	presentation Abstract	0	429	29-05-2015, 02:58 PM Last Post: presentation Abstract
	Compute Unified Device Architecture CUDA	presentation Abstract	0	395	27-05-2015, 03:42 PM Last Post: presentation Abstract
	A Major Seminar on A Novel Architecture for Domain Specific Parallel Crawler	study tips	0	823	30-05-2013, 11:50 AM Last Post: study tips
	DEAL (Data Encryption Algorithm with Larger blocks)	computer science crazy	1	12,169,620	03-08-2012, 10:58 AM Last Post: seminar ideas
	3D FACE RECOGNITION USING 3D RPROP ALGORITHM	seminar flower	0	1,918	13-06-2012, 04:44 PM Last Post: seminar flower

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.