Thats about an actual implementation of multiplier (as I said above). You may chose parallel implementation and then you have no problem for further serializing it. But such implementation consumes much more space and power. On the other hand you may choose an iterative implementation and still get a fully pipelined multiplier if you can implement an iterative stage in one cycle but this specific stage is not pipelineable further (in case you want to rise the freq.). I know nothing about the actual implementation of multipliers in Intel's cpu, but my point is that "pipelineablilty" depends on specific design decision.




Reply With Quote
Bookmarks