[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Patches] [PATCH] ARM: NEON detected memcpy.



On Wed, Apr 03, 2013 at 11:47:36PM +0800, Shih-Yuan Lee (FourDollars) wrote:
> Hi Joseph,
>
... 
> > I was previously told by people at ARM that NEON memcpy wasn't a good idea
> > in practice because of raised power consumption, context switch costs etc.
> > from using NEON in processes that otherwise didn't use it, even if it
> > appeared superficially beneficial in benchmarks.
> >
> About raised power consumption and context switch costs, I may be able
> to add some option in configure for the users to decide if they want
> to use this feature or not.
> How do you think?
> 
Configure option is bit overkill. 

You need to compare neon/other implementation speed. Then determine
size where neon is faster if we include energy cost and context switch. 
My first estimate is use neon when larger than 4096 bytes. 

However to determine context switch cost of neon you must account network effect.

If you use neon in one function that is called sufficiently often (to
always save registers) then adding neon implementation for additional functions 
does not increase cost.
_______________________________________________
Patches mailing list
Patches@xxxxxxxxxx
http://eglibc.org/cgi-bin/mailman/listinfo/patches