[LAD] vectorization

From: Malte Steiner <steiner@email-addr-hidden>
Date: Thu Feb 07 2008 - 19:51:06 EET

Hello,

I am trying to squeeze as much performance as possible out of my upcoming
Linux synthesizer and am trying manual vectorization with the following
construct in C, mainly to vectorize away multiplications:

typedef float v4sf __attribute__ ((vector_size(16)));

union f4vector
{
    v4sf v __attribute__((aligned (16)));
    float f[4] __attribute__((aligned (16)));
};

On an AMD 64-bit Turion (single core) running 64 Studio in 64-bit mode, this
doesn't improve performance at all; it actually gets worse. Is GCC that good
at optimizing on its own? I have no access to Intel processors at the
moment but would love to know how to benefit from SIMD optimizations of
float operations.
Sources on the web are rather thin...
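
For reference, here is a minimal sketch of how a union like the one above
might be used to multiply a block of samples by a gain, four floats at a
time, next to a plain scalar loop for comparison. The function names
scale_block_simd and scale_block_scalar are hypothetical and only serve as
an illustration; this is not the synthesizer's actual code. It assumes GCC
(or a compatible compiler) with vector extensions, and that the sample
count is a multiple of four.

```c
#include <assert.h>
#include <stddef.h>

typedef float v4sf __attribute__ ((vector_size(16)));

union f4vector
{
    v4sf v;      /* the four floats viewed as one SIMD value */
    float f[4];  /* the same four floats, individually addressable */
};

/* Hypothetical helper: multiply nvec vectors (4 floats each) by gain.
   The gain is replicated into all four lanes once, outside the loop. */
static void scale_block_simd(union f4vector *buf, size_t nvec, float gain)
{
    union f4vector g;
    size_t i;

    assert(buf != NULL);
    for (i = 0; i < 4; i++)
        g.f[i] = gain;

    for (i = 0; i < nvec; i++)
        buf[i].v = buf[i].v * g.v;  /* one multiply covers four samples */
}

/* Hypothetical scalar reference: the same operation, one float at a time.
   With optimization enabled the compiler may vectorize this loop itself. */
static void scale_block_scalar(float *buf, size_t n, float gain)
{
    size_t i;

    assert(buf != NULL);
    for (i = 0; i < n; i++)
        buf[i] *= gain;
}
```

Timing both variants on the same buffer, built with the same optimization
flags, is one way to check whether the manual version gains anything over
what the compiler produces from the scalar loop.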

Cheers,

Malte

-- 
Malte Steiner
media art + development
-www.block4.com-
_______________________________________________
Linux-audio-dev mailing list
Linux-audio-dev@email-addr-hidden
http://lists.linuxaudio.org/mailman/listinfo/linux-audio-dev

Received on Thu Feb 7 20:15:08 2008
