Re: [arm-gnu] NEON usage
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [arm-gnu] NEON usage
- To: James <jamessteward@xxxxxxxxxxxxxxx>
- Subject: Re: [arm-gnu] NEON usage
- From: Daniel Jacobowitz <dan@xxxxxxxxxxxxxxxx>
- Date: Tue, 11 May 2010 09:18:28 -0400
On Tue, May 11, 2010 at 01:20:10PM +1000, James wrote:
> $ arm-none-linux-gnueabi-gcc -O3 -march=armv7-a -mtune=cortex-a8
> -mcpu=cortex-a8 -mfpu=neon -mfloat-abi=softfp -ftree-vectorize
> -ftree-vectorizer-verbose=5 -ffast-math -fvect-cost-model -o neon
> test.c
>
> test.c:27: note: not vectorized: unsupported data-type int64_t
> test.c:21: note: vectorized 0 loops in function.
>
> But the ARM NEON Intrinsics here:
> http://gcc.gnu.org/onlinedocs/gcc/ARM-NEON-Intrinsics.html
>
> seems to suggest that
>
> int64x2_t vmlal_s32 (int64x2_t, int32x2_t, int32x2_t)
>
> could be used?
Does it work any better with -mvectorize-with-neon-quad?
Currently released versions of GCC only support auto-vectorizing to a
single size; either double or quad, but not both. A future version
will support automatic selection. The infrastructure and some x86
support is expected in GCC 4.6, but I don't know when the equivalent
ARM pieces will be done.
--
Daniel Jacobowitz
CodeSourcery