Signed-off-by: James Almer <jamrial@gmail.com>
|
Signed-off-by: James Almer <jamrial@gmail.com>
|
Results weren't signed.
|
This is closer to its regular use, and keeps the inaccuracies of huge
float*float multiplications from failing the tests.
|
Will help debugging.
Signed-off-by: James Almer <jamrial@gmail.com>
|
Simplifies the test while trying more values, and avoids pointlessly
running benchmarks in a loop.
Signed-off-by: James Almer <jamrial@gmail.com>
|
Signed-off-by: James Almer <jamrial@gmail.com>
|
Passes checkasm with valgrind, tested with sizes of more than 4000 samples.
|
The old one was written on the assumption that only even inputs would be given.
This very messy replacement supports both even and odd inputs, and supports
AVX2 for extra speed. The buffers given are usually quite big (4k samples),
so the speedup is worth it.
The new SSE version is still 33% faster than the old inline asm version.
A checkasm test is also provided to make sure this monstrosity works.
This fixes some FATE tests.
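
The even/odd issue can be sketched in plain C, assuming "even and odd inputs" refers to the number of samples in the buffer. This is not the assembly from the commit: `scale_samples` and its scaling operation are hypothetical stand-ins. The main loop handles two samples per iteration, as a SIMD body would handle a full vector, and a scalar tail picks up the leftover sample when the length is odd.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical operation (not from the commit): scale `len` samples in
 * place by `gain`, for any length, even or odd. */
static void scale_samples(float *buf, size_t len, float gain)
{
    size_t i = 0;

    assert(buf || len == 0);

    /* Main loop: two samples per iteration, standing in for a SIMD body.
     * A version that assumed even lengths would stop here. */
    for (; i + 1 < len; i += 2) {
        buf[i]     *= gain;
        buf[i + 1] *= gain;
    }

    /* Tail: one leftover sample when len is odd. */
    if (i < len)
        buf[i] *= gain;
}
```

A test in the spirit of checkasm would run such a function over both even and odd lengths and compare against a simple reference loop.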