Likely I will commit these piece-meal, but here is the sum of my experiments so ...
11 years, 11 months ago
(2012-05-10 20:25:00 UTC)
#1
Likely I will commit these piece-meal, but here is the sum of my experiments so
far to speed up small drawRects (simulating dashing).
Bench runs are very noisy :( but I'm seeing ~25% faster on dash_4_rect
benchmark.
Will run some timings on linux (64bit) before I commit anything
Want to get some aggregate feeling of perf change on all benches (but how?)
The templates add a *lot* of code complexity to BlitRow_D32. Do they really yield that ...
11 years, 11 months ago
(2012-05-10 20:53:50 UTC)
#3
The templates add a *lot* of code complexity to BlitRow_D32. Do they really
yield that much more performance than a naive unrolling? The only thing I see
you're winning is that the odd-man-out part of the loop (1..3) is unrolled.
For noisy timings, what -repeat count are you using?
My rule of thumb is -repeat 50 for 5% noise, -repeat 150 for 1%.
Issue 6197072: optimizations for small drawRect calls
Created 11 years, 11 months ago by reed1
Modified 11 years, 11 months ago
Reviewers: TomH
Base URL: http://skia.googlecode.com/svn/trunk/
Comments: 0