Rietveld Code Review Tool
Help | Bug tracker | Discussion group | Source code | Sign in
(154)

Issue 7058077: Adding SSE2 Matrix-Matrix multiply to SetConcat (when in X64 mode).

Can't Edit
Can't Publish+Mail
Start Review
Created:
12 years ago by whunt
Modified:
11 years, 7 months ago
Reviewers:
jamesr, reed1, whunt1, TomH
CC:
skia-review_googlegroups.com, mike3, reed1
Base URL:
http://skia.googlecode.com/svn/trunk/
Visibility:
Public.

Description

Adding SSE2 Matrix-Matrix multiply to SetConcat (when in X64 mode). Branch occurs at compile time rather than run time. Performance goes from 3.25 units to 1.87 units.

Patch Set 1 #

Total comments: 4

Patch Set 2 : #

Patch Set 3 : #

Total comments: 1

Patch Set 4 : #

Total comments: 1

Patch Set 5 : #

Patch Set 6 : #

Unified diffs Side-by-side diffs Delta from patch set Stats (+107 lines, -4 lines) Patch
M bench/Matrix44Bench.cpp View 1 2 2 chunks +23 lines, -0 lines 0 comments Download
M include/utils/SkMatrix44.h View 1 2 3 4 3 chunks +15 lines, -2 lines 0 comments Download
M src/utils/SkMatrix44.cpp View 1 2 3 4 5 3 chunks +69 lines, -2 lines 0 comments Download

Messages

Total messages: 29
whunt
https://codereview.appspot.com/7058077/diff/1/bench/Matrix44Bench.cpp File bench/Matrix44Bench.cpp (right): https://codereview.appspot.com/7058077/diff/1/bench/Matrix44Bench.cpp#newcode133 bench/Matrix44Bench.cpp:133: fM1.set(2, 0, 3.0f); Added these to make the benchmark ...
12 years ago (2013-01-10 19:19:29 UTC) #1
reed1
1. do we already have adequate coverage in unittests to feel that we're getting the ...
12 years ago (2013-01-10 19:28:03 UTC) #2
reed1
https://codereview.appspot.com/7058077/diff/1/src/utils/SkMatrix44.cpp File src/utils/SkMatrix44.cpp (right): https://codereview.appspot.com/7058077/diff/1/src/utils/SkMatrix44.cpp#newcode438 src/utils/SkMatrix44.cpp:438: p->w_zw = b0 * x_zw + b1 * y_zw ...
12 years ago (2013-01-10 19:30:21 UTC) #3
whunt
On 2013/01/10 19:28:03, reed1 wrote: > 1. do we already have adequate coverage in unittests ...
12 years ago (2013-01-10 22:11:31 UTC) #4
whunt
On 2013/01/10 19:30:21, reed1 wrote: > https://codereview.appspot.com/7058077/diff/1/src/utils/SkMatrix44.cpp > File src/utils/SkMatrix44.cpp (right): > > https://codereview.appspot.com/7058077/diff/1/src/utils/SkMatrix44.cpp#newcode438 > ...
12 years ago (2013-01-10 22:14:41 UTC) #5
whunt
https://codereview.appspot.com/7058077/diff/6001/src/utils/SkMatrix44.cpp File src/utils/SkMatrix44.cpp (right): https://codereview.appspot.com/7058077/diff/6001/src/utils/SkMatrix44.cpp#newcode395 src/utils/SkMatrix44.cpp:395: sk_bzero(result, sizeof(storage)); This change *over doubled* the performance of ...
12 years ago (2013-01-10 22:18:03 UTC) #6
reed1
On 2013/01/10 22:14:41, whunt wrote: > On 2013/01/10 19:30:21, reed1 wrote: > > https://codereview.appspot.com/7058077/diff/1/src/utils/SkMatrix44.cpp > ...
12 years ago (2013-01-10 22:20:23 UTC) #7
reed1
12 years ago (2013-01-10 22:21:24 UTC) #8
jamesr
On 2013/01/10 22:20:23, reed1 wrote: > On 2013/01/10 22:14:41, whunt wrote: > > On 2013/01/10 ...
12 years ago (2013-01-10 22:23:29 UTC) #9
reed1
I presume you meant 16 bytes, not bits. Doesn't that mean it also matters how ...
12 years ago (2013-01-10 22:23:42 UTC) #10
reed1
We will need to turn the static_assert into an #ifdef, since we can build SkMatrix44 ...
12 years ago (2013-01-10 22:25:51 UTC) #11
whunt
On 2013/01/10 22:23:29, jamesr wrote: > On 2013/01/10 22:20:23, reed1 wrote: > > On 2013/01/10 ...
12 years ago (2013-01-10 22:26:04 UTC) #12
jamesr
On 2013/01/10 22:23:42, reed1 wrote: > I presume you meant 16 bytes, not bits. > ...
12 years ago (2013-01-10 22:26:57 UTC) #13
whunt
On 2013/01/10 22:23:42, reed1 wrote: > I presume you meant 16 bytes, not bits. > ...
12 years ago (2013-01-10 22:27:58 UTC) #14
whunt
On 2013/01/10 22:25:51, reed1 wrote: > We will need to turn the static_assert into an ...
12 years ago (2013-01-10 22:28:28 UTC) #15
whunt
On 2013/01/10 22:25:51, reed1 wrote: > We will need to turn the static_assert into an ...
12 years ago (2013-01-10 22:29:45 UTC) #16
TomH
On 2013/01/10 22:26:04, whunt wrote: > On 2013/01/10 22:23:29, jamesr wrote: > > You can ...
12 years ago (2013-01-11 10:08:47 UTC) #17
TomH
And LGTM, for what it's worth.
12 years ago (2013-01-11 10:10:09 UTC) #18
whunt1
All Intel Atom (used in mobile Intel devices) processors support SSE, SSE2, SSE3 and SSE3E ...
12 years ago (2013-01-11 18:14:01 UTC) #19
whunt
On 2013/01/11 18:14:01, whunt_google.com wrote: > All Intel Atom (used in mobile Intel devices) processors ...
12 years ago (2013-01-11 18:52:40 UTC) #20
TomH
On 2013/01/11 18:52:40, whunt wrote: > For skia, what's the process to go from LGTM ...
12 years ago (2013-01-14 09:57:19 UTC) #21
reed1
https://codereview.appspot.com/7058077/diff/10001/include/utils/SkMatrix44.h File include/utils/SkMatrix44.h (right): https://codereview.appspot.com/7058077/diff/10001/include/utils/SkMatrix44.h#newcode60 include/utils/SkMatrix44.h:60: Lets consider consolidating the build flag tests into a ...
12 years ago (2013-01-14 14:53:00 UTC) #22
whunt1
On 2013/01/14 14:53:00, reed1 wrote: > https://codereview.appspot.com/7058077/diff/10001/include/utils/SkMatrix44.h > File include/utils/SkMatrix44.h (right): > > https://codereview.appspot.com/7058077/diff/10001/include/utils/SkMatrix44.h#newcode60 > ...
12 years ago (2013-01-14 18:17:01 UTC) #23
reed1
lgtm we have other examples where we name the test more explicitly (e.g. _scaletrans instead ...
12 years ago (2013-01-14 18:21:36 UTC) #24
TomH
Unfortunately, when I run out/Debug/tests on Linux with this patch installed, I get: [42/85] Matrix44... ...
12 years ago (2013-01-15 11:47:22 UTC) #25
reed1
On 2013/01/15 11:47:22, TomH wrote: > Unfortunately, when I run out/Debug/tests on Linux with this ...
12 years ago (2013-01-15 13:17:47 UTC) #26
TomH
Mike reduced the stringency of our size checks, which avoided the assertion failure. Landed as ...
11 years, 12 months ago (2013-01-17 12:18:41 UTC) #27
TomH
Nope, can't close it. Worked for me in Ubuntu Debug, but tests fail on most ...
11 years, 12 months ago (2013-01-17 13:34:16 UTC) #28
reed1
11 years, 7 months ago (2013-06-03 13:17:53 UTC) #29

          
Sign in to reply to this message.

Powered by Google App Engine
RSS Feeds Recent Issues | This issue
This is Rietveld f62528b