Issue 4844043: Use bfrange to shrink ToUnicode table.

	Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+226 lines, -36 lines)			Patch
M	gyp/tests.gyp	View	1 2 3	1 chunk	+1 line, -0 lines	0 comments	Download
M	src/pdf/SkPDFFont.cpp	View	1 2 3 4 5	2 chunks	+133 lines, -36 lines	2 comments	Download
A	tests/ToUnicode.cpp	View	1 2 3 4 5 6	1 chunk	+92 lines, -0 lines	0 comments	Download

Messages

Total messages: 11

Expand All Messages | Collapse All Messages

reed1

bfrange could probably use some isolated unittests, just to be sure we handle all of ...

14 years ago (2011-08-03 14:46:02 UTC) #3

arthurhsu

On 2011/08/03 13:28:46, TomH wrote: > How much shrinkage does this give us? <2KB per ...

14 years ago (2011-08-03 17:09:42 UTC) #4

arthurhsu

I've added unit tests and updated code based on Mike's comment. The unit tests unveiled ...

14 years ago (2011-08-03 20:41:36 UTC) #5

Steve VanDeBogart

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp File src/pdf/SkPDFFont.cpp (right): http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp#newcode373 src/pdf/SkPDFFont.cpp:373: namespace { We don't need a namespace here - ...

14 years ago (2011-08-03 23:30:22 UTC) #6

Steve VanDeBogart

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp File src/pdf/SkPDFFont.cpp (right): http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp#newcode487 src/pdf/SkPDFFont.cpp:487: SkPDFStream* generate_tounicode_cmap(const SkTDArray<SkUnichar>& glyphUnicode, this should still be static. ...

14 years ago (2011-08-03 23:35:03 UTC) #7

arthurhsu

14 years ago (2011-08-04 00:13:58 UTC) #8

Steve VanDeBogart

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp File src/pdf/SkPDFFont.cpp (right): http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp#newcode447 src/pdf/SkPDFFont.cpp:447: if (i == base.fStart + continuousEntries && On 2011/08/04 ...

14 years ago (2011-08-04 17:15:53 UTC) #9

arthurhsu

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp File src/pdf/SkPDFFont.cpp (right): http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp#newcode447 src/pdf/SkPDFFont.cpp:447: if (i == base.fStart + continuousEntries && On 2011/08/04 ...

13 years, 12 months ago (2011-08-08 17:38:14 UTC) #10

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp
File src/pdf/SkPDFFont.cpp (right):

http://codereview.appspot.com/4844043/diff/7001/src/pdf/SkPDFFont.cpp#newcode447
src/pdf/SkPDFFont.cpp:447: if (i == base.fStart + continuousEntries &&
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> On 2011/08/04 00:13:58, arthurhsu wrote:
> > On 2011/08/03 23:30:22, Steve VanDeBogart wrote:
> > > I think we can do better than this when you consider subsetting.  Consider
> > > glyphs 1-10 that map to unicode 101-110.  But we only have 1,3,5,7,9 in
the
> > > subset:
> > > <1> <101> <3> <103> <5> <105> <7> <107> <9> <109>
> > > is much longer than
> > > <1> <10> <101>
> > > It doesn't hurt to map an entry that we won't refer to.
> > 
> > We won't have the info in the near future when subset info is honored in
> > advanced type metrics, therefore I did not attempt to do further
optimization
> > like this.
> 
> As we discussed, please add a comment about the spec being unclear if we can
do
> better, but the savings being bounded (416k pre-compressed worse-case by my
> calculation).

Done.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp
File src/pdf/SkPDFFont.cpp (right):

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode394
src/pdf/SkPDFFont.cpp:394: cmap->writeHexAsText(bfchar[i + j].fGlyphId, 4);
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> Does this need to be at least four bytes, or can we just use the natural
length
> of the number?

I tried that at the very beginning of implementing ToUnicode and Adobe Reader
does not like it.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode435
src/pdf/SkPDFFont.cpp:435: BFRange base;
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> base -> currentRangeEntry

Done.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode441
src/pdf/SkPDFFont.cpp:441: // PDF spec mentioned that bfrange can not change the
higher byte,
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> nit: mentioned that->requires
> nit: bytes, -> byte.

Done.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode481
src/pdf/SkPDFFont.cpp:481: // The spec requires bfchar must present before
bfrange per spec.
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> nit: The spec requires that all bfchar entries must come before bfrange
entries.

Done.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode482
src/pdf/SkPDFFont.cpp:482: if (bfcharEntries.count())
append_bfchar_section(bfcharEntries, cmap);
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> nit: these don't have to be conditional / skia style requires {}'s on ifs.

Done.

http://codereview.appspot.com/4844043/diff/4003/src/pdf/SkPDFFont.cpp#newcode487
src/pdf/SkPDFFont.cpp:487: const SkTDArray<SkUnichar>& glyphToUnicode,
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> nit: indent is 8 spaces.

Done.

http://codereview.appspot.com/4844043/diff/4003/tests/ToUnicode.cpp
File tests/ToUnicode.cpp (right):

http://codereview.appspot.com/4844043/diff/4003/tests/ToUnicode.cpp#newcode27
tests/ToUnicode.cpp:27: void append_cmap_sections(const SkTDArray<SkUnichar>&
glyphToUnicode,
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> Hmm, if this gets out of sync with the definition, the compiler won't complain
> and we'll get some strange results... Maybe make it a static private function
of
> SkPDFFont, and make TestToUnicode a friend?

I am hesitated of doing so.  I'd like to keep testing related stuff out of
production code's header.  If the test failed, the dev who changed the signature
of this function should fix it, and that's why we unit test this function.

http://codereview.appspot.com/4844043/diff/4003/tests/ToUnicode.cpp#newcode32
tests/ToUnicode.cpp:32: SkTDArray<uint16_t> glyphIDs;
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> nit: glyphsInSubset

Done.

http://codereview.appspot.com/4844043/diff/4003/tests/ToUnicode.cpp#newcode49
tests/ToUnicode.cpp:49: glyphToUnicode.push(0x2F); // 8
On 2011/08/04 17:15:53, Steve VanDeBogart wrote:
> Can you also add a range with an entry missing, like 9,10,11,12

Done.

Steve VanDeBogart

13 years, 12 months ago (2011-08-08 22:34:37 UTC) #11

LGTM with nits... fixed and committed as r2075.

http://codereview.appspot.com/4844043/diff/2003/src/pdf/SkPDFFont.cpp
File src/pdf/SkPDFFont.cpp (right):

http://codereview.appspot.com/4844043/diff/2003/src/pdf/SkPDFFont.cpp#newcode470
src/pdf/SkPDFFont.cpp:470: (i >> 8) == (currentRangeEntry.fStart >> 8) &&
nit: 470-472 in 8 more

http://codereview.appspot.com/4844043/diff/2003/src/pdf/SkPDFFont.cpp#newcode505
src/pdf/SkPDFFont.cpp:505: // The spec requires all bfchar entries for a font
must present before
nit: present -> come

Expand All Messages | Collapse All Messages