Issue 207048: merge upstream ElementTree 1.3 and cElementTree 1.0.6 in /python/trunk/

Issue 207048: merge upstream ElementTree 1.3 and cElementTree 1.0.6 in /python/trunk/ (Closed)

Can't Edit
Can't Publish+Mail
Start Review

Created:
15 years, 5 months ago by flox

Modified:
15 years, 4 months ago

Reviewers:
effbot, flox, Antoine Pitrou

Base URL:
http://svn.python.org/view/*checkout*/python/trunk/

Visibility:
Public.

Description

Merge upstream ElementTree/cElementTree in trunk. * upstream: http://bitbucket.org/effbot/et-2009-provolone/ * branch to prepare 1.3 for Python: http://bitbucket.org/flox/et-2009-provolone/ See the thread of the request: http://bugs.python.org/issue6472 The goals of this patch are: - to fix many bugs reported in Python issue tracker - to ensure consistency between C and Python implementations (with tests) - to improve test coverage Some parts are removed because they are experimental or too much specialized: - module ElementC14N: canonical XML - ElementTree C API This patch fixes many issues, including: #6472 Update ET with upstream changes (and #1143 #1777) #1538691 Patch cET to export CurrentLineNumber #1602189 Suggest a textlist() method for ElementTree #3151 ET serialization bug for weird namespace urls #3475 _elementtree.c import can fail silently #6230 ET.Element and cET.Element have slightly different repr #6232 Improve test coverage of ET and cET #6265 cET & ET use different exceptions for XML Errors #6266 cET.iterparse & ET.iterparse return differently encoded strings #6565 improper use of __setitem__ in ET Test coverage: * relevant tests from upstream are ported to "test_xml_etree.py" * same test_suite is run for both C and Python implementations * merge tests provided with #2746, #6232 and #6233

Patch Set 1 : Patch for 2.7, with documentation and tests. #

Total comments: 24

Patch Set 2 : First iteration, and document the deprecation of XMLParser.doctype(...) #

Patch Set 3 : Merge all tests from upstream. Fix refleaks on ParseError and E.attrib. #

Total comments: 7

Patch Set 4 : Add tests for the C API. Drop unused "element_(get|set)slice". #

Total comments: 15

Patch Set 5 : Extend test coverage with #6232. Fix regression in E.findtext... #

Patch Set 6 : Split out experimental C API. #

Patch Set 7 : Rebase the patch on the Mercurial repository. #

Total comments: 19

Patch Set 8 : Ready for merge in trunk? #

Created: 15 years, 4 months ago

Download [raw] [tar.bz2]

Unified diffs	Side-by-side diffs	Delta from patch set	Stats (+3299 lines, -1186 lines)			Patch
Doc/library/xml.etree.elementtree.rst	View	1 2 3 4 5 6 7	15 chunks	+197 lines, -127 lines	0 comments	Download
Lib/test/samples/simple.xml	View	3 4 5 6	1 chunk	+6 lines, -0 lines	0 comments	Download
Lib/test/samples/simple-ns.xml	View	3 4 5 6	1 chunk	+7 lines, -0 lines	0 comments	Download
Lib/test/test_xml_etree.py	View	1 2 3 4 5 6 7	13 chunks	+1525 lines, -52 lines	0 comments	Download
Lib/test/test_xml_etree_c.py	View	1 2 3 4 5 6 7	2 chunks	+15 lines, -204 lines	0 comments	Download
Lib/xml/etree/ElementInclude.py	View	1 2 3 4 5 6 7	3 chunks	+4 lines, -4 lines	0 comments	Download
Lib/xml/etree/ElementPath.py	View	1 2 3 4 5 6 7	5 chunks	+234 lines, -129 lines	0 comments	Download
Lib/xml/etree/ElementTree.py	View	1 2 3 4 5 6 7	37 chunks	+802 lines, -414 lines	0 comments	Download
Lib/xml/etree/__init__.py	View	1 2 3 4 5 6 7	2 chunks	+3 lines, -3 lines	0 comments	Download
Modules/_elementtree.c	View	1 2 3 4 5 6 7	47 chunks	+506 lines, -253 lines	0 comments	Download

Messages

Total messages: 38

Expand All Messages | Collapse All Messages

flox

Clarify the encoded bytestring. http://codereview.appspot.com/207048/diff/1005/10 File Lib/test/test_xml_etree.py (right): http://codereview.appspot.com/207048/diff/1005/10#newcode260 Lib/test/test_xml_etree.py:260: >>> e = ET.XML("<?xml version='1.0' ...

15 years, 5 months ago (2010-02-10 08:15:27 UTC) #2

flox

Added developer foreword on all modules, to ease the review. It is very close to ...

15 years, 5 months ago (2010-02-10 09:34:36 UTC) #3

flox

http://codereview.appspot.com/207048/diff/1005/10 File Lib/test/test_xml_etree.py (right): http://codereview.appspot.com/207048/diff/1005/10#newcode251 Lib/test/test_xml_etree.py:251: >>> ET.tostring(ET.PI('test', u'<testing&>\xe3'), 'latin1') Backported from py3k: r78126 http://codereview.appspot.com/207048/diff/1005/10#newcode256 ...

15 years, 5 months ago (2010-02-10 12:15:38 UTC) #4

Antoine Pitrou

http://codereview.appspot.com/207048/diff/1005/12 File Lib/xml/etree/ElementTree.py (left): http://codereview.appspot.com/207048/diff/1005/12#oldcode1237 Lib/xml/etree/ElementTree.py:1237: pass Is this method deprecated? It seems to disappear ...

15 years, 5 months ago (2010-02-10 13:14:37 UTC) #5

flox

http://codereview.appspot.com/207048/diff/1005/12 File Lib/xml/etree/ElementTree.py (left): http://codereview.appspot.com/207048/diff/1005/12#oldcode1237 Lib/xml/etree/ElementTree.py:1237: pass On 2010/02/10 13:14:37, Antoine Pitrou wrote: > Is ...

15 years, 5 months ago (2010-02-10 14:31:40 UTC) #6

flox

First iteration, and document the deprecation of XMLParser.doctype(...)

15 years, 5 months ago (2010-02-11 18:35:22 UTC) #7

fredrik_pythonware.com

You do realize that you're merging in an experimental release, right? I'm a bit worried ...

15 years, 5 months ago (2010-02-11 18:39:08 UTC) #8

You do realize that you're merging in an experimental release, right?
I'm a bit worried that the result of this effort will be plenty of
incompatibilities with the upstream library (and there are also signs
on bugs.python.org that some people involved don't understand the
difference between specification of a portable API and artifacts of a
certain implementation of the same API), but I'm travelling right now,
and have no bandwidth to deal with this.  Just be careful.

</F>

On Wed, Feb 10, 2010 at 2:14 PM,  <antoine.pitrou@gmail.com> wrote:
>
> http://codereview.appspot.com/207048/diff/1005/12
> File Lib/xml/etree/ElementTree.py (left):
>
> http://codereview.appspot.com/207048/diff/1005/12#oldcode1237
> Lib/xml/etree/ElementTree.py:1237: pass
> Is this method deprecated? It seems to disappear after the patch.
>
> http://codereview.appspot.com/207048/diff/1005/17
> File Modules/_elementtree.c (right):
>
> http://codereview.appspot.com/207048/diff/1005/17#newcode798
> Modules/_elementtree.c:798: seq = PySequence_Fast(seq_in, "");
> The second argument (error message if seq_in is not a valid sequence)
> should be non-empty.
>
> http://codereview.appspot.com/207048/diff/1005/17#newcode892
> Modules/_elementtree.c:892: See bug 6472. */
> Rather than disabling the C implementation, we could simply call
> PyObject_GetIter() on the result.
>
> http://codereview.appspot.com/207048/diff/1005/17#newcode1498
> Modules/_elementtree.c:1498: recycle = PyList_New(slicelen);
> This lacks an error check.
>
> http://codereview.appspot.com/207048/diff/1005/17#newcode1512
> Modules/_elementtree.c:1512: Py_DECREF(seq);
> There's a refleak with `recycle` but we can't decref it here, since
> otherwise some elements will be destroyed while they are still
> referenced by the parent.
> The call to element_resize() should instead be done before `recycle` is
> populated (but after it is allocated).
>
> http://codereview.appspot.com/207048/show
>

flox

Merge *all* tests from upstream. Fix refleaks on ParseError and E.attrib.

15 years, 5 months ago (2010-02-12 11:46:32 UTC) #9

flox

On 2010/02/11 18:39:08, fredrik_pythonware.com wrote: > You do realize that you're merging in an experimental ...

15 years, 5 months ago (2010-02-12 13:59:50 UTC) #10

On 2010/02/11 18:39:08, fredrik_pythonware.com wrote:
> You do realize that you're merging in an experimental release, right?
> I'm a bit worried that the result of this effort will be plenty of
> incompatibilities with the upstream library (and there are also signs
> on http://bugs.python.org that some people involved don't understand the
> difference between specification of a portable API and artifacts of a
> certain implementation of the same API), but I'm travelling right now,
> and have no bandwidth to deal with this.  Just be careful.
> 
> </F>
> 

Thanks, Fredrik for your feedback.

Actually, I started to fix some ET/cET bugs 3 months ago.
The main issues with the current "xml.etree" package are explained on #6472.

Then I found that most bugs and discrepancies were already fixed in the upstream
versions.
I thought that the best approach is to port the upstream version in trunk,
rather than fixing bugs separately in /python/trunk/.
I took the upstream bundles and I merged them carefully with the trunk 2.7 alpha
over December 2009.

Recently Antoine showed some interest in fixing some bugs of "xml.etree"
(r78123-r78126). I suggested again to merge the upstream implementation of ET in
trunk.

With your comment about the *experimental* status of the package, I decided to
grow the python test suite with both "selftest.py" upstream tests. I merged all
tests together, and now the same test suite passes with Python and C
implementations.
(test_xml_etree.py: 370 lines --> 1500 lines)

The tests show no regression.
Additionally, some reference leakings were identified and fixed, while running
the test suite. (See diff between patch set 1 and 3)

The differences between upstream and the current patch (patch set 3) are
limited:

 * Lib/xml/etree/ElementTree.py
   http://paste.pocoo.org/compare/177059/177058/

 * Modules/_elementree.c
   http://paste.pocoo.org/compare/177063/177061/

 * all other modules are IDENTICAL with upstream

Thank you again for this software, and your comments.
I hope we can merge it in trunk before the beta of 2.7.

-- 
Florent

flox

http://codereview.appspot.com/207048/diff/1035/50 File Modules/_elementtree.c (right): http://codereview.appspot.com/207048/diff/1035/50#newcode799 Modules/_elementtree.c:799: seq = PySequence_Fast(seq_in, ""); Don't miss this one, next ...

15 years, 5 months ago (2010-02-12 14:07:23 UTC) #11

Antoine Pitrou

http://codereview.appspot.com/207048/diff/1035/50 File Modules/_elementtree.c (left): http://codereview.appspot.com/207048/diff/1035/50#oldcode1409 Modules/_elementtree.c:1409: element_setslice, If element_setslice and element_getslice aren't used anymore, they ...

15 years, 5 months ago (2010-02-12 15:27:45 UTC) #12

flox

http://codereview.appspot.com/207048/diff/1035/50 File Modules/_elementtree.c (left): http://codereview.appspot.com/207048/diff/1035/50#oldcode1409 Modules/_elementtree.c:1409: element_setslice, On 2010/02/12 15:27:46, Antoine Pitrou wrote: > If ...

15 years, 5 months ago (2010-02-12 16:07:47 UTC) #13

flox

Add tests for the C API. Drop unused "element_(get|set)slice".

15 years, 5 months ago (2010-02-15 13:20:58 UTC) #14

Antoine Pitrou

http://codereview.appspot.com/207048/diff/67/1054 File Modules/_testcapimodule.c (right): http://codereview.appspot.com/207048/diff/67/1054#newcode1311 Modules/_testcapimodule.c:1311: childob = (*capi->getitem)(newob, 0); You should check that childob ...

15 years, 5 months ago (2010-02-15 15:06:56 UTC) #15

flox

http://codereview.appspot.com/207048/diff/67/1054 File Modules/_testcapimodule.c (right): http://codereview.appspot.com/207048/diff/67/1054#newcode1338 Modules/_testcapimodule.c:1338: strcmp(PyString_AS_STRING(snapshot.tag), "document") != 0) { On 2010/02/15 15:06:56, Antoine ...

15 years, 5 months ago (2010-02-15 16:42:32 UTC) #16

http://codereview.appspot.com/207048/diff/67/1054
File Modules/_testcapimodule.c (right):

http://codereview.appspot.com/207048/diff/67/1054#newcode1338
Modules/_testcapimodule.c:1338: strcmp(PyString_AS_STRING(snapshot.tag),
"document") != 0) {
On 2010/02/15 15:06:56, Antoine Pitrou wrote:
> If you want to make all these checks easier (and/or more complete), you could
> instead return a tuple of the snapshot's contents and check the values in
> Python.
> 

Agreed.

http://codereview.appspot.com/207048/diff/67/1052
File Modules/celementtree.h (right):

http://codereview.appspot.com/207048/diff/67/1052#newcode51
Modules/celementtree.h:51: PyObject* type;
On 2010/02/15 15:06:56, Antoine Pitrou wrote:
> The doc/comments should state whether this reference is owned or borrowed
(i.e.,
> whether one should Py_DECREF it when done with the capi struct).

Added a statement "borrowed reference".

http://codereview.appspot.com/207048/diff/67/1052#newcode56
Modules/celementtree.h:56: int (*assert)(PyObject* elem);
On 2010/02/15 15:06:56, Antoine Pitrou wrote:
> I'm not sure calling this "assert" is a good idea. If some C compiler uses a
> #define for the standard "assert", it can refuse to compile or compile to the
> wrong symbol. Why not something explicit such as "checktype"?
> 

Agreed.

http://codereview.appspot.com/207048/diff/67/1052#newcode72
Modules/celementtree.h:72: /* Returns a borrowed reference, or Py_None if the
element does not
On 2010/02/15 15:06:56, Antoine Pitrou wrote:
> The doc is wrong, because the implementation returns a new reference. Either
the
> doc or the implementation should be fixed. 

Fixed the implementation.

http://codereview.appspot.com/207048/diff/67/1052#newcode74
Modules/celementtree.h:74: PyObject* (*getitem)(PyObject* elem, int index);
On 2010/02/15 15:06:56, Antoine Pitrou wrote:
> The index should probably be a Py_ssize_t instead.
> 

Ok.

flox

Extend test coverage with #6232. Fix regression in E.findtext...

15 years, 5 months ago (2010-02-16 22:38:37 UTC) #17

flox

On 2010/02/16 22:38:37, flox wrote: > Extend test coverage with #6232. Fix regression in E.findtext... ...

15 years, 5 months ago (2010-02-17 09:12:42 UTC) #18

flox

On 2010/02/17 09:12:42, flox wrote: > > Versions proposed for 2.7: > * ElementTree 1.3 ...

15 years, 5 months ago (2010-02-17 09:31:46 UTC) #19

fredrik_pythonware.com

Since you've effectively hijacked the library, and have created your own fork that's not fully ...

15 years, 5 months ago (2010-02-17 09:40:12 UTC) #20

flox

On 2010/02/17 09:40:12, fredrik_pythonware.com wrote: > Since you've effectively hijacked the library, and have created ...

15 years, 5 months ago (2010-02-17 10:12:13 UTC) #21

Antoine Pitrou

Fredrik, > Since you've effectively hijacked the library, and have created your > own fork ...

15 years, 5 months ago (2010-02-17 12:06:35 UTC) #22

fredrik_pythonware.com

The problem is that you're merging in features from a version that has never been ...

15 years, 5 months ago (2010-02-17 12:43:14 UTC) #23

The problem is that you're merging in features from a version that has
never been formally released -- I would have thought labeling
something as "experimental" and "work in progress" and storing it in a
repository named after a cheese (!) would be enough to make it clear
that the design wasn't finalized, but apparently that was a bit
optimistic (guess that's the downside of experimenting in a public
repository :-).  I don't mind you guys pulling bug fixes into Python 3
-- that's great -- the problem is when you start pulling in features,
because that means that you're basically freezing the API based on
something that was never intended to be final.  Since ET is a portable
API with multiple implementations, I'm not sure that's optimal.

But again, work & travel means that I have no time for before
mid-March or so.  If you want to push forward with an 1.3 release
before that, there's not much I can do about it (it's open source with
a permissive license, after all).  Otherwise, I'd recommend sticking
mostly to the 1.2 API with compatible bug fixes and other tweaks
required to make it work well under 3.X (i.e. going for "1.2.8"
instead of "1.3.0").  Adding things like "extend" is
non-controversial, things like the namespace-aware parsers etc less
so.

</F>

On Wed, Feb 17, 2010 at 1:06 PM,  <antoine.pitrou@gmail.com> wrote:
> Fredrik,
>
>> Since you've effectively hijacked the library, and have created your
>> own fork that's not fully compatible with any formal release of the
>> upstream library, and am not contributing any patches back to
>> upstream, I suggest renaming it instead.
>
> The point here is to fix these bugs for all Python users. If the
> "patches" were contributed upstream, it would be a NO-OP for Python
> until upstream gets ported back to Python (which hasn't seemed to happen
> for years, has it?). If you want to propose another process then please
> do so.
>
> But the process you will be proposing has to have the final outcome of
> fixing these bugs *in Python* as well, not only for users of your own
> releases (or, AFAIU, SVN repository checkouts).
>
>
> http://codereview.appspot.com/207048/show
>

flox

On 2010/02/17 12:43:14, fredrik_pythonware.com wrote: > The problem is that you're merging in features from ...

15 years, 5 months ago (2010-02-19 09:30:46 UTC) #25

effbot

Finally managed to set aside enough time to review both the mercurial fork and this ...

15 years, 4 months ago (2010-03-09 10:38:43 UTC) #27

Finally managed to set aside enough time to review both the mercurial fork and
this patch.

Some minor comments inline; the only thing I would prefer to see changed before
commit is the deprecation of getiterator in 1.3 (see notes).  If I've missed
anything else, let's deal with that in 1.4 :)

And again, thanks for doing this work, and sorry for having so little time to
spend on this at this time.

</F>

http://codereview.appspot.com/207048/diff/8001/9006
File Lib/xml/etree/ElementTree.py (right):

http://codereview.appspot.com/207048/diff/8001/9006#newcode487
Lib/xml/etree/ElementTree.py:487: return list(self.iter(tag))
Since the new spelling hasn't been available before, 
I'd prefer to deprecate this in 1.4 (as noted in a comment the original 1.3
code).  That is, version N=document as deprecated, N+1=warn that it will go
away, N+2 or higher=remove.

(btw, 'getiterator' is older than the iterator concept in Python, and was only
defined to return something that you could iterate over.  The 1.3 code uses
list(iter) for strict API compatibility with the ElementTree 1.2 implementation
(being overly cautious here, perhaps)).

http://codereview.appspot.com/207048/diff/8001/9006#newcode1239
Lib/xml/etree/ElementTree.py:1239: raise ValueError("unknown event %r" % event)
This will yield a ValueError instead of an ImportError for c14n if ElementC14N
isn't present; the former doesn't really tell the user what's missing.  It's no
big deal, really, just a minor regression that might be worth revisiting later
on.

http://codereview.appspot.com/207048/diff/8001/9006#newcode1297
Lib/xml/etree/ElementTree.py:1297: for elem in tree.getiterator():
Use iter here?

http://codereview.appspot.com/207048/diff/8001/9010
File Modules/_elementtree.c (right):

http://codereview.appspot.com/207048/diff/8001/9010#newcode110
Modules/_elementtree.c:110: #define Py_RETURN_NONE return Py_INCREF(Py_None),
Py_None
I tend prefer using feature tests instead of version tests
whenever possible, but no big deal.

flox

See my answers below. I don't get the point about ValueError versus ImportError. I will ...

15 years, 4 months ago (2010-03-10 06:02:33 UTC) #28

See my answers below.
I don't get the point about ValueError versus ImportError.

I will commit the changes in the Mercurial repo, then update the patch here.

http://codereview.appspot.com/207048/diff/8001/9006
File Lib/xml/etree/ElementTree.py (right):

http://codereview.appspot.com/207048/diff/8001/9006#newcode487
Lib/xml/etree/ElementTree.py:487: return list(self.iter(tag))
On 2010/03/09 10:38:43, effbot wrote:
> Since the new spelling hasn't been available before, 
> I'd prefer to deprecate this in 1.4 (as noted in a comment the original 1.3
> code).  That is, version N=document as deprecated, N+1=warn that it will go
> away, N+2 or higher=remove.
> 
> (btw, 'getiterator' is older than the iterator concept in Python, and was only
> defined to return something that you could iterate over.  The 1.3 code uses
> list(iter) for strict API compatibility with the ElementTree 1.2
implementation
> (being overly cautious here, perhaps)).

OK, I will rollback this deprecation warning.

http://codereview.appspot.com/207048/diff/8001/9006#newcode1239
Lib/xml/etree/ElementTree.py:1239: raise ValueError("unknown event %r" % event)
On 2010/03/09 10:38:43, effbot wrote:
> This will yield a ValueError instead of an ImportError for c14n if ElementC14N
> isn't present; the former doesn't really tell the user what's missing.  It's
no
> big deal, really, just a minor regression that might be worth revisiting later
> on.
> 

Are you sure? This fix is for "iterparse" to be consistent with the C
implementation. I don't see the link with c14n.

http://codereview.appspot.com/207048/diff/8001/9006#newcode1297
Lib/xml/etree/ElementTree.py:1297: for elem in tree.getiterator():
On 2010/03/09 10:38:43, effbot wrote:
> Use iter here?

ok.

http://codereview.appspot.com/207048/diff/8001/9010
File Modules/_elementtree.c (right):

http://codereview.appspot.com/207048/diff/8001/9010#newcode110
Modules/_elementtree.c:110: #define Py_RETURN_NONE return Py_INCREF(Py_None),
Py_None
On 2010/03/09 10:38:43, effbot wrote:
> I tend prefer using feature tests instead of version tests
> whenever possible, but no big deal.

Ok, I can roll it back.
I changed it to make it easier the day you drop the compatibility with 2.4.

flox

http://codereview.appspot.com/207048/diff/8001/9010 File Modules/_elementtree.c (left): http://codereview.appspot.com/207048/diff/8001/9010#oldcode2699 Modules/_elementtree.c:2699: "def getiterator(node, tag=None):\n" /* helper */ Note: the C ...

15 years, 4 months ago (2010-03-10 06:12:38 UTC) #29

effbot

Some followup comments. http://codereview.appspot.com/207048/diff/8001/9006 File Lib/xml/etree/ElementTree.py (right): http://codereview.appspot.com/207048/diff/8001/9006#newcode790 Lib/xml/etree/ElementTree.py:790: raise ValueError("unknown method %r" % method) ...

15 years, 4 months ago (2010-03-10 14:37:25 UTC) #30

Some followup comments.

http://codereview.appspot.com/207048/diff/8001/9006
File Lib/xml/etree/ElementTree.py (right):

http://codereview.appspot.com/207048/diff/8001/9006#newcode790
Lib/xml/etree/ElementTree.py:790: raise ValueError("unknown method %r" % method)
Reposting to right place:

This will yield a ValueError instead of an ImportError for c14n if ElementC14N
isn't present; the former doesn't really tell the user what's missing.  It's no
big deal, really, just a minor regression that might be worth revisiting later
on.

(An ImportError mentioning a module name gives a stronger hint to the user than
a ValueError.)

http://codereview.appspot.com/207048/diff/8001/9006#newcode1239
Lib/xml/etree/ElementTree.py:1239: raise ValueError("unknown event %r" % event)
Oops.  The comment was supposed to be at line 790, not here.  That's what you
get for copying your notes from a separate document and not paying attention
(that, or Rieveld played tricks on me :).

http://codereview.appspot.com/207048/diff/8001/9006#newcode1642
Lib/xml/etree/ElementTree.py:1642: from ElementC14N import _serialize_c14n
Maybe the import should be done from "elementtree.ElementC14N" so you can use
this even if you import xml.etree but have the stand-alone version installed?

http://codereview.appspot.com/207048/diff/8001/9010
File Modules/_elementtree.c (left):

http://codereview.appspot.com/207048/diff/8001/9010#oldcode2699
Modules/_elementtree.c:2699: "def getiterator(node, tag=None):\n" /* helper */
That inconsistency is old, and I was more concerned with backwards compatibility
for existing code than for people who are explicitly migrating (this is the
whole rationale for having to import cElementTree explicitly, instead of mapping
ElementTree to it if it's there), but ok, let's make them equal and see what
happens.

A bit mixed about PendingDeprecationWarning; it's not a bad idea in itself, but
are people using -Wall enough to motivate the extra overhead for every call in
existing code?  Also, it's not in 2.2 iirc, so you'll have to add extra logic
for that.

flox

Comments about E.getiterator() and support of C14N serializer. I will push some changes to Mercurial ...

15 years, 4 months ago (2010-03-10 15:17:32 UTC) #31

Comments about E.getiterator() and support of C14N serializer.

I will push some changes to Mercurial tonight.

http://codereview.appspot.com/207048/diff/8001/9006
File Lib/xml/etree/ElementTree.py (right):

http://codereview.appspot.com/207048/diff/8001/9006#newcode1642
Lib/xml/etree/ElementTree.py:1642: from ElementC14N import _serialize_c14n
On 2010/03/10 14:37:25, effbot wrote:
> Maybe the import should be done from "elementtree.ElementC14N" so you can use
> this even if you import xml.etree but have the stand-alone version installed?

I'm concerned about the risk of incompatiblity if "xml.etree" and "elementtree"
are mixed.
There will be 2 versions of ElementTree module imported:
xml.etree.ElementTree and elementtree.ElementTree

I preserved the code in the "xml.etree" version to lower the differences with
the upstream "elementtree".
But there's no plan to add ElementC14N in Python (afaiu).
And I don't plan to document C14N, since it is not part of "xml.etree". We can
decide to change it later, (and change ValueError --> ImportError).

http://codereview.appspot.com/207048/diff/8001/9010
File Modules/_elementtree.c (left):

http://codereview.appspot.com/207048/diff/8001/9010#oldcode2699
Modules/_elementtree.c:2699: "def getiterator(node, tag=None):\n" /* helper */
On 2010/03/10 14:37:25, effbot wrote:
> A bit mixed about PendingDeprecationWarning; it's not a bad idea in itself,
but
> are people using -Wall enough to motivate the extra overhead for every call in
> existing code?  Also, it's not in 2.2 iirc, so you'll have to add extra logic
> for that.

Python 2.2 is no longer supported, is it?
The PendingDeprecationWarning will help people migrate their software, when they
will move to 2.7.
For the inconsistency, between C getiterator and Py getiterator, now I think we
can preserve (and document) the inconsistency. Since it is pending deprecation,
it is not a big deal. If the user wants a predictable behavior, he should use
E.iter() and list(E.iter()).

effbot

http://codereview.appspot.com/207048/diff/8001/9006 File Lib/xml/etree/ElementTree.py (right): http://codereview.appspot.com/207048/diff/8001/9006#newcode1642 Lib/xml/etree/ElementTree.py:1642: from ElementC14N import _serialize_c14n It's a bad idea in ...

15 years, 4 months ago (2010-03-10 15:46:16 UTC) #32

flox

15 years, 4 months ago (2010-03-10 16:02:58 UTC) #33

effbot

(this got stuck in review mode; trying again) http://codereview.appspot.com/207048/diff/8001/9010 File Modules/_elementtree.c (left): http://codereview.appspot.com/207048/diff/8001/9010#oldcode2699 Modules/_elementtree.c:2699: "def ...

15 years, 4 months ago (2010-03-11 12:45:46 UTC) #34

fredrik_pythonware.com

I sampled the patch deltas for the modules we've discussed lately, and assuming you haven't ...

15 years, 4 months ago (2010-03-11 14:24:00 UTC) #36

flox

15 years, 4 months ago (2010-03-22 09:33:59 UTC) #38

Thank you for reviewing.
It is merged in trunk (r78838) and 3.x (r78942).

There's a different issue opened about serializer encoding:
http://codereview.appspot.com/664043
http://bugs.python.org/issue8047

Expand All Messages | Collapse All Messages