Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:104687 Return-Path: Delivered-To: mailing list internals@lists.php.net Received: (qmail 93733 invoked from network); 13 Mar 2019 02:04:27 -0000 Received: from unknown (HELO mail-lj1-f194.google.com) (209.85.208.194) by pb1.pair.com with SMTP; 13 Mar 2019 02:04:27 -0000 Received: by mail-lj1-f194.google.com with SMTP id a17so3823362ljd.4 for ; Tue, 12 Mar 2019 15:54:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=beberlei-de.20150623.gappssmtp.com; s=20150623; h=mime-version:from:date:message-id:subject:to; bh=r08GbZBc9MZOPf1VMEs19BOj8V5x3kE7vhFcnFAIVfk=; b=g5T7yh5ZstSAWc+k/nJ/hbOKEzvbE/buPIhYVutdnlIJjHv4mNlaJ/t6F8i9VxJnCW 2LuTc5UyALpbXuBafxCjPR4P+OSi/7ON8p3U+N5ZgB3k0/8ddoqS9Q4tWSPXnyMuXDHp 6YyHsbz44DBuAU60KU1a9Sapbx/PndEqj2RR8q1OzYnPybBZGHCIV2K7zS2yA0bt57G0 bBREWoFowUcv/rwRQaE2m4UV//12pXyNPZ5s/UfGbyEq7RM/lrBAvD1rcHujzvdLuBeG 2PIZIF0p6opjhEvhycXBLGkjz3AMLJbViOLLndfBaHj6FNNTFE+C+H1qsCGMyr8rUXf6 9zEQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=r08GbZBc9MZOPf1VMEs19BOj8V5x3kE7vhFcnFAIVfk=; b=kVxNhK6CxmqVDZW307WDr88wVXjPk2E/K0ZBZ/jE2tGA/rzXy4cSMDiNBT8wWt27P8 yixpdGdasvAlzuVK1RLBwLBAJfm7IhXHj2Qv+q1lsPUo21AszqsL/oVB60wqdaPurwzu vXjE6/GP0Ec45pFxrBc9HLZ3/mWhMzym5Rhue/PoWTfHGiKZz01zc8/rxz0dckzl8H0w HTIplKaeEU85e28ZF14Vn8tHbp2zwXXFV1IIgVtATqzUNZQtnIPqa164SZLDRuRhHSHs loIFMfAmakiuESb32S1/QEOQ4JLDZZDF4buLqueAQ9/8eDSznwe45nWRrJwe3E42f7tb iufQ== X-Gm-Message-State: APjAAAVt/AwUVRa2jAW/rvz2Uu6QamY3ULXO/KLrPnx8EAQd14MYpLNP RuUq+T/jjdcBf57jpAeqh4Ybc+scuwbd2isGtT0h7vUyzE0Few== X-Google-Smtp-Source: APXvYqzSBYM8/7HDmjsojE3d9OD2R0PPFfuT5qj08LlH6c85OuNq0ArDOKdRdbg36B2XZlApOFVs6+UBQhBopLU/MMI= X-Received: by 2002:a2e:ca:: with SMTP id e71mr20753838lji.137.1552431265313; Tue, 12 Mar 2019 15:54:25 -0700 (PDT) MIME-Version: 1.0 Date: Tue, 12 Mar 2019 23:54:13 +0100 Message-ID: To: PHP Internals Content-Type: multipart/alternative; boundary="0000000000001b8d330583ed92f0" Subject: On fixing DOMNameSpaceNode and DOM NS API Inconsistency Problems From: kontakt@beberlei.de (Benjamin Eberlei) --0000000000001b8d330583ed92f0 Content-Type: text/plain; charset="UTF-8" Hi everyone, While looking for things to work on in php-src my friend Thomas pointed me to a peculiar special case in ext/dom that leads to massive inconsistency problems in the API. There is an undocumented class DOMNameSpaceNode that gets returned from DOMElement::getAttributeNode(NS) if you select an attribute that represents a namespace (xmlns). This special case is intentionally handled in the code, contrary to Pythons DOM Extension which doesn't do this and contrary to the DOM Specification, which does not have a special DOMNameSpaceNode. Its all DOMAttr. Problematically DOMNameSpaceNode doesn't extend from DOMAttr, you cannot pass this class to DOMElement::removeAttributeNode(DOMAttr $attr): Fatal error: Uncaught TypeError: Argument 1 passed to DOMElement::removeAttributeNode() must be an instance of DOMAttr, instance of DOMNameSpaceNode given Code example here: https://3v4l.org/jkC5s In addition the DOMNameSpaceNode renames all properties compared to DOMAttr, clearly violating the interface documented here http://php.net/manual/de/domelement.getattributenode.php Two potential fixes come to mind: 1. Have DOMNameSpaceNode extend DOMAttr - maybe this was the originally intended behavior, becuase the properties are all named differently? 2. Have DOMElement::removeAttributeNode accept also a DOMNameSpaceNode (3.) Remove the non standard DOMNameSpaceNode class and use a DOMAttr instead I think approach #1 is the right one from a BC perspective and also fixing the DOM API to be more consistent with the standard. But I am also not opposed to #3 in the longer term, looking at Github DOMNamespaceNode in the open source world is only used for one apc test, in PHP Compat and when dumping it in Symfony Var Dumper. I imagine its more deeply used in closed source code working deeply with XML. The second problem that this inconsistency creates is DOMElement::removeAttributeNS($namespaceUri, $localName). When you want to use it to remove a namespace attribute (xmlns). By default the xmlns namespace has the uri http://www.w3.org/2000/xmlns/. Hence removing xmlns:$name namespace would expected to be: DOMElement::removeAttributeNS("http://www.w3.org/2000/xmlns/", "foo") // removes xmlns:foo But thats not how it works, there is special code implemented that requires you to pick the URI from xmlns:foo value: DOMElement::removeAttributeNS("urn:foo", "foo"); But if your node has an attribute in this namespace with the same name, it gets removed as well: Would incorrectly become this, removing 2 attributes: This is a bug that cannot be fixed in a BC way. I have no clue how to fix this completly broken behavior in a good way. I propose to break BC here and requiring "http://www.w3.org/2000/xmlns/" as namespaceURI from PHP 8.0 to delete a namespace (xmlns) attribute. Should I put both issues into an RFC or are these rather bugs that can be fixed without an RFC? greetings Benjamin --0000000000001b8d330583ed92f0--