Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:85750 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 24662 invoked from network); 9 Apr 2015 10:36:29 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 9 Apr 2015 10:36:29 -0000 Authentication-Results: pb1.pair.com header.from=julienpauli@gmail.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=julienpauli@gmail.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain gmail.com designates 74.125.82.44 as permitted sender) X-PHP-List-Original-Sender: julienpauli@gmail.com X-Host-Fingerprint: 74.125.82.44 mail-wg0-f44.google.com Received: from [74.125.82.44] ([74.125.82.44:36761] helo=mail-wg0-f44.google.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id E6/A0-17490-A2656255 for ; Thu, 09 Apr 2015 06:36:28 -0400 Received: by wgsk9 with SMTP id k9so92960033wgs.3 for ; Thu, 09 Apr 2015 03:36:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc:content-type; bh=0/hYdLpD6KH/0cDpEVdZUoQT19+bNlhNFJPpxJdoF4Y=; b=DW9+mfTdQcjbkL0ufsk4Xxq9NorsVaw9N/6UaWrqOTvU7O5bunw8oQdy5Z/rzfi6sR c+PMqEFeYDM/L1a/D9Bsw1BHWjXcvUtAhcBENpeFPDzXRJFHQh1plv4flaQdVLMmSy+E zmY5fcbVc6eaHVBq1hCvIUGOrDw50cWb4qiXWCUObBQbawir/KNbDuoWOpDhtrNtuuJK 2MRWAxXa3P7l5TB0Z96ThspbJoHF+Tei992+8pxANo/EflQtgcqhYDm+qVmM+qBMIbhk 5LmXnOpGwPKqtbV3pDbxR1Q3unlKCaJHAHl48gCcCUwP6I6OkpcZwng9QZc7UbaIMjWc fvBA== X-Received: by 10.180.95.10 with SMTP id dg10mr5038105wib.41.1428575783775; Thu, 09 Apr 2015 03:36:23 -0700 (PDT) MIME-Version: 1.0 Sender: julienpauli@gmail.com Received: by 10.194.198.210 with HTTP; Thu, 9 Apr 2015 03:35:43 -0700 (PDT) In-Reply-To: References: Date: Thu, 9 Apr 2015 12:35:43 +0200 X-Google-Sender-Auth: uphB07L8yp4oiYD7J5zEKXO1kQY Message-ID: To: Dmitry Stogov Cc: Anthony Ferrara , "internals@lists.php.net" Content-Type: multipart/alternative; boundary=f46d04447f8320720b051348384c Subject: Re: [PHP-DEV] Concern around growing complexity in engine - hash table specifically From: jpauli@php.net (Julien Pauli) --f46d04447f8320720b051348384c Content-Type: text/plain; charset=UTF-8 On Wed, Apr 8, 2015 at 10:55 PM, Dmitry Stogov wrote: > On Fri, Apr 3, 2015 at 9:57 PM, Anthony Ferrara > wrote: > > > All, > > > > I spent a little bit of time today trying to debug an issue with 7 > > that Drupal 8 was facing, specifically regarding an array index not > > behaving correctly ($array["key"] returned null, even though the key > > existed in the hash table). > > > > I noticed that the hash table implementation has gotten orders of > > magnitude more complex in recent times (since phpng was merged). > > > > Specifically, that ardata and arhash are now the same block of memory, > > and that we're now doing negative indexing into arData to get the hash > > map list. From Dmitry's commit message, it was done to keep the data > > that's accessed most often in the same CPU cache line. While I am sure > > that there are definitive performance gains to doing this, I do worry > > about the development and debugging costs of this added complexity. > > > > As well as the way it increases the busfactor of the project. > > > > There is definitely a tradeoff there, as the change is pretty well > > encapsulated behind macros. But that introduces a new level of > > abstraction. But deeper than that it really makes debugging with gdb a > > pain in the neck. > > > > Without hard data on this particular patch, I'm not suggesting we roll > > back the change or anything. I more just want to express concern with > > the trend lately to increase complexity significantly on developers > > for the sake of performance. > > > > > While I'm definitely not saying performance doesn't matter, I also > > think performance at all costs is dangerous. And I wonder if some of > > the more fundamental (even if isolated) changes such as this should be > > way more documented and include the performance justification for > > them. I'm definitely not suggesting an RFC, but perhaps some level of > > discussion should be required for these sorts of changes... > > > > I agree with Anthony. Many things however can be solved with a nice .gdbinit. We already have dump_ht() , dump_htptr() , f.e , that I'm using heavilly to debug HT in PHP5. Not talking about dump_bt(). I think one step is to improve our .gdbinit with many more features, and obviously port the actual ones to work with PHP7. A second step is documentation. Anthony, you know about our project phpinternalsbook.com, don't you ;-) There has been recent discussions on IRC to actually merge this project under php.net. I'm really feeling enthusiast about helping or even taking the lead of such a project : I would like php.net to hold a real, detailed documentation about internals. I think with PHP7 should come an internal documentation, somewhere behind php.net , that will explain to a C-aware developper our main internal structures and choices, especially about performance optimisations. Have you had a look at the new Zend Memory Manager ? It has become insanely complex, with many performance-turned code. Same, but in a lower footprint, for the executor : the executor stack frame has really changed from PHP5's one, and is also not very easy to debug (with a long alloced buffer shrinked with many pointer tricks that needs you to have a complete image of the memory buffer in your head). I won't be able myself to document all those tricks, because I'm not the author of them. I think Zend, through Dmitry, Nikic, Bob or Laruence , should help us understanding some concepts, if they are not around to help with the doc. Julien.Pauli --f46d04447f8320720b051348384c--