Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:35951 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 50445 invoked from network); 4 Mar 2008 18:30:14 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 4 Mar 2008 18:30:14 -0000 Authentication-Results: pb1.pair.com header.from=helly@php.net; sender-id=unknown Authentication-Results: pb1.pair.com smtp.mail=helly@php.net; spf=unknown; sender-id=unknown Received-SPF: unknown (pb1.pair.com: domain php.net does not designate 85.214.94.56 as permitted sender) X-PHP-List-Original-Sender: helly@php.net X-Host-Fingerprint: 85.214.94.56 aixcept.net Linux 2.6 Received: from [85.214.94.56] ([85.214.94.56:37006] helo=h1149922.serverkompetenz.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 16/94-13488-D159DC74 for ; Tue, 04 Mar 2008 13:30:12 -0500 Received: from dhcp-172-28-202-237.zrh.corp.google.com (unknown [193.142.125.1]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by h1149922.serverkompetenz.net (Postfix) with ESMTP id 172C2121376; Tue, 4 Mar 2008 19:29:29 +0100 (CET) Date: Tue, 4 Mar 2008 19:29:28 +0100 Reply-To: Marcus Boerger X-Priority: 3 (Normal) Message-ID: <1314614232.20080304192928@marcus-boerger.de> To: "Andi Gutmans" CC: internals@lists.php.net In-Reply-To: <698DE66518E7CA45812BD18E807866CE01506D08@us-ex1.zend.net> References: <1706278209.20080302232134@marcus-boerger.de> <698DE66518E7CA45812BD18E807866CE01506D08@us-ex1.zend.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Transfer-Encoding: 8bit Subject: Re: [PHP-DEV] [RFC] Replace the flex-based scanner with an re2c [1] based lexer From: helly@php.net (Marcus Boerger) Hello Andi, Tuesday, March 4, 2008, 7:51:07 AM, you wrote: > Hi Marcus, Johannes, and all, > First of all let me say that I have no conceptual problem with replacing > the scanner with re2c. If it's cleaner, performs better and a better > maintained piece of software (let's hope Marcus doesn't get run over) > then we can move to re2c. > There are a few important things to consider though: > - There is a huge PHP/MySQL community in the far east especially in > Japan. You may not hear as much from them because they mostly don't post > on our public lists but it's large. They very much depend on multibyte > support and it works well for them (I have talked to several people in > those communities). Shift-JIS is a matter of fact for those communities. > We can't just dump them in PHP 5.3. > - We need to make sure that we have a streams story that works and > existing functionality is supported by it (sounds like this is almost > complete so probably not high risk). > - We should make sure we can achieve compatibility including supporting > functionality like declare(...) which is used by some including > multibyte guys. I haven't heard of a reason why this couldn't be > possible with RE2C. > I think all the above is doable but we shouldn't ship without > accomplishing that 100% compatibility especially telling the non-Latin > world that we will stop supporting them. > So at the end of the day it all boils down to timing. I have been > expecting Johannes to cut a beta any day now (I realize Sun acquisition > somewhat postponed his schedule). PHP 5.3 is on a pretty good track to a > good & stable release cycle. I think re-engineering a core piece of the > engine at this point adds considerable risk and would definitely prolong > the release cycle. > So while I'm supportive of embracing RE2C if we get commitment to reach > that 100% compatibility including multibyte support, I don't quite > understand the sense of urgency and why we'd want to introduce this risk > so late in the development of PHP 5.3. This is a risk the release > manager shouldn't really be willing to take. Rewriting this multibyte > support will require time and interaction with the communities that are > currently using it to make sure that it meets their needs. It will not > be a trivial project. > We can definitely work towards RE2C in parallel and as Stas said the > engine hasn't really been changing very much recently to make this hard > (we finished our todos for 5.3). We could even branch off PHP 5.4 right > after RC1 for PHP 5.3 and therefore reduce the time where this patch > would need to be maintained separately (although I think it can already > be maintained in a branch). > Let's consider all the angles in addition to wanting to get the code in > the tree asap. > Andi Give me any reason why we need 5.4 at this point? Any single one? Are you having a bet or a deal about 5.3 release date? And what is the deal, you do whatever you think goes in and that's a law? Best regards, Marcus