Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:43821 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 91230 invoked from network); 1 May 2009 00:55:45 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 1 May 2009 00:55:45 -0000 Authentication-Results: pb1.pair.com header.from=php_lists@realplain.com; sender-id=unknown Authentication-Results: pb1.pair.com smtp.mail=php_lists@realplain.com; spf=permerror; sender-id=unknown Received-SPF: error (pb1.pair.com: domain realplain.com from 209.151.69.1 cause and error) X-PHP-List-Original-Sender: php_lists@realplain.com X-Host-Fingerprint: 209.151.69.1 liberty.vosn.net Linux 2.4/2.6 Received: from [209.151.69.1] ([209.151.69.1:57169] helo=liberty.vosn.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 80/1A-32602-F884AF94 for ; Thu, 30 Apr 2009 20:55:43 -0400 Received: from 72-161-141-80.dyn.centurytel.net ([72.161.141.80]:56272 helo=pc1) by liberty.vosn.net with smtp (Exim 4.69) (envelope-from ) id 1Lzh2S-00067W-F4; Thu, 30 Apr 2009 18:55:40 -0600 Message-ID: <04C5CFB513A24CA1BBA14112DB6979D8@pc1> To: , "Dmitry Stogov" Cc: References: <6604D94D40FD465F992144110B075BB5@pc1> <49F94BC6.5060904@zend.com> Date: Thu, 30 Apr 2009 19:55:38 -0500 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=response Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5512 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579 X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - liberty.vosn.net X-AntiAbuse: Original Domain - lists.php.net X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - realplain.com Subject: Re: [PHP-DEV] Re: [PATCH] Scanner "diet" with fixes, etc. From: php_lists@realplain.com ("Matt Wilmas") Hi Dmitry, ----- Original Message ----- From: "Dmitry Stogov" Sent: Thursday, April 30, 2009 > Hi Matt, > > Does this patch fix EOF handling issues related to mmap()? (e.g. parsing > of files with size 4096, 8192, ...). Now we have two dirty fixes to handle > them correctly. I'm not really sure about the mmap() stuff, and haven't followed the workarounds that you or Brian have done... But as far as scanning strings/comments, there shouldn't be an EOF problem with them after my changes. Those were the ones that could contain NULL and got the EOF NULL(s) confused and scan too far. I guess that's the ZEND_MMAP_AHEAD padding...? So I don't know if reverting the "dirty fixes" would then still cause any problems with re2c scanning other tokens (that can't contain NULL). Brian would know more I guess, sooner than I could look into what to revert, when I don't fully understand that part. > The patch is quite big to understand it quickly. I'll probably take a look > on weekend. OK. I think the patch looks more complicated and is larger because of so many removed lines. :-) > -ANY_CHAR [^\x00] > +ANY_CHAR [^] > > Is [^] a correct regular expression? In re2c, yes, as Scott said. [^] is what it was before Brian excluded NULL. > Thanks. Dmitry. - Matt