Newsgroups: php.internals
From: anatol.php@belski.net ("Anatol Belski")
To: 'Ángel González'
Cc: 'Christian Schneider', 'PHP internals'
Subject: RE: [PHP-DEV] PCRE jit bug with UTF-8 and lookbehind assertion
Date: Sun, 21 Feb 2016 11:42:35 +0100
Message-ID: <001001d16c94$954769e0$bfd63da0$@belski.net>
In-Reply-To: <56C90452.7040206@gmail.com>

> -----Original Message-----
> From: Ángel González [mailto:keisial@gmail.com]
> Sent: Sunday, February 21, 2016 1:27 AM
> To: Anatol Belski
> Cc: 'Christian Schneider'; 'PHP internals'
> Subject: Re: [PHP-DEV] PCRE jit bug with UTF-8 and lookbehind assertion
>
> On 19/02/16 09:20, Anatol Belski wrote:
> > Could you please write back, what is the output difference between those
> > two commands? Thanks. Anatol
> In the first case, it correctly outputs «x°z» (78 c2 b0 7a). With jit enabled it
> produces «x z» (78 c2 7a). That is, it is only outputting the lower byte of the utf-8
> encoding of the U+00B0 character. Tested on PHP 7.0.3 using the system libpcre
> 8.38
>

Were you putting the snippets into a file or testing on the console? When I tested this on the console, some characters were partially swallowed by the terminal (which was a UTF-8 terminal). When I put the snippets into a file, the output is the same for both: "x°z".

Please see also the continued discussion in the original ticket, https://bugs.exim.org/show_bug.cgi?id=1189 . The offsets delivered by PCRE also seem to be correct, and valgrind doesn't find anything.

It would be great if you could confirm these findings.

Thanks

Anatol
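
To compare the two results without relying on how a terminal renders them, the raw bytes can be inspected directly. The following is only a minimal sketch in Python and is not part of the original thread; the helper name `describe` and the two constants are made up for illustration, and the byte values are simply the ones quoted above.

```python
# Hypothetical helper, not from the thread: inspect raw output bytes
# instead of trusting how a terminal renders them.

EXPECTED = bytes.fromhex("78c2b07a")  # "x°z": 'x', U+00B0 encoded as c2 b0, 'z'
REPORTED = bytes.fromhex("78c27a")    # byte sequence reported with jit enabled


def describe(data: bytes) -> str:
    """Return a hex dump plus the decoded text, or a note if it is not valid UTF-8."""
    hex_dump = data.hex(" ")  # separator argument requires Python 3.8+
    try:
        return f"{hex_dump} -> {data.decode('utf-8')!r}"
    except UnicodeDecodeError as exc:
        return f"{hex_dump} -> invalid UTF-8 at byte offset {exc.start}"


if __name__ == "__main__":
    print(describe(EXPECTED))
    print(describe(REPORTED))
```

A c2 lead byte that is not followed by a continuation byte is not valid UTF-8, so what a terminal shows for it is not a reliable indicator either way. Redirecting the script output to a file and feeding the bytes read from that file to a check like this one avoids the terminal entirely.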