Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:30486 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 98224 invoked by uid 1010); 6 Jul 2007 09:24:42 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 98208 invoked from network); 6 Jul 2007 09:24:42 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 6 Jul 2007 09:24:42 -0000 Authentication-Results: pb1.pair.com smtp.mail=tokul@users.sourceforge.net; spf=permerror; sender-id=unknown Authentication-Results: pb1.pair.com header.from=tokul@users.sourceforge.net; sender-id=unknown Received-SPF: error (pb1.pair.com: domain users.sourceforge.net from 213.197.162.99 cause and error) X-PHP-List-Original-Sender: tokul@users.sourceforge.net X-Host-Fingerprint: 213.197.162.99 avilys.eik.lt Linux 2.6 Received: from [213.197.162.99] ([213.197.162.99:57147] helo=avilys.eik.lt) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 36/53-01565-75A0E864 for ; Fri, 06 Jul 2007 05:24:41 -0400 Received: from avilys.eik.lt (avilys.local [127.0.0.1]) by avilys.eik.lt (Postfix) with ESMTP id C13A41F5147 for ; Fri, 6 Jul 2007 12:22:44 +0300 (EEST) Received: from avilys.eik.lt (avilys.local [127.0.0.1]) by avilys.eik.lt (Postfix) with ESMTP id A9AA01F5145 for ; Fri, 6 Jul 2007 12:22:44 +0300 (EEST) Received: from 78.61.224.253 (NaSMail authenticated user tomas@topolis.lt) by avilys.eik.lt with HTTP; Fri, 6 Jul 2007 12:22:44 +0300 (EEST) Message-ID: <47498.78.61.224.253.1183713764.squirrel@avilys.eik.lt> In-Reply-To: <468DDFEB.3080404@zend.com> References: <1181829227.3478.3.camel@localhost.localdomain> <7d5a202f0706141844l3c75b556hdbecbcd5a43747c9@mail.gmail.com> <4671F184.2020401@lerdorf.com> <6sof73dj69ldpspfc5ukrc58qr9ckbin2b@4ax.com> <4677E7B1.2080305@lerdorf.com> <4677F5FB.1070206@lerdorf.com> <4678252F.2050803@sci.fi> <46783212.4020900@lerdorf.com> <34654.216.230.84.67.1183064088.squirrel@www.l-i-e.com> <54557.78.61.224.253.1183098089.squirrel@avilys.eik.lt> <2159.24.1.37.132.1183693437.squirrel@www.l-i-e.com> <468DDFEB.3080404@zend.com> Date: Fri, 6 Jul 2007 12:22:44 +0300 (EEST) To: internals@lists.php.net User-Agent: NaSMail/1.2 MIME-Version: 1.0 Content-Type: text/plain;charset=utf-8 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal X-Virus-Scanned: ClamAV using ClamSMTP Subject: Re: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6? From: tokul@users.sourceforge.net ("Tomas Kuliavas") >>> Unicode code points can be defined with \u, but PHP6 breaks existing >>> octal and hex escape sequences. > > I don't understand what this means... PHP6.0-200707060630 unicode.fallback_encoding => 'utf-8' => 'utf-8' unicode.filesystem_encoding => no value => no value unicode.http_input_encoding => 'utf-8' => 'utf-8' unicode.output_encoding => 'utf-8' => 'utf-8' unicode.runtime_encoding => 'utf-8' => 'utf-8' unicode.script_encoding => 'utf-8' => 'utf-8' unicode.semantics => On => On unicode.stream_encoding => UTF-8 => UTF-8 --- test.php --- --- ą is in utf-8 (latin small letter a with ogonek, latin extended-a range). It contains two bytes with 0xC4 0x85 values. Expected result and actual result for php 5.2.0: --- bool(true) int(1) int(1) --- "/[\240-\377]/" range should match 0xC4 byte. Actual result (PHP6): --- bool(false) int(0) int(1) ---