Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:27662 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 7659 invoked by uid 1010); 25 Jan 2007 18:32:00 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 7644 invoked from network); 25 Jan 2007 18:32:00 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 25 Jan 2007 18:32:00 -0000 Authentication-Results: pb1.pair.com header.from=andrei@gravitonic.com; sender-id=unknown Authentication-Results: pb1.pair.com smtp.mail=andrei@gravitonic.com; spf=permerror; sender-id=unknown Received-SPF: error (pb1.pair.com: domain gravitonic.com from 204.11.219.139 cause and error) X-PHP-List-Original-Sender: andrei@gravitonic.com X-Host-Fingerprint: 204.11.219.139 lerdorf.com Linux 2.5 (sometimes 2.4) (4) Received: from [204.11.219.139] ([204.11.219.139:40947] helo=lerdorf.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 31/F7-13597-F97F8B54 for ; Thu, 25 Jan 2007 13:32:00 -0500 Received: from [66.228.175.145] (borndress-lm.corp.yahoo.com [66.228.175.145]) (authenticated bits=0) by lerdorf.com (8.13.8/8.13.8/Debian-3) with ESMTP id l0PIVsKp021621; Thu, 25 Jan 2007 10:31:54 -0800 In-Reply-To: References: <0F741213-BCA4-4923-A83A-3E4E9C561DAE@prohost.org> <45B8B2E5.4010204@zend.com> <40869.195.22.180.233.1169733866.squirrel@avilys.eik.lt> <3ED37F9A-9BC8-4BBA-BB85-77BB0B188074@prohost.org> <000b01c74090$60bce950$0100a8c0@pc07653> <017A7F13-255C-4C7E-B22F-7481CCE07BAB@prohost.org> <000a01c74093$b03dd180$0100a8c0@pc07653> <0EFF1969-038A-4F67-872C-674B99E75009@prohost.org> <6b4d01c77cd1c8ca09b68d822bcd1f15@gravitonic.com> Mime-Version: 1.0 (Apple Message framework v624) Content-Type: text/plain; charset=US-ASCII; format=flowed Message-ID: Content-Transfer-Encoding: 7bit Cc: "Ilia Alshanetsky" , "Nuno Lopes" , internals@lists.php.net Date: Thu, 25 Jan 2007 10:32:48 -0800 To: Pierre X-Mailer: Apple Mail (2.624) Subject: Re: [PHP-DEV] Re: PHP 5.2.1RC3 Released From: andrei@gravitonic.com (Andrei Zmievski) Please do not confuse an encoding (UTF-8) with full locale information (which is what CLDR provides). -Andrei On Jan 25, 2007, at 10:20 AM, Pierre wrote: > On 1/25/07, Andrei Zmievski wrote: >> Because with UTF-8, PCRE already knows the uppercase and lowercase >> equivalents, without having to rely on the POSIX locales. > > Yes, UTF-8 covers many aspects but does it know about words, white > spaces (not sure if ws are always the same) and other locale specific > issues? generally, not only pcre. Maybe it is more something for ICU > directly, as you said later in this thread. > > --Pierre