Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:31043 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 66497 invoked by uid 1010); 18 Jul 2007 09:49:26 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 66482 invoked from network); 18 Jul 2007 09:49:26 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 18 Jul 2007 09:49:26 -0000 Authentication-Results: pb1.pair.com smtp.mail=wrowe@rowe-clan.net; spf=permerror; sender-id=unknown Authentication-Results: pb1.pair.com header.from=wrowe@rowe-clan.net; sender-id=unknown Received-SPF: error (pb1.pair.com: domain rowe-clan.net from 64.202.165.221 cause and error) X-PHP-List-Original-Sender: wrowe@rowe-clan.net X-Host-Fingerprint: 64.202.165.221 smtpout05-04.prod.mesa1.secureserver.net Linux 2.4/2.6 Received: from [64.202.165.221] ([64.202.165.221:38342] helo=smtpout05.prod.mesa1.secureserver.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 42/39-18661-422ED964 for ; Wed, 18 Jul 2007 05:49:26 -0400 Received: (qmail 3705 invoked from network); 18 Jul 2007 09:49:22 -0000 Received: from unknown (24.15.193.17) by smtpout05-04.prod.mesa1.secureserver.net (64.202.165.221) with ESMTP; 18 Jul 2007 09:49:22 -0000 Message-ID: <469DE221.7080805@rowe-clan.net> Date: Wed, 18 Jul 2007 04:49:21 -0500 User-Agent: Thunderbird 1.5.0.12 (X11/20070530) MIME-Version: 1.0 To: Rasmus Lerdorf CC: Derick Rethans , PHP Developers Mailing List References: <698DE66518E7CA45812BD18E807866CE648191@us-ex1.zend.net> <54C4340A-D9EA-4B5A-B39C-B55B29B1B3BC@prohost.org> <698DE66518E7CA45812BD18E807866CE648193@us-ex1.zend.net> <469B7FB1.1070507@pooteeweet.org> <698DE66518E7CA45812BD18E807866CE648290@us-ex1.zend.net> <7.0.1.0.2.20070718023255.0dc0eed0@zend.com> <469DE09F.9080509@lerdorf.com> In-Reply-To: <469DE09F.9080509@lerdorf.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Subject: Re: [PHP-DEV] POSIX regex From: wrowe@rowe-clan.net ("William A. Rowe, Jr.") Rasmus Lerdorf wrote: > > Perhaps the real argument here is whether we should be doing Unicode at all? I've watched these debate with tremendous interest. i18n is one of my pure 'hobbies' (my 'clients' are all quite happy with ISO-8859-1, and one of my backgrounds is WinNT where everything became unicode within the OS.) I'm pondering if utf-8 as the 'default' encoding wouldn't have been a more effective approach than pure unicode wide-chars, but no matter how you slice this, there will be several points of pain in the transition. Rethinking in terms of utf-8 might be an interesting exercise, just to draw up a comparison of 'what is broken' when sliding between a PHP5 ISO charset and a PHP6 Unicode or utf-8 charset.