Newsgroups: php.internals
From: rasmus@lerdorf.com (Rasmus Lerdorf)
Date: Fri, 07 Oct 2005 21:55:33 +0100
Subject: Re: [PHP-DEV] Unicode Implementation
To: Ilia Alshanetsky
CC: Andrei Zmievski, Derick Rethans, PHP Developers Mailing List
Message-ID: <4346E0C5.3090001@lerdorf.com>
References: <99dd4f75f4ceebfe1c980cf439e97416@gravitonic.com> <4346E00A.8020504@prohost.org>
In-Reply-To: <4346E00A.8020504@prohost.org>

Ilia Alshanetsky wrote:
> Andrei Zmievski wrote:
>
>>>- we have a bit less BC.
>>
>>"A bit less"? I'd say it would break BC in a major way. People who want
>>to upgrade to PHP 6 would need to rewrite a lot of their scripts.
>
> I think most large applications will be in this boat anyway, we may as
> well do it properly once, so we don't end up with hacks on top of hacks
> just for the sake of BC.
>
>>We can try switching to _UNSAFE versions of the iterator macros - they
>>assume well-formed UTF-16, so they will be somewhat faster.
>
> We definitely need to look at that since if upgrading to 6.0 means a 3x
> slower operation very few people will even consider upgrading.

Which is why we need the unicode=off switch. I don't think there is any
way we can make Unicode PHP as fast as non-Unicode PHP. For people who
need Unicode support, Unicode PHP will be faster and easier than any
other way for them to get there, but for people who have no need for
Unicode it would be really nice to maintain the fast non-Unicode mode.

-Rasmus