Newsgroups: php.internals
Message-ID: <30bd80240708210832s60972bbm811c258233a386a6@mail.gmail.com>
Date: Tue, 21 Aug 2007 11:32:32 -0400
From: nicobn@php.net (Nicolas Bérard-Nault)
To: "Andrei Zmievski"
Cc: "David Coallier", "Stefan Priebsch", RQuadling@googlemail.com, "Lukas Kahwe Smith", "Antony Dovgal", "Rasmus Lerdorf", "Derick Rethans", "Cristian Rodriguez", internals@lists.php.net
Subject: Re: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6?

Hi everybody,

I first want to personally thank everybody who has voiced an opinion on this subject, as it shows how much concern you all have for PHP. To give a bit of background, I am one of the PHP Google Summer of Code students, and part of my project was to port/create a PHP 6 application, namely the Jaws CMS. The experience proved very instructive for evaluating the new functionality in PHP 6.

My experience with unicode.semantics has been very frustrating. Most mainstream projects cannot control their environments and have to be as portable as possible. To create a portable application for PHP 6, you have to consider the two different unicode.semantics scenarios AND the possibility that the switch, for some reason, might be turned on or off at any time in the future. Even if you don't care about Unicode or have never heard of it, this matters to you.

I have one specific example of where this can be a HUGE headache. Let's say you have some serialized configuration files, saved with unicode.semantics = 1, each containing a single configuration array. Now, for some reason, the administrator sets unicode.semantics = 0 and your configuration file does not work anymore. That's because $arr[(unicode) 'key'] and $arr[(binary) 'key'] are different: the keys were saved as Unicode strings, but your code now looks them up with binary strings. Hence, every key in the configuration array has to be accessed explicitly with either (binary) or (unicode).

Now, make up your own examples of how the switch will affect the life of PHP programmers in the future and you will discover some disastrous scenarios. From now on, every string will have to be explicitly set/cast, because you never know whether you'll get a binary string or a Unicode string when you ask for a string. Literally, when PHP 6 becomes mainstream, all programmers will have to be re-educated to face these issues. As far as I'm concerned, this is unprecedented in the history of the PHP project. PHP: loosely typed, except for strings.

In the end, the switch benefits only the developers of very specialized applications that run on big websites that can control their environments. I've seen the 95%-who-don't-care figure circulating on this thread. If that were really the case, why was the feature implemented with so much care in PHP in the first place? I, for one, do not agree with the 95% figure.
Most, if not all, of the fastest-growing markets in the world are non-English-speaking, and native Unicode support makes PHP a prime choice for them. We're not talking about 10 folks in Iowa here. We're talking about billions.

Overall, I think the damn switch is simply not a good idea. It removes a good part of what made PHP a success: simplicity. Get rid of it once and for all.

On 8/20/07, Andrei Zmievski wrote:
>
> Nothing is happening, as far as I can tell. We are at an impasse,
> basically. Personally, I am fine with removing the damn switch and
> going forward with PHP 6 as Unicode-only. God knows it will rid us of
> at least one headache - having to discuss it anymore.
>
> -Andrei
> http://10fathoms.org/vu - daily photoblog
>
> On Aug 20, 2007, at 9:14 AM, David Coallier wrote:
> > Time to put gas on the fire.
> >
> > Is this flag going to be removed or what? What is happening here in
> > the background that we are not seeing? :)
> >
>

-- 
Nicolas Bérard-Nault (nicobn@gmail.com)
Student, D.E.C. Sciences, Lettres & Arts
Cégep de Sherbrooke
Homepage: http://nicobn.googlepages.com