Newsgroups: php.internals
Message-ID: <2186.24.1.37.132.1184202376.squirrel@www.l-i-e.com>
In-Reply-To: <4691EC69.2010102@zend.com>
Date: Wed, 11 Jul 2007 20:06:16 -0500 (CDT)
From: ceo@l-i-e.com ("Richard Lynch")
To: "Stanislav Malyshev"
Cc: "Tomas Kuliavas", internals@lists.php.net
Subject: Re: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6?

On Mon, July 9, 2007 3:06 am, Stanislav Malyshev wrote:
>> But now \xF0 isn't going to be ASCII 128 anymore, is it?
>
> ASCII doesn't have any characters beyond 0x7f AFAIK, but it doesn't
> matter, I get what you mean. \xF0 in unicode mode would be U+00F0 of
> course. Now how preg_match should handle it depends on preg_match.

I should have said "Extended ASCII".

And, unfortunately, there are at least 3 commonly used "Extended ASCII" encodings out there, and, yes, this is exactly what Unicode is trying to solve.
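To make the "Extended ASCII" ambiguity concrete, here is a minimal sketch. It is written in Python rather than PHP, purely as an illustration, and it says nothing about how PHP 6 itself behaves. The codec names (cp1252, cp437, mac_roman) are Python's names for three code pages picked as examples, and the helper functions are hypothetical. It shows that the single byte \xF0 maps to a different character under each code page, and that the original byte is recoverable only when the same code page is used in both directions.

```python
# Illustration only: Python codecs stand in for "Extended ASCII" variants.
RAW = b"\xf0"

# Python codec names, chosen here as examples of commonly seen code pages.
CODE_PAGES = ["cp1252", "cp437", "mac_roman"]


def interpretations(raw, code_pages):
    """Map each code page name to the text it yields for the same raw bytes."""
    return {name: raw.decode(name) for name in code_pages}


def round_trips(raw, code_page):
    """True if decoding then re-encoding with one code page restores the bytes."""
    return raw.decode(code_page).encode(code_page) == raw


if __name__ == "__main__":
    # Print the code point each code page assigns to the same byte.
    for name, text in interpretations(RAW, CODE_PAGES).items():
        print(f"{name}: U+{ord(text):04X}")
    print(round_trips(RAW, "cp1252"))
```

Under Windows-1252 the byte comes out as U+00F0, which matches the "unicode mode" reading quoted above, while the DOS code page 437 turns the same byte into U+2261.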
Only problem is, the data coming into most web apps is usually NOT UTF-16, nor even UTF-8, but "Windows Extended ASCII" (more or less), and most end users of PHP do not have the luxury of being able to have a dedicated server.

So they are going to be stuck with their data getting totally munged into UTF-16 on new PHP installations and, if I'm following this thread correctly, NOT going to be able to get back to the actual data that came IN to their web application.

So the ISPs aren't going to install PHP 6 because their users are going to be screaming at them that it broke their applications.

Or they'll all install it with this goofy non-Unicode mode, in which case, there's not much point to them having installed it, and y'all will be effectively maintaining 3 branches:

PHP 5
PHP 6 ASCII
PHP 6 Unicode

Unless you drop PHP 6 ASCII, in which case even fewer will bother to install PHP 6, not even in unicode.semantics off mode.

Seems to me we're painted into a corner where the number of people who actually install PHP 6 is going to be abysmally small...

But maybe I'm just being pessimistic.

-- 
Some people have a "gift" link here.
Know what I want?
I want you to buy a CD from some indie artist.
http://cdbaby.com/browse/from/lynch
Yeah, I get a buck. So?