Newsgroups: php.internals
Date: Tue, 10 Jul 2007 09:30:47 -0700
Message-ID: <698DE66518E7CA45812BD18E807866CE64794E@us-ex1.zend.net>
To: "Andi Gutmans", "Antony Dovgal", "Andrei Zmievski"
Cc: "Stas Malyshev"
Subject: RE: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6?
From: andi@zend.com ("Andi Gutmans")

I was thinking a bit more about this yesterday. Even if I agreed with
this discussion (which I don't at this point in time), I think it is
being had far too early. We currently have a very big problem with the
ability to upgrade to PHP 6, and making decisions before people have
actually gotten their feet wet and seen what the issues are is not a
good idea. Purist decisions tend to fail when they meet the real world.

What I really think we need to do for this release, which we haven't
been good at doing in the past, is build a PHP Compatibility Team which
tries to port many applications to PHP 6 and finds the issues in doing
this port (with both unicode_semantics=on and unicode_semantics=off).
We can then learn from this experience and have good documentation on
how to upgrade to both modes and, in some cases, as we have done in the
past 2-3 weeks, tweak PHP 6 to not break backwards compatibility. It is
possible in many cases. It's something we are willing to spend time on
and, as I mentioned, have already started to do, but it would really
require a larger number of volunteers to pick various apps and port
them.
This kind of information would be far more valuable to the project at
this point than a prolonged thread about a piece of software which
isn't finished (and would also give more information for a discussion
like the one we've been having). No one really knows how good or bad a
situation we are in right now. I know from my end it doesn't look great
yet.

Andi

> -----Original Message-----
> From: Andi Gutmans [mailto:andi@zend.com]
> Sent: Monday, July 09, 2007 7:39 PM
> To: Antony Dovgal; Andrei Zmievski
> Cc: Stas Malyshev; internals@lists.php.net
> Subject: RE: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6?
>
> A large amount of the dual IS_UNICODE/IS_STRING code will need to
> stay in the code base anyway, as we will be supporting binary strings
> in PHP 6. So it's not accurate that all these maintenance issues will
> be resolved by not supporting unicode_semantics=off.
>
> I believe, unlike what Andrei said, that for a large part of our
> community (probably the majority) a default of unicode_semantics=on
> will not be of interest (we don't live in a purist's world). Many
> won't want to run it because it's going to be significantly slower
> and will be harder for them to work with. This community will be best
> served by being able to run in native 8-bit mode with some Unicode
> functionality available if/when needed. Having dual mode in PHP 6 is
> not the same as forking two code bases. Features like namespaces
> still automatically reach both audiences.
>
> If we're talking purely about "what is most useful to the majority of
> our users", I'd actually argue that explicit Unicode strings would be
> the most convenient, i.e. instead of doing b"8bitstring" you'd do
> U"unicodestring". Other languages do the same and there are reasons
> for that. As we've decided on a more aggressive (and risky) approach,
> I think having this dual mode is extremely important.
> It will also make the upgrade path easier.
>
> Btw, I don't know how many of you have actually tried to port PHP 5
> apps to PHP 6, but it's quite a disaster. We made some fixes in the
> past 2-3 weeks and it's getting better, but it still requires a lot
> of work. If we don't make this easy then this is all not worth too
> much.
>
> This project has never been a purist's project, which is why it's
> been so successful; let's not start now...
>
> Andi
>
> --
> PHP Internals - PHP Runtime Development Mailing List
> To unsubscribe, visit: http://www.php.net/unsub.php