Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:37651 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 54972 invoked from network); 14 May 2008 15:32:27 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 14 May 2008 15:32:27 -0000 Authentication-Results: pb1.pair.com smtp.mail=johannes@php.net; spf=unknown; sender-id=unknown Authentication-Results: pb1.pair.com header.from=johannes@php.net; sender-id=unknown Received-SPF: unknown (pb1.pair.com: domain php.net does not designate 83.243.58.163 as permitted sender) X-PHP-List-Original-Sender: johannes@php.net X-Host-Fingerprint: 83.243.58.163 mail4.netbeat.de Received: from [83.243.58.163] ([83.243.58.163:47633] helo=mail4.netbeat.de) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id D4/1B-31980-8060B284 for ; Wed, 14 May 2008 11:32:27 -0400 Received: (qmail 23725 invoked by uid 507); 14 May 2008 15:32:20 -0000 Received: from unknown (HELO ?129.187.52.117?) (postmaster%schlueters.de@129.187.52.117) by mail4.netbeat.de with ESMTPA; 14 May 2008 15:32:20 -0000 To: Steph Fox Cc: internals In-Reply-To: <01b101c8b5c2$66718e20$4001a8c0@foxbox> References: <01b101c8b5c2$66718e20$4001a8c0@foxbox> Content-Type: text/plain; charset=utf-8 Date: Wed, 14 May 2008 17:32:18 +0200 Message-ID: <1210779138.2986.2.camel@goldfinger> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-4.fc8) Content-Transfer-Encoding: 8bit Subject: Re: unicode.semantics ad infinitum From: johannes@php.net (Johannes =?ISO-8859-1?Q?Schl=FCter?=) Steph, On Wed, 2008-05-14 at 13:59 +0100, "Steph Fox" wrote: > I wrote a macro to allow us to use the same code for the extension in both > branches, but it occurs to me that the zstr union definition might change or > even disappear when PHP 6 becomes Unicode-only. Is that likely? I don't > know. So keeping it all together is total guesswork at present. In general: No. We still need both string types at most places, the effect of the u.s setting is the default type for strings in the engine. Most other functions still need to work with both kinds of string as parameter and such. (strlen(b"äöü") vs. atrlen(u"äöü")) johannes