Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:34897 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 55380 invoked by uid 1010); 23 Jan 2008 21:36:44 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 55348 invoked from network); 23 Jan 2008 21:36:44 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 23 Jan 2008 21:36:44 -0000 Authentication-Results: pb1.pair.com header.from=foolistbar@googlemail.com; sender-id=pass; domainkeys=bad Authentication-Results: pb1.pair.com smtp.mail=foolistbar@googlemail.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain googlemail.com designates 209.85.128.185 as permitted sender) DomainKey-Status: bad X-DomainKeys: Ecelerity dk_validate implementing draft-delany-domainkeys-base-01 X-PHP-List-Original-Sender: foolistbar@googlemail.com X-Host-Fingerprint: 209.85.128.185 fk-out-0910.google.com Received: from [209.85.128.185] ([209.85.128.185:53446] helo=fk-out-0910.google.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 4C/9C-17042-863B7974 for ; Wed, 23 Jan 2008 16:36:42 -0500 Received: by fk-out-0910.google.com with SMTP id f33so2354464fkf.7 for ; Wed, 23 Jan 2008 13:36:38 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:received:received:cc:message-id:from:to:in-reply-to:content-type:content-transfer-encoding:mime-version:subject:date:references:x-mailer; bh=fpimJ+AKUflzYTIKhbqmpb4wsnOKX9L7kalU5uz3a/0=; b=Z7MGuFyQKH+HnXtlDI0ueKG9TeyS0QOWSD2LxMz2tf5Rng/KHVg87M089QGJ+2BoBCbfsgXK/cdH/6HUeyUqPY+rv72/YIOE/F2VtDUOPJR7HCw3CoB47DzhFdt0k1cEYfgchLmH1b1txOf5KnT4uvfKxG+c8Zx1Bs/vtzgqPhc= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=cc:message-id:from:to:in-reply-to:content-type:content-transfer-encoding:mime-version:subject:date:references:x-mailer; b=pM2pCUz3ZnuZUiBHKpnmWG3ZLg/g8pHeVojayk9eHw9294sLq8D1oGVNuM7rQMapYjkSdpsDjwqQmb5UoIgK3QPe5r+yBEK29/mRqjzAlc7R/5vxBgRfQd9TmM7curdLnFqlkiqqBDHolMvP6efNUvS38ePWInE/AeibSzjlNO0= Received: by 10.82.187.2 with SMTP id k2mr18048038buf.19.1201124197739; Wed, 23 Jan 2008 13:36:37 -0800 (PST) Received: from ?192.168.0.116? ( [86.151.228.75]) by mx.google.com with ESMTPS id f13sm9399314gvd.9.2008.01.23.13.36.35 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 23 Jan 2008 13:36:36 -0800 (PST) Cc: php-dev Message-ID: <7324A2C5-C19B-4298-83D3-76194999FFEA@googlemail.com> To: Rasmus Lerdorf In-Reply-To: <4797A1BD.2050803@lerdorf.com> Content-Type: text/plain; charset=WINDOWS-1252; format=flowed; delsp=yes Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Apple Message framework v915) Date: Wed, 23 Jan 2008 21:36:30 +0000 References: <4794AE48.20005@daylessday.org> <38791.98.193.37.55.1201055548.squirrel@www.l-i-e.com> <47979570.4010703@lerdorf.com> <1201.209.254.223.2.1201118608.squirrel@www.l-i-e.com> <4797A1BD.2050803@lerdorf.com> X-Mailer: Apple Mail (2.915) Subject: Re: [PHP-DEV] why we must get rid of unicode.semantics switch ASAP From: foolistbar@googlemail.com (Geoffrey Sneddon) On 23 Jan 2008, at 20:21, Rasmus Lerdorf wrote: > u"foo" is a hack that will eventually disappear from the various =20 > languages that have it or something similar. I think we need to have binary strings as default with u"=85" for a =20 while (whenever that gets merged into the default string type it is =20 probably the real time to break everything without too much regard for =20= backwards compatibility at all =97 I don't think this time has come yet, = =20 though), without any switch to change that. To note: Python introduced support for Unicode in 2.0 (released 2000), =20= and Unicode strings are the default in 3.0 (as of 2007-12-07 alpha 2) =20= =97 the current plan is to release 3.0 final in mid-2008; this is eight =20= years between adding Unicode support and it becoming the default =97 =20 following this precedent would result in Unicode becoming default in =20 PHP at the very earliest of 2016, though I think quite such a long =20 delay in the case of PHP wouldn't be the best for the language, but I =20= doubt we can do it much quicker than around four years (without the =20 benefits outweighing the issues it will cause). > Unfortunately 10 years ago, I wasn't very concerned about that. 10 years ago, not all too many people were with scripting languages, =20 which is part of the reason why (scripting) languages with native =20 Unicode support are only just starting to real take off, as they will, =20= even though we will have Unicode support in PHP 6, have more mature =20 implementations than we will. -- Geoffrey Sneddon