Newsgroups: php.internals
Message-ID: <481DE8DC.9070109@macvicar.net>
Date: Sun, 04 May 2008 17:48:28 +0100
From: scott@macvicar.net (Scott MacVicar)
To: Tomas Kuliavas
CC: internals@lists.php.net
References: <4BD5A050-02F2-46BD-B867-FA8CA12FF1BD@macvicar.net> <48988.78.61.224.253.1209918881.nsm@avilys.eik.lt>
In-Reply-To: <48988.78.61.224.253.1209918881.nsm@avilys.eik.lt>
Subject: Re: [PHP-DEV] Removal of unicode_semantics

Tomas Kuliavas wrote:
>> We've discussed this a few times in the past and it's time to make a
>> final decision about its removal.
>>
>> I think most people have agreed that this is the way forward but no
>> one has produced a patch.
>> I have a student working on unicode
>> conversion for the Google Summer of Code and this would help make it
>> simpler.
>
> unicode_semantics=on breaks backwards compatibility in scripts that have
> implemented multiple character set support in current PHP setups.
>
> If setting is removed, instead of maintaining at least some bits of
> backwards compatibility and doing some additional work, you force massive
> code rewrites in scripts that depend on working charset support and more
> work for people, who use interpreter.
>
> Every time somebody proposes removal of this setting, they claim that
> majority agreed on it when there is no agreement on anything. People only
> defended own positions and we had other flame about unicode_semantics.
>

There has been agreement among the people who actually contribute
towards the development of PHP.

It certainly doesn't give backwards compatibility. Because you are able
to turn it off in php.ini, developers will need to maintain two versions
of their code: one for it off and the other for it on.

My biggest concern is the two code bases that need to be maintained by
the PHP developers: you need two branches, one for handling Unicode
strings and one for native strings.

To sum it up, unicode_semantics is in the exact same vein as
zend.ze1_compatibility_mode, and that was a complete failure. Before any
developers decide they need to port things to PHP 6, we need to just make
it Unicode only.

Scott