Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:47296 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 73571 invoked from network); 15 Mar 2010 23:33:15 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 15 Mar 2010 23:33:15 -0000 Authentication-Results: pb1.pair.com header.from=stas@zend.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=stas@zend.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain zend.com designates 212.25.124.185 as permitted sender) X-PHP-List-Original-Sender: stas@zend.com X-Host-Fingerprint: 212.25.124.185 il-mr1.zend.com Received: from [212.25.124.185] ([212.25.124.185:55035] helo=il-mr1.zend.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 85/2A-34010-9B3CE9B4 for ; Mon, 15 Mar 2010 18:33:14 -0500 Received: from us-gw1.zend.com (unknown [192.168.16.5]) by il-mr1.zend.com (Postfix) with ESMTP id 3401850504 for ; Tue, 16 Mar 2010 01:13:55 +0200 (IST) Received: from [192.168.16.93] ([192.168.16.93]) by us-gw1.zend.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 15 Mar 2010 16:33:08 -0700 Message-ID: <4B9EC3B2.7070901@zend.com> Date: Mon, 15 Mar 2010 16:33:06 -0700 Organization: Zend Technologies User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.1.8) Gecko/20100227 Thunderbird/3.0.3 MIME-Version: 1.0 To: internals@lists.php.net References: <4B9C9007.1080802@lsces.co.uk> In-Reply-To: <4B9C9007.1080802@lsces.co.uk> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 15 Mar 2010 23:33:08.0143 (UTC) FILETIME=[DE41A3F0:01CAC497] Subject: Re: [PHP-DEV] Where are we ACTUALLY on Unicode? From: stas@zend.com (Stanislav Malyshev) Hi! > What I am probably asking is what was the brick wall PHP6 hit. I was > under the impression that there was no agreement on 'switchable or only' > to unicode core? ( And those who did write PHP6 books seemed to have > their own views on which way the discussions would go ;) ). From what I can see, the biggest issues are these: 1. Performance - Unicode-based PHP right now requires tons of conversions when talking to outside world (like MySQL) which slows down the app significantly. Many extensions frequently used by PHP app writers (such as mysql, pcre, etc.) do not support UTF-16 properly. Also, inflated memory usage hurts scalability a lot. 2. Compatibility - it's hard to make existing app works with Unicode and doesn't lose in performance or doesn't have any weird scenarios where your passwords suddenly stop working because there's an extra recoding step in some md5() call. -- Stanislav Malyshev, Zend Software Architect stas@zend.com http://www.zend.com/ (408)253-8829 MSN: stas@zend.com