Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:72641 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 99710 invoked from network); 17 Feb 2014 03:13:58 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 17 Feb 2014 03:13:58 -0000 Authentication-Results: pb1.pair.com header.from=smalyshev@sugarcrm.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=smalyshev@sugarcrm.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain sugarcrm.com designates 108.166.43.83 as permitted sender) X-PHP-List-Original-Sender: smalyshev@sugarcrm.com X-Host-Fingerprint: 108.166.43.83 smtp83.ord1c.emailsrvr.com Linux 2.6 Received: from [108.166.43.83] ([108.166.43.83:49914] helo=smtp83.ord1c.emailsrvr.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 8B/D6-56374-57E71035 for ; Sun, 16 Feb 2014 22:13:58 -0500 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp3.relay.ord1c.emailsrvr.com (SMTP Server) with ESMTP id 929BD50365; Sun, 16 Feb 2014 22:13:55 -0500 (EST) X-Virus-Scanned: OK Received: by smtp3.relay.ord1c.emailsrvr.com (Authenticated sender: smalyshev-AT-sugarcrm.com) with ESMTPSA id 2CAF45037E; Sun, 16 Feb 2014 22:13:55 -0500 (EST) Message-ID: <53017E72.3050807@sugarcrm.com> Date: Sun, 16 Feb 2014 19:13:54 -0800 Organization: SugarCRM User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:24.0) Gecko/20100101 Thunderbird/24.3.0 MIME-Version: 1.0 To: Rasmus Lerdorf , Rowan Collins , "internals@lists.php.net" References: <50100EC8.3040102@ajf.me> <52FDF7BC.8050408@lsces.co.uk> <52FE46D2.4060903@gmail.com> <52FE6FEA.5050204@lerdorf.com> In-Reply-To: <52FE6FEA.5050204@lerdorf.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Subject: Re: [PHP-DEV] PHP6 wiki page From: smalyshev@sugarcrm.com (Stas Malyshev) Hi! > operation. There are a ton of non-obvious things beyond simple string > manipulation. String collation alone is massively complicated, for example. Oh yes, and if somebody thinks case sensitivity is weird now, wait until Unicode gets into play. There for some chars when you change the case string length changes, and for some conversion is not roundtrip-safe. And you have various long form/short form combining issues which means you need to normalize everything on every corner. So letting Unicode into things like identifiers opens a huge container of worms. Also, if one wants to appreciate what other cans of worms are hiding there, I recommend this oldie but goodie: http://stackoverflow.com/a/6163129/214196 It's about Perl, but we'd have many of the same issues. -- Stanislav Malyshev, Software Architect SugarCRM: http://www.sugarcrm.com/ (408)454-6900 ext. 227