Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:73102 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 23906 invoked from network); 13 Mar 2014 09:17:24 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 13 Mar 2014 09:17:24 -0000 Authentication-Results: pb1.pair.com header.from=lester@lsces.co.uk; sender-id=unknown Authentication-Results: pb1.pair.com smtp.mail=lester@lsces.co.uk; spf=permerror; sender-id=unknown Received-SPF: error (pb1.pair.com: domain lsces.co.uk from 217.147.176.204 cause and error) X-PHP-List-Original-Sender: lester@lsces.co.uk X-Host-Fingerprint: 217.147.176.204 mail4.serversure.net Linux 2.6 Received: from [217.147.176.204] ([217.147.176.204:46131] helo=mail4.serversure.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 2C/C1-12560-2A771235 for ; Thu, 13 Mar 2014 04:17:23 -0500 Received: (qmail 11371 invoked by uid 89); 13 Mar 2014 09:17:19 -0000 Received: by simscan 1.3.1 ppid: 11362, pid: 11367, t: 0.0606s scanners: attach: 1.3.1 clamav: 0.96/m:52 Received: from unknown (HELO linux-dev4.lsces.org.uk) (lester@rainbowdigitalmedia.org.uk@81.138.11.136) by mail4.serversure.net with ESMTPA; 13 Mar 2014 09:17:19 -0000 Message-ID: <532177CE.4080706@lsces.co.uk> Date: Thu, 13 Mar 2014 09:18:06 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:27.0) Gecko/20100101 Firefox/27.0 SeaMonkey/2.24 MIME-Version: 1.0 To: "internals@lists.php.net >> PHP internals" References: <531EE602.3090207@lsces.co.uk> <531EEE2A.2000602@googlemail.com> <531F0146.5010701@lsces.co.uk> <53202DC5.4010306@googlemail.com> <532033E1.60602@lsces.co.uk> <53203687.7090405@googlemail.com> <532037F4.6020204@googlemail.com> <532049BB.2080605@lsces.co.uk> <53204C6B.3000506@googlemail.com> <5320511A.7040606@lsces.co.uk> <5320F554.5000109@lerdorf.com> <53210857.5040908@googlemail.com> <53216D44.5040802@googlemail.com> In-Reply-To: <53216D44.5040802@googlemail.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Subject: Re: [PHP-DEV] Unicode strings? From: lester@lsces.co.uk (Lester Caine) Crypto Compress wrote: >> Good point, but users should use NFC UTF-8 without BOM for variable/function >> names. >> It would be documentation issue. > > in the languages i know combining diacritics are not common so can't evaluate > how practical it is to type those. Would it be impossible to change code with a > dumb editor? > > $café !== $café > 0x63 0x61 0x66 0xC3 0xA9 > 0x63 0x61 0x66 0x65 0xCC 0x81 'cryptocompress' (is that really on your passport :( ) This is exactly the area we need to agree on a plan moving forward. There are a number of options on the table 1 - Limit variable and other names to 'ASCII' only characters so that case folding can be maintained. 2 - Remove 'case insensitivity' but not just for point 1 reasons. ( I see this as your example being two different strings ;) ) 3 - Allow unicode names to be used in places where they currently cause problems. Not actually using unicode variable names myself, I still don't understand where the problems are with '3' except for the simple comparison case where normalizing and case conversion creates a minefield? People are currently using unicode in these areas and understand many of the restrictions? -- Lester Caine - G8HFL ----------------------------- Contact - http://lsces.co.uk/wiki/?page=contact L.S.Caine Electronic Services - http://lsces.co.uk EnquirySolve - http://enquirysolve.com/ Model Engineers Digital Workshop - http://medw.co.uk Rainbow Digital Media - http://rainbowdigitalmedia.co.uk