Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:73095 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 97925 invoked from network); 13 Mar 2014 01:54:20 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 13 Mar 2014 01:54:20 -0000 Authentication-Results: pb1.pair.com header.from=yohgaki@gmail.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=yohgaki@gmail.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain gmail.com designates 209.85.215.46 as permitted sender) X-PHP-List-Original-Sender: yohgaki@gmail.com X-Host-Fingerprint: 209.85.215.46 mail-la0-f46.google.com Received: from [209.85.215.46] ([209.85.215.46:35207] helo=mail-la0-f46.google.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 2D/A2-17655-BCF01235 for ; Wed, 12 Mar 2014 20:54:20 -0500 Received: by mail-la0-f46.google.com with SMTP id hr17so223293lab.33 for ; Wed, 12 Mar 2014 18:54:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc:content-type; bh=hpyQsJP+C6bIcFbFt7zvzeeldneAGkezLTG4TY2tAiw=; b=nRpmdhsOMQ7Ro+wZsfzoYUPA/9SLOSi8NqFQIGTs6cf2rXGvwGetcV2cPjfE04QNDG QuS9QY29FxrnVK3Oifyoq0zT4iUSivQne0dW005IU6IyF0yYETcmp9U9n/thTZBEXolL WbNVqCFnHRXpTeI/eJyzH9KxD1Bs1nhaFDc5NonGayNbJrevfOF4WlL4QfiPO5qIxfba tdUnGcj9Peux8o1H4r197eY/D6wm6vbJzrwHUsEU1Ozb4Q7vYUpis7IAM7pDh8/FpKzy YruW/PECKBf9Wl3Ui/GaTthtfiXPdLJx6sBfKy659NOi1FIQV5/RlLzaShP4M8PzoNt3 YaVg== X-Received: by 10.153.7.200 with SMTP id de8mr462247lad.19.1394675656076; Wed, 12 Mar 2014 18:54:16 -0700 (PDT) MIME-Version: 1.0 Sender: yohgaki@gmail.com Received: by 10.112.205.73 with HTTP; Wed, 12 Mar 2014 18:53:36 -0700 (PDT) In-Reply-To: <53210857.5040908@googlemail.com> References: <531EE602.3090207@lsces.co.uk> <531EEE2A.2000602@googlemail.com> <531F0146.5010701@lsces.co.uk> <53202DC5.4010306@googlemail.com> <532033E1.60602@lsces.co.uk> <53203687.7090405@googlemail.com> <532037F4.6020204@googlemail.com> <532049BB.2080605@lsces.co.uk> <53204C6B.3000506@googlemail.com> <5320511A.7040606@lsces.co.uk> <5320F554.5000109@lerdorf.com> <53210857.5040908@googlemail.com> Date: Thu, 13 Mar 2014 10:53:36 +0900 X-Google-Sender-Auth: K7xyslZMxhw23kMSwObX-wJaZmk Message-ID: To: Crypto Compress Cc: PHP Developers Mailing List Content-Type: multipart/alternative; boundary=001a1137e9860eb30c04f4733bbc Subject: Re: [PHP-DEV] Unicode strings? From: yohgaki@ohgaki.net (Yasuo Ohgaki) --001a1137e9860eb30c04f4733bbc Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi all, On Thu, Mar 13, 2014 at 10:22 AM, Crypto Compress < cryptocompress@googlemail.com> wrote: > Am 13.03.2014 01:01, schrieb Rasmus Lerdorf: > > On 3/12/14, 5:20 AM, Lester Caine wrote: >> >>> Crypto Compress wrote: >>> >>>> Unicode variable names ARE secondary, but if the handling of unicode >>>>> works as >>>>> well as it seems to be for me then it may be an option that can be >>>>> considered. >>>>> >>>> http://3v4l.org/kWb0U >>>> Please help me, what is this about? >>>> >>> Exactly what has already been discussed? >>> You can use unicode strings in many areas of PHP, but it is not by >>> design, but rather as the result of 'holes' in the design. >>> >> That's not a hole in the design. It was quite deliberate and it had >> little to do with Unicode at the time. It was a deliberate effort to not >> artificially limit identifiers beyond that which the language syntax >> naturally prevented. Think ; , { } ( ) etc. >> >> -Rasmus >> > > IMHO it was the right decision to no artificially limit identifiers and i= t > is a fair trade-off for case-insensitivity without unicode (class =C3=9F{= } class > SS{}). > With unicode identifiers there is at least one more problem through > normalization to consider. somewhat simplified: $=E2=98=80=E2=98=81 and = $=E2=9B=85 (=3D=3D=3D in unicode) > Good point, but users should use NFC UTF-8 without BOM for variable/function names. It would be documentation issue. Regards, -- Yasuo Ohgaki yohgaki@ohgaki.net --001a1137e9860eb30c04f4733bbc--