Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:100689 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 16787 invoked from network); 17 Sep 2017 12:19:02 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 17 Sep 2017 12:19:02 -0000 Authentication-Results: pb1.pair.com header.from=cmbecker69@gmx.de; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=cmbecker69@gmx.de; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain gmx.de designates 212.227.15.18 as permitted sender) X-PHP-List-Original-Sender: cmbecker69@gmx.de X-Host-Fingerprint: 212.227.15.18 mout.gmx.net Received: from [212.227.15.18] ([212.227.15.18:55677] helo=mout.gmx.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id EF/2A-19300-7286EB95 for ; Sun, 17 Sep 2017 08:18:53 -0400 Received: from [192.168.2.106] ([79.243.117.113]) by mail.gmx.com (mrgmx001 [212.227.17.190]) with ESMTPSA (Nemesis) id 0Lev1D-1d3hiG40ad-00qhJt; Sun, 17 Sep 2017 14:18:44 +0200 To: Rowan Collins , internals@lists.php.net, Lester Caine References: <7E527061-26D5-4E0C-BAF7-A6F1A940053B@gmail.com> Message-ID: <82cc3de5-6aac-6656-cee1-a83e1e3808b0@gmx.de> Date: Sun, 17 Sep 2017 14:18:44 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.3.0 MIME-Version: 1.0 In-Reply-To: <7E527061-26D5-4E0C-BAF7-A6F1A940053B@gmail.com> Content-Type: text/plain; charset=utf-8 Content-Language: de-DE Content-Transfer-Encoding: 8bit X-Provags-ID: V03:K0:lJKxc1/K+lDddjq/0qhYRIWwdpHhAwqTr0B9iuYcmt8NXv3yIp6 AqPbuXcO6WZlpAEhToMIC+bn/FmuXoFcBC1UT8wNMr4/lqWmzHFOJvH8hUy5fc4Rt37MAK5 ipcto4roZWfN3/Z7av5D6o+F+Dl7ymxb2seda2HJ0yGscShzEUcd9J6/qK76VdJC6ggu6K0 oCOzzxO/wKOz25VSDyflw== X-UI-Out-Filterresults: notjunk:1;V01:K0:avNMePARiY0=:lQG+lA/1WN3kUD6PRWNeR0 Ll8NFk1eJAt7AzbP5HKZHtB5JQa2YVR7yPk9WNKwV37OMahIIkCrLChP1Cc7/8DUSf4RsgLjK 5TZ698Ad3w1PWMRMv+6+YYiZ3f8C3LFvJRM92D7YXcRnGpvfPin1XprrQVOXBCFf/hFVqrPLr o5HE0FhrJ9aPpICVQ8272ckAUXzQOTR3fWqQJbg++y+Y73DVhB41IMqylzD4hNxMzY9XB22DL 2ZAuhL8T1nNdHRrZYDUU5ggFpMZ6QyDUkN91CI1shIivGS+a2o5bIp4SFFVJZtky1ol1cnF8A aQckjtms896FVE6KqIeU3IxfCSi9B1yr1ecWbd13FX+6MBPmTTl65ivxWxD7fLhnksxSEZ7XU 9Y7ayHMuLq0RPJ6A4HodZXrFCOZCcZ4U/a0FdWvlwB8UbbXthoabbvvLZS4rTlOGAEdzAHRPk bT8u2LYozpONWSn5w3I5trKzJ9yiLtfVTADshYBWk7F3Z8O/WeBMKhb6L0MbngMh7T2GRyuzw A8qz2FIC3QbjK/Y7yQLX3j4uZ+Z1fUektssvmikJJlhFnMRg3aiIyjk1RSYjyGSQI8UZO8Vws SvYSXf+h4Ve2E6Ww2D1I1TeTDqkBP/ZEBHag9CMUxwdUaa1ZlE9YdHkl6l4jCTrNW51WWZQ9l lLlpEC5FYc5N740t3+qvMEIvonbuieV2mg+9+48iJzP+R9y5oOXNaHel+HO+2Ut/nmDWkmeOA ynx6ex0xeJi7yZBXr81P4l7NShpFAUafo8JONNwhj4ZpiMPcgGCLHtgvGd4OLTxpukSSJ3Mpo o3VFDH8iM/oT/sPKTrYqoZkh6q84G2I3nUvH2j35+HEAqf5muQ= Subject: Re: [PHP-DEV] Progress or just 'a mess'? From: cmbecker69@gmx.de ("Christoph M. Becker") On 17.09.2017 at 12:53, Rowan Collins wrote: > I checked the PHP lang-spec repo expecting to find a set of Unicode classes, but it currently mentions "U+0080-U+00FF": https://github.com/php/php-langspec/blob/master/spec/09-lexical-structure.md#names That seems wrong to me, unless I'm looking at the wrong definition - the first part of that range is control characters, and you can have variables called things like $? (with an emoji as the entire name). The specification in the PHP manual[1] appears to be more appropriate for our current implementation: | As a regular expression, it would be expressed thus: '[a-zA-Z_\x7f- | \xff][a-zA-Z0-9_\x7f-\xff]*' With regard to control characters: that depends on the chosen character encoding; for instance in Windows-1252 the ยข character is mapped to \xA2. [1] -- Christoph M. Becker