Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:26452 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 41812 invoked by uid 1010); 10 Nov 2006 05:40:19 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 41797 invoked from network); 10 Nov 2006 05:40:19 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 10 Nov 2006 05:40:19 -0000 Authentication-Results: pb1.pair.com smtp.mail=php_lists@realplain.com; spf=permerror; sender-id=unknown Authentication-Results: pb1.pair.com header.from=php_lists@realplain.com; sender-id=unknown Received-SPF: error (pb1.pair.com: domain realplain.com from 209.142.136.132 cause and error) X-PHP-List-Original-Sender: php_lists@realplain.com X-Host-Fingerprint: 209.142.136.132 msa2-mx.centurytel.net Linux 2.4/2.6 Received: from [209.142.136.132] ([209.142.136.132:50724] helo=msa2-mx.centurytel.net) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id EC/90-24370-1C014554 for ; Fri, 10 Nov 2006 00:40:19 -0500 Received: from pc1 (d9-104.rt-bras.wnvl.centurytel.net [69.179.136.104]) by msa2-mx.centurytel.net (8.13.6/8.13.6) with SMTP id kAA5eDWd016287; Thu, 9 Nov 2006 23:40:13 -0600 Message-ID: <00f001c7048a$b2967030$0201a8c0@pc1> To: , "Andrei Zmievski" Date: Thu, 9 Nov 2006 23:40:14 -0600 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2800.1807 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1807 Subject: Unicode chars allowed in numbers? From: php_lists@realplain.com ("Matt Wilmas") Hi Andrei, et al., I was just looking at README.UNICODE, regarding interpretation of numbers: "we restrict numbers to consist only of ASCII digits," and "Numeric strings are supposed to adhere to the same rules." Is it correct to take that to mean only UChar's with values from '0'-'9'/0x30-0x39 (and 'a'-'z' equivalents for bases > 10)? I ask because in zend_u_strtol(), HANDLE_U_NUMERIC() for array keys, etc., the u_digit() function is used, which also allows non-ASCII, higher-value digit characters, doesn't it? But then in is_numeric_unicode(), when checking for hex numbers, the ASCII values '0' and 'x' are used, which is what I'd expect after reading README.UNICODE. Thanks for any clarification, Matt