Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:18187 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 74838 invoked by uid 1010); 16 Aug 2005 21:58:21 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 74823 invoked from network); 16 Aug 2005 21:58:21 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 16 Aug 2005 21:58:21 -0000 X-Host-Fingerprint: 213.237.67.135 213.237.67.135.adsl.by.worldonline.dk Linux 2.4/2.6 Received: from ([213.237.67.135:13743] helo=mail.ter.dk) by pb1.pair.com (ecelerity 2.0 beta r(6323M)) with SMTP id 58/4D-33075-C7162034 for ; Tue, 16 Aug 2005 17:58:20 -0400 Received: from workpenguin (workpenguin [192.168.1.32]) by mail.ter.dk (Symaskine) with SMTP id 29DA78A4015; Tue, 16 Aug 2005 23:58:01 +0200 (CEST) To: andrei@gravitonic.com (Andrei Zmievski) Cc: internals@lists.php.net Date: Tue, 16 Aug 2005 23:57:27 +0200 Message-ID: <57n4g11mj3g0f14huvvmcq852pis23oeop@4ax.com> References: <937066F0-AA5F-41E2-99A0-D74C7F44FFCA@gravitonic.com> In-Reply-To: <937066F0-AA5F-41E2-99A0-D74C7F44FFCA@gravitonic.com> X-Mailer: Forte Agent 1.91/32.564 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: Re: PHP Unicode support design document From: php@ter.dk (Peter Brodersen) On Wed, 10 Aug 2005 00:31:30 -0700, in php.internals andrei@gravitonic.com (Andrei Zmievski) wrote: > - existing PHP escape sequences are also interpreted as Unicode = codepoints, > including \xXX (hex) and \OOO (octal) numbers, e.g. "\x20" =3D> = U+0020 [..] >The single-quoted string is more restrictive than the other two types: = so >far the only escape sequence allowed inside of it was \', which = specifies >a literal single quote. However, single quoted strings now support the = new >Unicode character escape sequences as well. =46or what it's worth, would \1 be interpreted as well in single quotes (as it currently is in double quotes)? I suppose one of the places where \digit would be present in several cases is in poor-written pregs - such as: print preg_replace('/([A-Z])/','\1',$string); (where \1 is used as backreference instead of \\1 or $1) I'm not that worried about my own preg-usage. I just want to be prepared if I ever have to review some code for the purpose of migrating to PHP6. --=20 - Peter Brodersen