Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:30812 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 65126 invoked by uid 1010); 12 Jul 2007 01:12:48 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 65111 invoked from network); 12 Jul 2007 01:12:48 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 12 Jul 2007 01:12:48 -0000 Authentication-Results: pb1.pair.com smtp.mail=ceo@l-i-e.com; spf=permerror; sender-id=unknown Authentication-Results: pb1.pair.com header.from=ceo@l-i-e.com; sender-id=unknown Received-SPF: error (pb1.pair.com: domain l-i-e.com from 67.139.134.202 cause and error) X-PHP-List-Original-Sender: ceo@l-i-e.com X-Host-Fingerprint: 67.139.134.202 o2.hostbaby.com FreeBSD 4.7-5.2 (or MacOS X 10.2-10.3) (2) Received: from [67.139.134.202] ([67.139.134.202:1312] helo=o2.hostbaby.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 1A/78-05872-5FF75964 for ; Wed, 11 Jul 2007 21:12:41 -0400 Received: (qmail 87881 invoked by uid 98); 12 Jul 2007 01:12:01 -0000 Received: from 127.0.0.1 by o2.hostbaby.com (envelope-from , uid 1013) with qmail-scanner-2.01 (clamdscan: 0.88.7/3634. Clear:RC:1(127.0.0.1):. Processed in 0.074287 secs); 12 Jul 2007 01:12:01 -0000 Received: from localhost (HELO l-i-e.com) (127.0.0.1) by localhost with SMTP; 12 Jul 2007 01:12:01 -0000 Received: from 24.1.37.132 (SquirrelMail authenticated user ceo@l-i-e.com) by www.l-i-e.com with HTTP; Wed, 11 Jul 2007 20:12:01 -0500 (CDT) Message-ID: <2193.24.1.37.132.1184202721.squirrel@www.l-i-e.com> In-Reply-To: <43868.195.22.180.233.1183968428.squirrel@avilys.eik.lt> References: <1181829227.3478.3.camel@localhost.localdomain> <7d5a202f0706141844l3c75b556hdbecbcd5a43747c9@mail.gmail.com> <4671F184.2020401@lerdorf.com> <6sof73dj69ldpspfc5ukrc58qr9ckbin2b@4ax.com> <4677E7B1.2080305@lerdorf.com> <4677F5FB.1070206@lerdorf.com> <4678252F.2050803@sci.fi> <46783212.4020900@lerdorf.com> <34654.216.230.84.67.1183064088.squirrel@www.l-i-e.com> <54557.78.61.224.253.1183098089.squirrel@avilys.eik.lt> <2159.24.1.37.132.1183693437.squirrel@www.l-i-e.com> <468DDFEB.3080404@zend.com> <2031.24.1.37.132.1183965946.squirrel@www.l-i-e.com> <43868.195.22.180.233.1183968428.squirrel@avilys.eik.lt> Date: Wed, 11 Jul 2007 20:12:01 -0500 (CDT) To: "Tomas Kuliavas" Cc: internals@lists.php.net Reply-To: ceo@l-i-e.com User-Agent: Hostbaby Webmail MIME-Version: 1.0 Content-Type: text/plain;charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal Subject: Re: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6? From: ceo@l-i-e.com ("Richard Lynch") On Mon, July 9, 2007 3:07 am, Tomas Kuliavas wrote: >>>>> Unicode code points can be defined with \u, but PHP6 breaks >>>>> existing octal and hex escape sequences. >>> >>> I don't understand what this means... >> >> I think I know... >> >> I have code like this, somewhere: >> >> if (preg_match("|[\xF0-\xFF]|", $data)){ >> $data = un_microsuck($data); >> } >> >> un_microsuck() basically detects and converts any of the goof-ball >> extended ASCII from MS products (Word, Outlook, etc) to an HTML >> equivalent character. >> >> But now \xF0 isn't going to be ASCII 128 anymore, is it? > > \xF0 never was ASCII. ASCII (ISO-646) is 7bit character set. \xF0 is > decimal 240. It is 8bit. Don't tell me. Tell Microsoft. Cuz I sure as heck get a LOT of input data >> \x7f and I have to do something reasonable with it... And I did say "extended ASCII" in the other paragraph, after all... >> Or maybe \xF0 will "work" but the octal \360 won't? > > Are you sure that you can't do that by setting > unicode.something_encoding > to iso-8859-1 or windows-1252? I dunno. Doesn't really matter if I can't set those in .htaccess, that's for sure. [joke type="semi"] All this working going into Unicode, and nobody is pushing to replace (CR|CRLF|LF) with a new Unicode all-platform newline character? [/joke] -- Some people have a "gift" link here. Know what I want? I want you to buy a CD from some indie artist. http://cdbaby.com/browse/from/lynch Yeah, I get a buck. So?