Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:30875 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 96353 invoked by uid 1010); 12 Jul 2007 21:46:33 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 96295 invoked from network); 12 Jul 2007 21:46:32 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 12 Jul 2007 21:46:32 -0000 Authentication-Results: pb1.pair.com header.from=stas@zend.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=stas@zend.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain zend.com designates 63.205.162.114 as permitted sender) X-PHP-List-Original-Sender: stas@zend.com X-Host-Fingerprint: 63.205.162.114 unknown Windows 2000 SP4, XP SP1 Received: from [63.205.162.114] ([63.205.162.114:2609] helo=us-ex1.zend.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 67/A3-59331-431A6964 for ; Thu, 12 Jul 2007 17:46:30 -0400 Received: from [127.0.0.1] ([192.168.16.180]) by us-ex1.zend.com with Microsoft SMTPSVC(6.0.3790.1830); Thu, 12 Jul 2007 14:46:22 -0700 Message-ID: <4696A129.1040003@zend.com> Date: Thu, 12 Jul 2007 14:46:17 -0700 Organization: Zend Technologies User-Agent: Thunderbird 2.0.0.4 (Windows/20070604) MIME-Version: 1.0 To: Larry Garfield CC: internals@lists.php.net References: <1181829227.3478.3.camel@localhost.localdomain> <200707101906.30925.larry@garfieldtech.com> <2237.24.1.37.132.1184204516.squirrel@www.l-i-e.com> <200707112058.42229.larry@garfieldtech.com> In-Reply-To: <200707112058.42229.larry@garfieldtech.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-OriginalArrivalTime: 12 Jul 2007 21:46:22.0130 (UTC) FILETIME=[1663E120:01C7C4CE] Subject: Re: [PHP-DEV] What is the use of "unicode.semantics" in PHP 6? From: stas@zend.com (Stanislav Malyshev) > AFAIK, UTF-8 is backward compatible with ASCII. UTF-16 is not. That's why Well, with 7-bit ASCII - yes. With 8-bit "extended ASCII", whatever that means - not exactly. You can have 8-bit strings that aren't valid UTF-8 and can't be translated to UTF-8 without specifying the encoding (iso-889-1 or something like that). -- Stanislav Malyshev, Zend Software Architect stas@zend.com http://www.zend.com/ (408)253-8829 MSN: stas@zend.com