Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:18815 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 75897 invoked by uid 1010); 13 Sep 2005 09:52:10 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 75882 invoked from network); 13 Sep 2005 09:52:10 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 13 Sep 2005 09:52:10 -0000 X-Host-Fingerprint: 82.94.239.5 jdi.jdi-ict.nl Linux 2.5 (sometimes 2.4) (4) Received: from ([82.94.239.5:42114] helo=jdi.jdi-ict.nl) by pb1.pair.com (ecelerity 2.0 beta r(6323M)) with SMTP id A6/8E-58045-841A6234 for ; Tue, 13 Sep 2005 05:52:08 -0400 Received: from localhost (localhost [127.0.0.1]) by jdi.jdi-ict.nl (8.12.11/8.12.11) with ESMTP id j8D9q5aD000861; Tue, 13 Sep 2005 11:52:05 +0200 Received: from localhost (localhost [127.0.0.1]) by jdi.jdi-ict.nl (8.12.11/8.12.11) with ESMTP id j8D9q0Bi000852; Tue, 13 Sep 2005 11:52:01 +0200 Date: Tue, 13 Sep 2005 11:52:00 +0200 (CEST) X-X-Sender: derick@localhost To: Antony Dovgal cc: val khokhlov , internals@lists.php.net In-Reply-To: <43269F21.5030705@zend.com> Message-ID: References: <43215A91.8050409@zend.com> <9CF57DC5-A18B-4264-B20B-8552B0BB66F1@gravitonic.com> <6.2.3.4.2.20050912175136.04449320@localhost> <43268C01.20006@zend.com> <1031242468.20050913123221@vk.kiev.ua> <43269F21.5030705@zend.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Scanned: by amavisd-new at jdi-ict.nl Subject: Re: [PHP-DEV] unserialize() & unicode issues From: derick@php.net (Derick Rethans) On Tue, 13 Sep 2005, Antony Dovgal wrote: > On 13.09.2005 13:32, val khokhlov wrote: > > Hello Antony, > > > > Tuesday, September 13, 2005, 11:21:21 AM, you wrote: > > > > AD> Even if the class name is in Unicode, we can try to convert it to ASCII > > AD> and fail only in the case when we can't find its class entry in the > > AD> list. > > I think, it's not the only way. > > If we don't care about being compatible with previous PHP's > > serialize(), a more portable way is to store class/property names in > > unicode (if unicode_semantics=off when serializing, convert hash keys to > > unicode). Since we do know script encoding, we can always downgrade > > unicoded names into local encoding. > > So you propose to store strings/hash keys/class names in Unicode even if > unicode_semantics is Off ? > It looks like adding unnecessary overhead to me. But needed, as even with the semantics off, you can get unicode strings. Which can end up as array keys. Derick -- Derick Rethans http://derickrethans.nl | http://ez.no | http://xdebug.org