Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:29630 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 32863 invoked by uid 1010); 21 May 2007 17:10:52 -0000 Delivered-To: ezmlm-scan-internals@lists.php.net Delivered-To: ezmlm-internals@lists.php.net Received: (qmail 32848 invoked from network); 21 May 2007 17:10:51 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 21 May 2007 17:10:51 -0000 Authentication-Results: pb1.pair.com header.from=andrei@gravitonic.com; sender-id=unknown Authentication-Results: pb1.pair.com smtp.mail=andrei@gravitonic.com; spf=permerror; sender-id=unknown Received-SPF: error (pb1.pair.com: domain gravitonic.com from 204.11.219.139 cause and error) X-PHP-List-Original-Sender: andrei@gravitonic.com X-Host-Fingerprint: 204.11.219.139 mail.lerdorf.com Received: from [204.11.219.139] ([204.11.219.139:55260] helo=mail.lerdorf.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 98/1E-03101-A92D1564 for ; Mon, 21 May 2007 13:10:51 -0400 Received: from [192.168.1.166] (adsl-75-57-244-158.dsl.snfc21.sbcglobal.net [75.57.244.158]) (authenticated bits=0) by mail.lerdorf.com (8.14.1/8.14.1/Debian-2) with ESMTP id l4LHAkbe015676; Mon, 21 May 2007 10:10:47 -0700 In-Reply-To: <35054.88.118.163.159.1179589687.squirrel@avilys.eik.lt> References: <51491.88.118.163.159.1179577357.squirrel@avilys.eik.lt> <464EEF4B.1030002@zend.com> <40865.88.118.163.159.1179583186.squirrel@avilys.eik.lt> <464F090A.9090200@zend.com> <35054.88.118.163.159.1179589687.squirrel@avilys.eik.lt> Mime-Version: 1.0 (Apple Message framework v752.2) X-Priority: 3 (Normal) Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Message-ID: Cc: "Antony Dovgal" , internals@lists.php.net Content-Transfer-Encoding: 7bit Date: Mon, 21 May 2007 10:10:46 -0700 To: Tomas Kuliavas X-Mailer: Apple Mail (2.752.2) X-Virus-Scanned: ClamAV 0.90.2/3274/Mon May 21 08:19:42 2007 on colo.lerdorf.com X-Virus-Status: Clean Subject: Re: [PHP-DEV] PHP Unicode extension in PHP6 From: andrei@gravitonic.com (Andrei Zmievski) This is by design. If you prefer to work with actual bytes, use binary strings or literals. In unicode strings \xC4 is actually a codepoint (UTF-16 codepoint) specifying character U+00C4. -Andrei On May 19, 2007, at 8:48 AM, Tomas Kuliavas wrote: > strlen("\xC4\x85") = 2. strlen((binary)"\xC4\x85") = 4. Not good. > It is > one character in utf-8.