Newsgroups: php.internals
From: stas@zend.com (Stanislav Malyshev)
To: Moriyoshi Koizumi
CC: php-dev
Date: Fri, 31 Jul 2009 09:37:54 -0700
Subject: Re: [PHP-DEV] Re: Alternative mbstring implementation using ICU
Organization: Zend Technologies
Message-ID: <4A731DE2.2060206@zend.com>
References: <4A6C6496.7060603@mozo.jp> <4A71DA47.8080809@zend.com>

Hi!

>> mb_str* - shouldn't you in 6 just convert them to unicode and do all string
>> operations with Unicode strings? Also, in 5 isn't there some intersection
>> with grapheme_* functions?
>
> mb_strwidth() and mb_strimwidth() are not covered.

True. I wonder what these functions are useful for?

>> mb_output_handler - shouldn't setting the proper encoding in 6 do the same job?
>> mb_convert_encoding - don't we already have a number of functions that do encoding conversions?
>
> I don't think It can gracefully handle characters that have no
> corresponding entries in the target character set. I'm even thinking

That's a common problem. IIRC, PHP 6 converters have configurable error
modes for that. Don't unicode_set_error_handler() and
unicode_set_error_mode() do what you want?

-- 
Stanislav Malyshev, Zend Software Architect
stas@zend.com   http://www.zend.com/
(408)253-8829   MSN: stas@zend.com