Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:55169 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 81061 invoked from network); 4 Sep 2011 05:13:09 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 4 Sep 2011 05:13:09 -0000 Authentication-Results: pb1.pair.com smtp.mail=smalyshev@sugarcrm.com; spf=pass; sender-id=pass Authentication-Results: pb1.pair.com header.from=smalyshev@sugarcrm.com; sender-id=pass Received-SPF: pass (pb1.pair.com: domain sugarcrm.com designates 67.192.241.143 as permitted sender) X-PHP-List-Original-Sender: smalyshev@sugarcrm.com X-Host-Fingerprint: 67.192.241.143 smtp143.dfw.emailsrvr.com Linux 2.6 Received: from [67.192.241.143] ([67.192.241.143:59140] helo=smtp143.dfw.emailsrvr.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id A1/66-34132-4E8036E4 for ; Sun, 04 Sep 2011 01:13:09 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp24.relay.dfw1a.emailsrvr.com (SMTP Server) with ESMTP id D225F1800CA; Sun, 4 Sep 2011 01:13:05 -0400 (EDT) X-Virus-Scanned: OK Received: by smtp24.relay.dfw1a.emailsrvr.com (Authenticated sender: smalyshev-AT-sugarcrm.com) with ESMTPSA id 484401800C7; Sun, 4 Sep 2011 01:13:05 -0400 (EDT) Message-ID: <4E6308E0.5030508@sugarcrm.com> Date: Sat, 03 Sep 2011 22:13:04 -0700 Organization: SugarCRM User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.6; rv:6.0.1) Gecko/20110830 Thunderbird/6.0.1 MIME-Version: 1.0 To: Rui Hirokawa , Moriyoshi Koizumi CC: PHP Internals Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: mbstring failures From: smalyshev@sugarcrm.com (Stas Malyshev) Hi! In my tests, I have some test failures in mbstring extensions, namely: Bug #28220 (mb_strwidth() returns wrong width values for some Hangul characters) [/Users/smalyshev/php-src/branches/PHP_5_4/ext/mbstring/tests/bug28220.phpt] Bug #49536 (mb_detect_encoding() returns incorrect results when strict_mode is turned on) [/Users/smalyshev/php-src/branches/PHP_5_4/ext/mbstring/tests/bug49536.phpt] Unicode standard conformance test (ill-formed UTF sequences.) [/Users/smalyshev/php-src/branches/PHP_5_4/ext/mbstring/tests/illformed_utf_sequences.phpt] The mb_strwidth() returns 8085 instead of 8101. The mb_detect_encoding() returns false for both utf-8 examples and as far as I can see in the code of mbfl_filt_ident_utf8() it is intended result - this sequence is not considered valid utf-8. For the illformed sequences I see many sequences that are supposed to be empty strings be 0000003f0000003f0000003f0000003f0000003f0000003f or more 3f's of different lengths. Could you please check it out and see why these are failing? -- Stanislav Malyshev, Software Architect SugarCRM: http://www.sugarcrm.com/ (408)454-6900 ext. 227