Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:38163 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 68926 invoked from network); 11 Jun 2008 12:32:16 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 11 Jun 2008 12:32:16 -0000 Authentication-Results: pb1.pair.com smtp.mail=johannes@schlueters.de; spf=permerror; sender-id=unknown Authentication-Results: pb1.pair.com header.from=johannes@schlueters.de; sender-id=unknown Received-SPF: error (pb1.pair.com: domain schlueters.de from 83.243.58.163 cause and error) X-PHP-List-Original-Sender: johannes@schlueters.de X-Host-Fingerprint: 83.243.58.163 mail4.netbeat.de Received: from [83.243.58.163] ([83.243.58.163:43907] helo=mail4.netbeat.de) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 22/B5-26183-EC5CF484 for ; Wed, 11 Jun 2008 08:32:16 -0400 Received: (qmail 2814 invoked by uid 507); 11 Jun 2008 12:32:10 -0000 Received: from unknown (HELO ?192.168.1.100?) (postmaster%schlueters.de@88.217.44.212) by mail4.netbeat.de with ESMTPA; 11 Jun 2008 12:32:10 -0000 To: "Texin, Tex" Cc: Scott MacVicar , Nuno Lopes , internals@lists.php.net, Michal Dziemianko In-Reply-To: <819912BDAE6BCB4097883B226DA473B10B0AC8B4@SACEXMV02.hq.netapp.com> References: <7E62CA6E-83F4-4F9C-86FB-75EBE7D489C9@gmail.com> <484D36EB.9080202@macvicar.net> <819912BDAE6BCB4097883B226DA473B10B0AC8B4@SACEXMV02.hq.netapp.com> Content-Type: text/plain Date: Wed, 11 Jun 2008 14:32:09 +0200 Message-ID: <1213187529.21247.9.camel@goldfinger.johannes.nop> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-4.fc8) Content-Transfer-Encoding: 7bit Subject: RE: [PHP-DEV] Algorithm Optimizations - string search From: johannes@schlueters.de (Johannes =?ISO-8859-1?Q?Schl=FCter?=) Hi, On Wed, 2008-06-11 at 01:01 -0700, Texin, Tex wrote: > When I looked at the code, I assumed that it wasn't intended for international use > I'll have to go back and look to give you details, but it doesn't work for international use or unicode. > It would be ok for 8859-1. That's the default case in PHP < 6, in current PHP versions all string operations use on "binary" strings, so all references to offset work on byte not character base. That's one of the main reasons for PHP 6. johannes