Newsgroups: php.internals
Message-ID: <52FF5E46.1020907@lsces.co.uk>
Date: Sat, 15 Feb 2014 12:32:06 +0000
From: lester@lsces.co.uk (Lester Caine)
To: internals@lists.php.net
Subject: Re: [PHP-DEV] PHP6 wiki page

Pierre Joye wrote:
> On Feb 15, 2014 12:43 PM, "Pierre Joye" wrote:
>>
>> On Feb 15, 2014 2:35 AM, "Rasmus Lerdorf" wrote:
>>> What we really need is an awesome small and fast Unicode library that
>>> does everything ICU does but faster and in less code while using UTF-8
>>> as its internal storage so we don't have to convert on each and every
>>> operation. There are a ton of non-obvious things beyond simple string
>>> manipulation. String collation alone is massively complicated, for
>>> example.
>>
>> http://www.public-software-group.org/utf8proc-documentation looks
>> interesting. There are others, but it has to be chosen very carefully :)
>
> https://github.com/josephg/librope claims to be fast and compliant. Added
> to my list.

If I'm reading that correctly, it does the substring work on strings that
are already UTF-8? So even a UTF-16 string would have to be converted to
UTF-8 first for it to work ... I think?

-- 
Lester Caine - G8HFL
-----------------------------
Contact - http://lsces.co.uk/wiki/?page=contact
L.S.Caine Electronic Services - http://lsces.co.uk
EnquirySolve - http://enquirysolve.com/
Model Engineers Digital Workshop - http://medw.co.uk
Rainbow Digital Media - http://rainbowdigitalmedia.co.uk
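
To make the question above concrete, here is a minimal Python sketch of the
two steps being discussed: a one-off transcode from UTF-16 to UTF-8, followed
by a substring taken directly on the UTF-8 bytes. It is not librope's or
utf8proc's API; the function names utf16_to_utf8 and utf8_substr are
hypothetical, the input is assumed to be valid UTF-16LE without a BOM, and
offsets are assumed to be non-negative and counted in code points (not
grapheme clusters).

```python
def utf16_to_utf8(data: bytes) -> bytes:
    """Transcode UTF-16LE bytes (no BOM assumed) to UTF-8 bytes."""
    return data.decode("utf-16-le").encode("utf-8")


def utf8_substr(buf: bytes, start: int, length: int) -> bytes:
    """Return `length` code points beginning at code point `start`.

    Works on the UTF-8 bytes as stored; assumes `buf` is valid UTF-8
    and that `start` and `length` are non-negative.
    """
    # A byte of the form 0b10xxxxxx is a continuation byte; every other
    # byte marks the start of a code point.
    starts = [i for i, b in enumerate(buf) if (b & 0xC0) != 0x80]
    starts.append(len(buf))  # sentinel: one past the last code point
    lo = min(start, len(starts) - 1)
    hi = min(start + length, len(starts) - 1)
    return buf[starts[lo]:starts[hi]]


if __name__ == "__main__":
    utf16 = "na\u00efve caf\u00e9".encode("utf-16-le")  # data arriving as UTF-16
    utf8 = utf16_to_utf8(utf16)                         # converted once, up front
    print(utf8_substr(utf8, 2, 3).decode("utf-8"))
```

The conversion happens once at the boundary; after that, every substring
call only scans the UTF-8 buffer. This sketch scans from the start of the
buffer on each call, so it is linear in the string length; an indexed
structure would be needed to do better.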