Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:78202 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 41106 invoked from network); 21 Oct 2014 13:53:07 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 21 Oct 2014 13:53:07 -0000 Authentication-Results: pb1.pair.com header.from=php@beccati.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=php@beccati.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain beccati.com designates 176.9.114.167 as permitted sender) X-PHP-List-Original-Sender: php@beccati.com X-Host-Fingerprint: 176.9.114.167 spritz.beccati.com Received: from [176.9.114.167] ([176.9.114.167:43034] helo=mail.beccati.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 59/57-02077-24566445 for ; Tue, 21 Oct 2014 09:53:06 -0400 Received: (qmail 6586 invoked from network); 21 Oct 2014 13:53:02 -0000 Received: from home.beccati.com (HELO ?192.168.1.202?) (88.149.176.119) by mail.beccati.com with SMTP; 21 Oct 2014 13:53:02 -0000 Message-ID: <54466532.2060601@beccati.com> Date: Tue, 21 Oct 2014 15:52:50 +0200 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:31.0) Gecko/20100101 Thunderbird/31.2.0 MIME-Version: 1.0 To: internals@lists.php.net References: <1413875212.2624.3.camel@localhost.localdomain> <54463A69.8000300@beccati.com> <54465CE8.7080207@lsces.co.uk> In-Reply-To: <54465CE8.7080207@lsces.co.uk> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Subject: Re: [PHP-DEV] [RFC] UString From: php@beccati.com (Matteo Beccati) On 21/10/2014 15:17, Lester Caine wrote: > On 21/10/14 11:50, Matteo Beccati wrote: >> However, doesn't ICU use UTF-16 by default which is undesirable as most >> of the times it requires converting from and to UTF-8? > > http:// userguide.icu-project.org/strings/utf-8 > It is interesting that the earlier adoption of UTF-16 still prevails, > but switching to UTF-8 is becoming the norm? Yes, as far as I knew using UTF-8 by default was a compile-time option for ICU, that most of the times comes from system packages. Cheers -- Matteo Beccati Development & Consulting - http://www.beccati.com/