From: gwynne@darkrainfall.org (Gwynne Raskind)
To: Alexey Zakhlestin
Cc: internals@lists.php.net
Date: Thu, 30 Jul 2009 10:14:01 -0400
Subject: Re: [PHP-DEV] Alternative mbstring implementation using ICU
Newsgroups: php.internals

On Jul 30, 2009, at 7:12 AM, Alexey Zakhlestin wrote:

>>> Implemented functions:
>>>
>>> - mb_ereg()
>>> - mb_ereg_replace()
>> as ereg functions are deprecated in 5.3, are these still needed?
> these have nothing in common with "those" ereg functions. these are
> based on onuguruma regex library
> http://www.geocities.jp/kosako3/oniguruma/

I find Oniguruma to be, in general, a pared-down and less useful version of the PCRE we already have. Given that PCRE has full support for UTF-8, and that there's nothing you can do with Oniguruma that you can't also practically do with PCRE (to the best of my knowledge), I think it would be best for PHP to standardize on a single regexp library rather than offering competing and confusing options. Killing off POSIX syntax was a step in that direction, and I see no reason not to take the rest of the steps.

If Oniguruma were offered as a PECL extension, I would think that perfectly reasonable, but I don't think it belongs in core.

-- Gwynne