Newsgroups: php.internals
Date: Wed, 16 Jul 2025 13:25:06 +0100 (BST)
From: derick@php.net (Derick Rethans)
To: youkidearitai
Cc: php internals
Subject: Re: [PHP-DEV] Oniguruma maintenance was ended on 2025-04-24
Message-ID: <2ab3d618-e7a5-9593-66d9-f6302a1ad28c@php.net>
List-Id: internals.lists.php.net

On Wed, 30 Apr 2025, youkidearitai wrote:

> Hi, Internals
>
> Oniguruma(鬼車) maintenance was ended on April 24, 2025.
> https://github.com/kkos/oniguruma
> This library uses mbregex in php-src.
>
> There is forked library in Onigmo(鬼雲).
> https://github.com/k-takata/Onigmo
>
> How do we do that?
> - Move to Onigmo
> - Stay in Oniguruma
> - Deprecate mbregex functions
>
> I created issue in php-src too.
> https://github.com/php/php-src/issues/18467

Considering that (I hope/think) most developers have moved to UTF-8 for
their encoding, how useful is it still to have a separate (and
not-compatible-with-PCRE) regular expression engine?

I don't know how much Oniguruma adds on top of PCRE, but PCRE has also
had significant improvements for UTF-8 encoded strings since we first
added mbstring/mbregex.

Wouldn't a replacement for:

	mb_regex_encoding($fromEncoding);
	mb_ereg_match($pattern, $string);

be:

	preg_match($pattern, iconv($fromEncoding, 'UTF-8', $string));

cheers,
Derick

-- 
https://derickrethans.nl | https://xdebug.org | https://dram.io
Author of Xdebug. Like it? Consider supporting me: https://xdebug.org/support
mastodon: @derickr@phpc.social @xdebug@phpc.social
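
For illustration only, here is a rough sketch of how the suggested replacement might look as a helper. The function name utf8_regex_match() and its signature are hypothetical and not part of php-src. It assumes the pattern is written in the same encoding as the subject, that it does not contain the "~" character (used here as the PCRE delimiter), and that the pattern's syntax is valid for PCRE as well as for Oniguruma, which will not hold for every pattern.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper: match $subject (encoded in $fromEncoding) against
 * a delimiter-less pattern by converting both to UTF-8 and using PCRE.
 */
function utf8_regex_match(string $pattern, string $subject, string $fromEncoding): bool
{
    // Convert pattern and subject to UTF-8 so PCRE's /u mode can be used.
    $utf8Pattern = iconv($fromEncoding, 'UTF-8', $pattern);
    $utf8Subject = iconv($fromEncoding, 'UTF-8', $subject);

    // iconv() returns false when the input is not valid in $fromEncoding.
    if ($utf8Pattern === false || $utf8Subject === false) {
        return false;
    }

    // mbregex patterns carry no delimiters, but preg_match() requires them.
    // Assumption: the pattern itself does not contain "~".
    $delimited = '~' . $utf8Pattern . '~u';

    // preg_match() returns 1 on a match, 0 on no match, false on error.
    return preg_match($delimited, $utf8Subject) === 1;
}

// Example call with an ISO-8859-1 encoded subject ("h\xE9llo" is "héllo"):
// utf8_regex_match('^h.llo$', "h\xE9llo", 'ISO-8859-1');
```

Converting the pattern as well as the subject matters whenever the pattern contains non-ASCII characters. Any remaining differences between Oniguruma and PCRE syntax would still have to be handled per pattern.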