Newsgroups: php.internals
List-Id: internals.lists.php.net
From: nicolas.grekas+php@gmail.com (Nicolas Grekas)
Date: Mon, 10 Mar 2025 09:25:43 +0100
Subject: Re: [PHP-DEV][RFC][VOTE] Add mb_levenshtein function
To: youkidearitai
Cc: php internals

> On Sat, 8 Mar 2025 at 19:06, Niels Dossche <dossche.niels@gmail.com> wrote:
> >
> > On 08/03/2025 03:30, youkidearitai wrote:
> > > Hi, Internals
> > >
> > > The vote on adding mb_levenshtein has ended, and the RFC was declined.
> > > The result was 1 yes and 5 no.
> > >
> > > Thank you very much for voting.
> > >
> > > By the way, does this result mean I should add grapheme_levenshtein
> > > instead of mb_levenshtein?
> > > Or is there nothing to do?
> > > Feel free to comment.
> > >
> > > Thank you again.
> > > Yuya.
> > >
> >
> > Hi Yuya
> >
> > I think an RFC for grapheme_levenshtein would be better; it would have my vote at least.
> > Levenshtein makes more sense on graphemes than on Unicode code points.
> >
> > Kind regards
> > Niels
>
> Hi, Niels
>
> Thank you very much for the reply.
> Okay. I will go ahead with a grapheme_levenshtein RFC.
>
> Kind regards
> Yuya


On my side, I'm not sure this would make sense.
There's already a PHP implementation of the (Damerau-)Levenshtein algorithm:
https://packagist.org/packages/oefenweb/damerau-levenshtein

This might be good enough. Better to leave clustering (graphemes, etc.) as a separate concern. Did you consider this option?
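
To illustrate what keeping clustering as a separate concern could look like, here is a minimal sketch (in Python, purely for illustration; it is not the API of the package linked above). The distance function accepts any sequence of units, and the caller decides how the string is split. `simple_clusters` is a hypothetical helper that only attaches combining marks to the preceding character; it is an assumption standing in for real grapheme segmentation (UAX #29), which it does not implement.

```python
import unicodedata
from typing import List, Sequence


def levenshtein(a: Sequence[str], b: Sequence[str]) -> int:
    """Edit distance between two sequences of already-split units."""
    previous = list(range(len(b) + 1))
    for i, unit_a in enumerate(a, start=1):
        current = [i]
        for j, unit_b in enumerate(b, start=1):
            cost = 0 if unit_a == unit_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def simple_clusters(text: str) -> List[str]:
    """Hypothetical, simplified clustering (NOT full UAX #29):
    glue each combining mark onto the preceding base character."""
    clusters: List[str] = []
    for ch in text:
        if clusters and unicodedata.combining(ch):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters


decomposed = "ne\u0301"  # 'n', 'e', U+0301 COMBINING ACUTE ACCENT
plain = "no"

# Same algorithm, two different splitting strategies chosen by the caller.
by_code_point = levenshtein(list(decomposed), list(plain))
by_cluster = levenshtein(simple_clusters(decomposed), simple_clusters(plain))
```

For this input the same routine yields 2 when fed code points and 1 when fed clusters, which is the code point versus grapheme difference discussed in this thread; only the splitting step changes, not the distance algorithm.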

Nicolas