Newsgroups: php.internals
Xref: news.php.net php.internals:119160
Date: Thu, 15 Dec 2022 20:19:32 +0000
From: bukka@php.net (Jakub Zelenka)
To: "Christoph M. Becker"
Cc: Derick Rethans, PHP Developers Mailing List
Subject: Re: [PHP-DEV] Re: [RFC] Unicode Text Processing

On Thu, Dec 15, 2022 at 4:56 PM Christoph M. Becker wrote:

> On 15.12.2022 at 16:34, Derick Rethans wrote:
>
> > I have just published an initial draft of the "Unicode Text Processing"
> > RFC, a proposal to have performant unicode text processing always
> > available to PHP users, by introducing a new "Text" class.
> >
> > You can find it at:
> > https://wiki.php.net/rfc/unicode_text_processing
> >
> > I'm looking forwards to hearing your opinions, additions, and
> > suggestions - the RFC specifically asks for these in places.
>
> | As the implementation requires ICU, this would also mean that PHP will
> | depend on the ICU library.
>
> Our current stance is that a minimal PHP should be buildable without
> requiring any "non-standard" libraries; this is the reason why we bundle
> PCRE. If we wanted to stick with that policy, we would need to bundle
> ICU, what might not be the best idea - it's generally not great to have
> bundled libraries which are still maintained outside of php-src, and
> especially for such huge libraries.

I agree with this. Bundling ICU doesn't seem like a good idea. Wouldn't it
be better to base this on something smaller that can be bundled and still
does the job? For example, NJS and QuickJS use their own implementations,
which seem to work fine. In particular,
https://github.com/bellard/quickjs/blob/master/libunicode.c
seems like something that we could potentially fork and maintain.

Cheers

Jakub