Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:124990 X-Original-To: internals@lists.php.net Delivered-To: internals@lists.php.net Received: from php-smtp4.php.net (php-smtp4.php.net [45.112.84.5]) by qa.php.net (Postfix) with ESMTPS id 3D64D1A00BD for ; Fri, 16 Aug 2024 19:37:05 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=php.net; s=mail; t=1723837132; bh=1kvcdLWSFjrr/Knx67DD/5lfQwowWLKL/7w2RhH6en0=; h=Date:From:To:Subject:In-Reply-To:References:From; b=SaQ4regdH08Fnlu+CjL2d2XkPMfUHbcarAYgN+85STFx+NM7QHKWJAyWnA2DqdRov xW/rq/KXsw5Kd7QXqyuWmojfzRxAorqvEutUOJct7WmcUceRsBFZluvDX1cgW50o3+ wSXWu4WThiiiuqe2xD4xKE9UHX7MfQ6QHhBnq6RMLWpohG5g0NB0ODbR67y/dLj7vB rNiT9LEy9Pon33J5KcH7jp0pRFu8fM0tPObT7fTggZVum+WO4o5U6WyfV78+j+ebQY a6EJRs9vgrmWItpl/CCFsGd2fvm3H/UlWimkoJuhg2FQODPaisR3NajV26N1VWMqaH cqv9Zo/qHx//A== Received: from php-smtp4.php.net (localhost [127.0.0.1]) by php-smtp4.php.net (Postfix) with ESMTP id 5321B1801D7 for ; Fri, 16 Aug 2024 19:38:51 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 4.0.0 (2022-12-13) on php-smtp4.php.net X-Spam-Level: X-Spam-Status: No, score=-0.1 required=5.0 tests=BAYES_50,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,DMARC_MISSING,RCVD_IN_DNSWL_LOW, SPF_HELO_PASS,SPF_PASS autolearn=no autolearn_force=no version=4.0.0 X-Spam-Virus: No X-Envelope-From: Received: from fhigh8-smtp.messagingengine.com (fhigh8-smtp.messagingengine.com [103.168.172.159]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by php-smtp4.php.net (Postfix) with ESMTPS for ; Fri, 16 Aug 2024 19:38:50 +0000 (UTC) Received: from phl-compute-06.internal (phl-compute-06.nyi.internal [10.202.2.46]) by mailfhigh.nyi.internal (Postfix) with ESMTP id 14AF111471B9 for ; Fri, 16 Aug 2024 15:37:03 -0400 (EDT) Received: from phl-mailfrontend-01 ([10.202.2.162]) by phl-compute-06.internal (MEProxy); Fri, 16 Aug 2024 15:37:03 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=rwec.co.uk; h=cc :content-transfer-encoding:content-type:content-type:date:date :from:from:in-reply-to:in-reply-to:message-id:mime-version :references:reply-to:subject:subject:to:to; s=fm3; t=1723837023; x=1723923423; bh=yVRfACP4BEqZ67qk2Ah9TDCxD4BdQ8Hgm0MXYBrwgXk=; b= SwtkXp7k4pbPqkBPSRFMMIshhKQj8I0nRTLYyyPKQSe0xacGS8j8XEHqlqZotx5A 7NCka5zGlkytdS45VQX1/xK+KCBtVbsa+46m5nuXtPnRN38fCZTAoHqrlz7OTHHj Y/Ewirktfuis8sjy91d+nydSn2L7HULO1N2apJFkQ1OjQnf0npU/1XrgmrtUkFbs x8Gf13SKr/WqVDFaDG+x2hY9pog3Hs5+BVldBM/Ios03RqVtjL2z1jGRrZNUYqgU DY5kftdtuFGx23hn0H5sMRrcLfvMvCPcVy9P9koKCZet1Wy03JCU2i5u5Q4wG6dD lsPgdwF7vr5hwjNuO62zpg== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :content-type:date:date:feedback-id:feedback-id:from:from :in-reply-to:in-reply-to:message-id:mime-version:references :reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy :x-me-sender:x-me-sender:x-sasl-enc; s=fm3; t=1723837023; x= 1723923423; bh=yVRfACP4BEqZ67qk2Ah9TDCxD4BdQ8Hgm0MXYBrwgXk=; b=J ag+gjtiJnEJE9f54O9dcfSC7PyU4Wfg0wjgYLp25+Awyngexg3RTd1fZnQ4b+s8T Y3JRiAJGabld0nQAfJOh6RXPiQHxztxZ+aMG4Elko33YucpAlK+wzT+lqXkLsXnR snkIchMMYz30A1FULJtOCaCUs/CkpnsEPDCd2pXLTXEU7lEfPAzJEzH7IeAKI/ny V+wOIknXb6CTvAVDPyTzHWyvrCd1vtsFZI5FJjME3ke85a3RMyUN9Pq51vrtgpfA ahhLVTWPMCXCtRfmrRrKzjQKgpuY2RE5Ap1Hgwk/advxZB6xS2Z7Wh1bzIaDEk7p M9qGjZE9BFuAUdyJuULHg== X-ME-Sender: X-ME-Received: X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddruddtkedgudefjecutefuodetggdotefrod ftvfcurfhrohhfihhlvgemucfhrghsthforghilhdpggftfghnshhusghstghrihgsvgdp uffrtefokffrpgfnqfghnecuuegrihhlohhuthemuceftddtnecunecujfgurhepfffhvf fufggjfhfkgggtgfesthhqmhdttderjeenucfhrhhomhepfdftohifrghnucfvohhmmhhi nhhsucglkffoufhorfgnfdcuoehimhhsohhprdhphhhpsehrfigvtgdrtghordhukheqne cuggftrfgrthhtvghrnhepheelffetiefgveduteefudegtdduveeludegueegleehiefh hefgtdekveevgfelnecuvehluhhsthgvrhfuihiivgeptdenucfrrghrrghmpehmrghilh hfrhhomhepihhmshhophdrphhhphesrhifvggtrdgtohdruhhkpdhnsggprhgtphhtthho pedupdhmohguvgepshhmthhpohhuthdprhgtphhtthhopehinhhtvghrnhgrlhhssehlih hsthhsrdhphhhprdhnvght X-ME-Proxy: Feedback-ID: id5114917:Fastmail Received: by mail.messagingengine.com (Postfix) with ESMTPA for ; Fri, 16 Aug 2024 15:37:02 -0400 (EDT) Date: Fri, 16 Aug 2024 20:36:59 +0100 To: internals@lists.php.net Subject: =?US-ASCII?Q?Re=3A_=5BPHP-DEV=5D=5BDiscussion=5D_Should_All_S?= =?US-ASCII?Q?tring_Functions_Become_Multi-Byte_Safe=3F?= User-Agent: K-9 Mail for Android In-Reply-To: <270D6057-626D-4720-B44A-3CB7A7B9320B@newclarity.net> References: <1AFE8300-D363-43D8-A989-15D001B9879C@newclarity.net> <270D6057-626D-4720-B44A-3CB7A7B9320B@newclarity.net> Message-ID: Precedence: bulk list-help: list-post: List-Id: internals.lists.php.net x-ms-reactions: disallow MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable From: imsop.php@rwec.co.uk ("Rowan Tommins [IMSoP]") On 16 August 2024 19:44:22 BST, Mike Schinkel wrot= e: >Let me see if I understand your argument correctly? You are asserting th= at Unicode is "too complex" to be handled in the standard library so that c= omplexity should instead be shouldered individually by each and every PHP d= eveloper who needs to work with Unicode text in PHP, which is many PHP deve= lopers if not eventually most=2E Is that your argument? Not really, no=2E I'm definitely in favour of including more Unicode-based= string handling functionality, by improving and extending ext/intl, or com= ing up with new convenience wrappers for common tasks=2E=20 What I'm always sceptical of is the idea that you could ever consider such= functionality "complete", or that "Unicode support" can ever be a single d= eliverable, rather than an ongoing aspiration=2E (And consequently, I'm sce= ptical of any language which says it has achieved that=2E) I also think "Unicode support" is probably the wrong angle to approach fro= m; it leads to features like IntlChar, which technically provides access to= tons of data from the Unicode standard, but practically has no use for 99%= of PHP developers=2E Instead we should be talking about "internationalisat= ion support", of which handling different writing systems is one (fairly bi= g) part=2E For instance, I would welcome proposals like "here's some functions for ha= ndling locale-specific case folding and normalisation-based matching", "her= e's some functions for limiting the storage size of a string without produc= ing garbage characters", etc=2E As well as related things which aren't just= about text encoding, like "here's some functions for working with locale-s= pecific date formatting" (or even just "here's some documentation for how y= ou're supposed to use ext/intl's date classes")=2E >> We also have the "mbstring" extension, =2E=2E=2E > >Interesting historical factoid, but how is that really relevant to includ= ing Unicode into the standard library? I was just summarising the current situation, to work out where we could g= o next=2E Any attempt to extend string handling functionality is likely to = build on either ext/intl or ext/mbstring, so it's useful to understand how = they differ=2E Regards, Rowan Tommins [IMSoP]