Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:106038 Return-Path: Delivered-To: mailing list internals@lists.php.net Received: (qmail 77529 invoked from network); 22 Jun 2019 23:28:33 -0000 Received: from unknown (HELO mail-lj1-f170.google.com) (209.85.208.170) by pb1.pair.com with SMTP; 22 Jun 2019 23:28:33 -0000 Received: by mail-lj1-f170.google.com with SMTP id a21so8992084ljh.7 for ; Sat, 22 Jun 2019 13:44:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=vIHY6sDljw20NUryrL4Xk7wxwaBnLsJwxD4lcYTDito=; b=sLU8SXmXfuzCwfWQ2BZHJ9Wo6Jz6HiV+q3QNdVS2VQqRG2iJrmntL9g3v+Oq3kQH0Y X/RmFzjKLD3B/TgcwjgFTs1gYtJxTrQZIytcyv22u8tjuhT4KL8/8N/LA3RTWuxt4QI/ qm1YxF71TlEBIuQQ7A2Wkphnjil6DepaHbPUCCItuVZXoF0bfA22meheUnfWTVHLrQoq 9HjNoSejoC9vVTjOPBCDsfHrJiAP9YofBrwAyfnAQwdtmGsqCKuakAIiHirIQu96oIBg mAXVfEYNOe1MEzTk7IFr/dfHZcNY4kOTo5Zxq+5rgnB0WsKVte39UZRNwjiYBFANI4a3 q0Kw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=vIHY6sDljw20NUryrL4Xk7wxwaBnLsJwxD4lcYTDito=; b=Ui/txQNQw6YLMcOTD94i1MNvzNWP8Y7OlXmRDJsmb1l3rTjeNuhGhdEQ5ytL832JFw UnGaPiLeC4jaj4p3V1aZ7RNsBNIt5z+0tK28YVDgRR715GCyUM48pqx9C7XYvJngNAT+ ajWV0X7FJZt/3CFuXWJEqud6sDxgthaJH26Ykl0Y52VcVksPt1S6v0CIjvKmDuuTQW+N xZWTY3yA4bdVZi+yA1vZQDG8BENsT6+s7duQgsrjM8bGH2V0UHuRnEHAw6ymGzcglrrd qvNwzLV2cXT5jAGlc2b+Nh5ZVUZxrl0RqdZRlRRZ52QEwlri/2/hzXbQQa6Y5/Xs1Azj S10g== X-Gm-Message-State: APjAAAXK85Et1iZAxR/PoU5pRcloZzX8i1yXJq+MGJAaqgdOqHlgeCYM CgT0gGl+1NGlfLjY1oOdST9pQMAr6qleHFUJyBU= X-Google-Smtp-Source: APXvYqwnps37QkoQo1SirwkmdNBWLfFGZwVNM6/UK8x1lB5HOrEADDScT9AI9gE4t8U1eQUMW2+dnfRmf6VscUgMZho= X-Received: by 2002:a2e:7c15:: with SMTP id x21mr13259986ljc.55.1561236241058; Sat, 22 Jun 2019 13:44:01 -0700 (PDT) MIME-Version: 1.0 References: <8442f1fa5544b2ca03e7cebbc64e8e5c@wkhudgins.info> <683c5da474e13283030cac3d0c0ec080@wkhudgins.info> <2c37999d1e5372ae6ab48bfce5420796@wkhudgins.info> In-Reply-To: Date: Sat, 22 Jun 2019 22:43:43 +0200 Message-ID: To: Bruce Weirdan Cc: will@wkhudgins.info, PHP internals Content-Type: multipart/alternative; boundary="0000000000008f0fa7058befa30b" Subject: Re: [PHP-DEV] [RFC] Desire to move RFC add_str_begin_and_end_functions to a vote From: nikita.ppv@gmail.com (Nikita Popov) --0000000000008f0fa7058befa30b Content-Type: text/plain; charset="UTF-8" On Sat, Jun 22, 2019 at 10:27 PM Bruce Weirdan wrote: > On Sat, Jun 22, 2019 at 6:32 PM Nikita Popov wrote: > > > > The normal str_starts_with() function is perfectly safe to use on UTF-8 > strings, > > Only if you assume strings to be normalized to the same form. Checking if > NFC > string starts with NFD substring by checking them bit by bit is going > to yield false negatives [1] > > [1] https://3v4l.org/4HgUL > That's correct, but not really relevant in the context of the discussion, as mbstring does not perform Unicode normalization, so mb_* functions wouldn't change anything about this. (Not that basic string operations should be performing implicit Unicode normalization...) Nikita --0000000000008f0fa7058befa30b--