Newsgroups: php.internals
To: internals@lists.php.net
Date: Mon, 22 Mar 2021 17:54:32 +0000
Subject: Re: [PHP-DEV] What should we do with utf8_encode and utf8_decode?
From: rowan.collins@gmail.com (Rowan Tommins)

On 22/03/2021 17:38, Alexandru Pătrănescu wrote:
> As Rowan mentioned, base64_encode would have worked.
> But that means one quarter of the available max column space would be
> lost as a downside.

Depending on the data, abusing Latin1-to-UTF8 translation can easily result in a longer string than base64.

    $str = '🤡🤡';
    echo strlen(base64_encode($str)); // 12
    echo strlen(utf8_encode($str)); // 16

Regards,

-- 
Rowan Tommins
[IMSoP]
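For anyone wanting to check the arithmetic behind the two numbers above, here is a minimal sketch. It assumes the usual padded base64 output (4 characters per 3 input bytes, rounded up) and that Latin-1-to-UTF-8 translation leaves bytes below 0x80 as one byte and turns every other byte into two. The helper names base64Length() and latin1ToUtf8Length() are made up for illustration and are not part of PHP.

```php
<?php
// Hypothetical helper: length of padded base64 output for $bytes input bytes.
function base64Length(int $bytes): int
{
    // Every group of up to 3 input bytes becomes 4 output characters.
    return 4 * intdiv($bytes + 2, 3);
}

// Hypothetical helper: length of $data after Latin-1-to-UTF-8 translation.
function latin1ToUtf8Length(string $data): int
{
    $high = 0;
    for ($i = 0, $n = strlen($data); $i < $n; $i++) {
        // Bytes 0x80..0xFF map to U+0080..U+00FF, which need 2 bytes in UTF-8.
        if (ord($data[$i]) >= 0x80) {
            $high++;
        }
    }
    return strlen($data) + $high;
}

// Two 8-byte samples: plain ASCII, and two U+1F921 characters (4 bytes each).
foreach (['abcdefgh', "\u{1F921}\u{1F921}"] as $sample) {
    printf(
        "%d bytes: base64 %d, Latin-1-to-UTF-8 %d\n",
        strlen($sample),
        base64Length(strlen($sample)),
        latin1ToUtf8Length($sample)
    );
}
// 8 bytes: base64 12, Latin-1-to-UTF-8 8
// 8 bytes: base64 12, Latin-1-to-UTF-8 16
```

Both samples are 8 bytes long, so base64 costs the same for each, while the translated length ranges from no overhead (pure ASCII) to double the input (every byte at or above 0x80, as with the emoji).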