Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:102940 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 99533 invoked from network); 21 Jul 2018 12:32:36 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 21 Jul 2018 12:32:36 -0000 Authentication-Results: pb1.pair.com header.from=rasmus@lerdorf.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=rasmus@lerdorf.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain lerdorf.com designates 209.85.218.41 as permitted sender) X-PHP-List-Original-Sender: rasmus@lerdorf.com X-Host-Fingerprint: 209.85.218.41 mail-oi0-f41.google.com Received: from [209.85.218.41] ([209.85.218.41:46925] helo=mail-oi0-f41.google.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id C2/66-47674-4E7235B5 for ; Sat, 21 Jul 2018 08:32:36 -0400 Received: by mail-oi0-f41.google.com with SMTP id y207-v6so25906234oie.13 for ; Sat, 21 Jul 2018 05:32:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lerdorf-com.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=n9f8H9rilDeqL2Q/LBgnRdqfit1Hio9yMKHLcqlQG9Q=; b=UUFN6pHHJ8VWIH2WbkufvOlFpv5dI5JxNC7NFju4eT1uq8oTqH/IB0I6zFgobVHfbY CzL5U2hPeL96CEfnwmS3jamEN8TnDEkZgbI4qbedick+oQvLVtSihsfg9C6s4pKnsGAp SqoY50iozEF8yZYCEwCIStLZBX5DetC36hEuSSQDwzx2PetJPalpH6+SKGcRI1qP1HJU xaF9QhVJN4vlzSSmD2dXQtmYjPJPw7Ys0UAPQNF0GarRX6DhWECP5s+g4FY9cEIrEFa2 XH5Nw6IYN4+lEF9q9rUICvb+696oCRBtka0IF+PlN0LUhVEyXOxXN5nJW7++gzjqukRu dnnQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=n9f8H9rilDeqL2Q/LBgnRdqfit1Hio9yMKHLcqlQG9Q=; b=DWOq/CJHTM4PJK7hxadeAWwdkZKDNhXgqfA5O9OxhRejgc1Q5xZTaKD9Itc0jP2Ivk P1Ek2+cyQePhm+PBDRzGA2502qxtxRT8mF5juqA1WmEKB6/2PVEQKrxxS+HWfOb9qsXJ AYkpRNQjKbyAOGkL7KI/blZlnEuP62y/HTLvK4yqRowhhDQ3ztTGemmf/Gc5nz13V1Fs IGH3bHYU0e4vLnoaB/b4ajtiymtIroggPPFB3syftvOYAKH93glv7wlFIHmWoI3UGVdo gsF2HsEmVoL5PiuaY6GkXtbiH5B2dL1xqO6KeGNVUAP3p1v7PNWOiNvuYIgYHYRe4KaP SW/w== X-Gm-Message-State: AOUpUlGImk09aHXPM4+G6aGbwCErLKkdXnpiTSRzZhQOdkpw0hBXeUv6 NfjnUrUTU7M15vK/F4BF1Ie+nj1YRcJScrtaAaNu9Q== X-Google-Smtp-Source: AAOMgpdIhyoJMyEqFEkL6uHd1vgdtsnSJVK9gT/LSvqXPyfQ4Jy5BCCaf8skgLe/GZtXt0HlN+5DYlqKbr2Kdaq1ZmY= X-Received: by 2002:aca:cf97:: with SMTP id f145-v6mr1679332oig.131.1532176353882; Sat, 21 Jul 2018 05:32:33 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a4a:4786:0:0:0:0:0 with HTTP; Sat, 21 Jul 2018 05:31:53 -0700 (PDT) In-Reply-To: References: <3ce44a21a935f3d458bd4fea99db89a4fd2c9603.camel@ku.edu> Date: Sat, 21 Jul 2018 08:31:53 -0400 Message-ID: To: "Hoffman, Zachary Robert" Cc: "mapopa@gmail.com" , "me@kelunik.com" , "internals@lists.php.net" Content-Type: multipart/alternative; boundary="0000000000004e8ab60571819bb2" Subject: Re: [PHP-DEV] bugs.php.net downtime From: rasmus@lerdorf.com (Rasmus Lerdorf) --0000000000004e8ab60571819bb2 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Uh, ok, something obviously went wrong there. Checking. On Sat, Jul 21, 2018 at 8:30 AM, Rasmus Lerdorf wrote: > For future reference, here is what I did to fix the encoding problem: > > MariaDB [phpbugsdb]> select sdesc from bugdb where id=3D76553; > +----------------------------------------------------------- > ------------------------------------------------------------ > ---------------------------------------------+ > | sdesc > > | > +----------------------------------------------------------- > ------------------------------------------------------------ > ---------------------------------------------+ > | =C3=90=CB=9C=C3=90=C2=BC=C3=91 =C3=90=C2=BF=C3=90=C2=B5=C3=91=E2=82=AC= =C3=90=C2=B5=C3=90=C2=BC=C3=90=C2=B5=C3=90=C2=BD=C3=90=C2=BD=C3=90=C2=BE=C3= =90=C2=B9 =C3=90=C2=BC=C3=90=C2=BE=C3=90=C2=B6=C3=90=C2=B5=C3=91=E2=80=9A = =C3=91 =C3=90=C2=BE=C3=90=C2=B4=C3=90=C2=B5=C3=91=E2=82=AC=C3=90=C2=B6=C3= =90=C2=B0=C3=91=E2=80=9A=C3=91=C5=92 =C3=91=C6=92=C3=90=C2=BF=C3=91=E2=82= =AC=C3=90=C2=B0=C3=90=C2=B2=C3=90=C2=BB=C3=91 > =C3=91=C5=BD=C3=91=E2=80=B0=C3=90=C2=B8=C3=90=C2=B5 > | > +----------------------------------------------------------- > ------------------------------------------------------------ > ---------------------------------------------+ > 1 row in set (0.00 sec) > > MariaDB [phpbugsdb]> alter table bugdb drop index email; > Query OK, 76298 rows affected (0.85 sec) > Records: 76298 Duplicates: 0 Warnings: 0 > > MariaDB [phpbugsdb]> alter table bugdb modify sdesc varbinary(80) NOT NUL= L > DEFAULT '', modify ldesc binary NOT NULL, modify email varbinary(40) NOT > NULL DEFAULT ''; > Query OK, 76298 rows affected, 65535 warnings (0.65 sec) > Records: 76298 Duplicates: 0 Warnings: 76091 > > MariaDB [phpbugsdb]> alter table bugdb modify sdesc varchar(80) CHARACTER > SET utf8mb4 NOT NULL DEFAULT '', modify ldesc text CHARACTER SET utf8mb4 > NOT NULL, modify email varchar(40) CHARACTER SET utf8mb4 NOT NULL DEFAULT > ''; > Query OK, 76298 rows affected, 127 warnings (0.57 sec) > Records: 76298 Duplicates: 0 Warnings: 127 > > MariaDB [phpbugsdb]> alter table bugdb add FULLTEXT INDEX `email` > (`email`,`sdesc`,`ldesc`); > Query OK, 76298 rows affected (1.56 sec) > Records: 76298 Duplicates: 0 Warnings: 0 > > MariaDB [phpbugsdb]> select sdesc from bugdb where id=3D76553; > +----------------------------------------------------------- > -----------------------+ > | sdesc > | > +----------------------------------------------------------- > -----------------------+ > | =D0=98=D0=BC=D1=8F =D0=BF=D0=B5=D1=80=D0=B5=D0=BC=D0=B5=D0=BD=D0=BD=D0= =BE=D0=B9 =D0=BC=D0=BE=D0=B6=D0=B5=D1=82 =D1=81=D0=BE=D0=B4=D0=B5=D1=80=D0= =B6=D0=B0=D1=82=D1=8C =D1=83=D0=BF=D1=80=D0=B0=D0=B2=D0=BB=D1=8F=D1=8E=D1= =89=D0=B8=D0=B5 > | > +----------------------------------------------------------- > -----------------------+ > 1 row in set (0.00 sec) > > The trick was to convert the columns to binary first. When I went straigh= t > from latin1 to utf8 I got the utf8 equivalent of the latin1 characters. B= y > telling it that the data was actually binary first, it converted from > binary to utf8 which appears to have worked. There were some warnings, > which I assume are invalid utf8 byte sequences somewhere. > > -Rasmus > --0000000000004e8ab60571819bb2--