Newsgroups: php.internals
From: ceo@l-i-e.com ("Richard Lynch")
To: "PHPdev"
Date: Mon, 15 May 2006 17:19:32 -0500 (CDT)
Subject: Re: [PHP-DEV] [php6] accepting an ascii string only
Message-ID: <1708.209.254.223.2.1147731572.squirrel@www.l-i-e.com>
In-Reply-To: <7abd987289270ef00178d1bbf8e32c0b@gravitonic.com>

On Mon, May 15, 2006 4:19 pm, Andrei Zmievski wrote:
> That assumes there are a hundred places where you want to receive an
> ASCII string. Are they really that prevalent?

How many of the extension libraries are Unicode-ready?

You see an awful lot of users with quasi-Unicode data that gets into their database from un-scrubbed (or minimally-scrubbed) form data, and then they just blindly pass it off to the extension library.

This usually results in "bug reports" of "weird characters" or cries for help in PHP-General --

And one look at User Contributed comments on http://php.net/str_replace will tell you that your average PHP Developer has NO CLUE about Unicode, and, to be honest, doesn't really care that much.

They're writing a boutique site for a client who doesn't have any aspirations to a World Market, despite being "on" the World Wide Web.

So when all these internal strings change overnight into Unicode and the multi-byte data is blindly shoved out to tidy, libpdf, mysql, postgresql, GD, imagemagick, shell scripts, and no less than four (4) different XML parsing libraries over the years, you tell me:

What's gonna happen?

Is it going to magically "fix" all those weird-looking characters because PHP is shoving a Unicode string out to the extension library and all the extension libraries are ready for Unicode?

Is it going to "just work" albeit with funky-looking characters just like before because the extension libraries aren't ready for Unicode yet, just like they aren't thread-safe so Apache 2 is kinda pointless?

Or is it going to blow up in their faces because PHP is suddenly assuming that all these extension libraries can cope with Unicode strings?

Personally, I'm terrified as a PHP Developer by the Unicode change. I *know* I don't have the skillset to handle Unicode issues. If you guys don't make it "just work" -- I'm screwed, along with HUGE segments of your install-base.

How hard do you really want to make it for extensions to be written to deal with Unicode strings if they're not Unicode-ready?

I have NO IDEA what the answers to these questions are.

But I sure hope you guys do... :-)

-- 
Like Music?
http://l-i-e.com/artists.htm