Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:38062 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 82416 invoked from network); 1 Jun 2008 21:27:50 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 1 Jun 2008 21:27:50 -0000 X-Host-Fingerprint: 24.228.80.153 ool-18e45099.dyn.optonline.net Received: from [24.228.80.153] ([24.228.80.153:14353] helo=localhost.localdomain) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id 3F/E7-35077-45413484 for ; Sun, 01 Jun 2008 17:27:49 -0400 Message-ID: <3F.E7.35077.45413484@pb1.pair.com> To: internals@lists.php.net Date: Sun, 01 Jun 2008 17:27:54 -0400 User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.5) Gecko/20041206 Thunderbird/1.0 Mnenhy/0.6.0.104 MIME-Version: 1.0 References: <8A.30.24593.DBF2E384@pb1.pair.com> <483F6261.4040802@thewritingpot.com> In-Reply-To: <483F6261.4040802@thewritingpot.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Posted-By: 24.228.80.153 Subject: Re: [PHP-DEV] Unicode and XML From: edwardzyang@thewritingpot.com ("Edward Z. Yang") Edward Z. Yang wrote: > My proposal is to introduce a new filter (for the filter extension) > which performs codepoint sanitization appropriate for HTML/XML contexts > (alternatively, this could be an option on the FILTER_DEFAULT filter, > which would be for Unicode strings, I assume). This filter would be > turned ON by default, and users could turn it off using a special > option. Thus, codepoint sanitization would work invisibly for users who > don't care, and would be accessible to users who do (i.e. those who > don't mind mucking around with unpaired surrogates or the like. This [1] > gives quite a good explanation about what this is all about). Time to squeak. Are there any comments on this proposal? -- Edward Z. Yang GnuPG: 0x869C48DA HTML Purifier Anti-XSS Filter [[ 3FA8 E9A9 7385 B691 A6FC B3CB A933 BE7D 869C 48DA ]]