Newsgroups: php.internals
Subject: Re: inconsistency in using utf-8
From: mike@php.net (Michael Wallner)
To: internals@lists.php.net
Date: Thu, 15 Feb 2007 20:12:47 +0100
In-Reply-To: <7A.8A.07898.B7A8FC54@pb1.pair.com>

Mathias Bank wrote:
> If you use tiny_html
> (http://de2.php.net/manual/en/function.tidy-repair-string.php), you have
> to use "utf8" to encode your data in utf-8.
>
> If you use html_entity_decode()
> (http://de2.php.net/manual/en/function.html-entity-decode.php), you have
> to use "UTF-8".
>
> This difference is not necessary. Can it be possible to use one
> notation. With current implementation, you always has to look into the
> documentation, which notation you have to use.

libtidy supports only a limited set of charsets, and their names are
defined within the library itself, so we can't do anything about that.

http://php.net/tidy_set_encoding

--
Michael
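
For anyone who wants a single notation in their own code, the names can be
translated before calling tidy. The following is only a minimal sketch, not
anything PHP or libtidy provides: the helper tidy_encoding_name() is
hypothetical, and its table is an assumed, deliberately incomplete mapping
from the labels html_entity_decode() accepts to the names libtidy uses.

```php
<?php
// Hypothetical helper (not part of PHP or the tidy extension): translate a
// charset label such as "UTF-8" into the encoding name libtidy expects.
// Returns null when the label has no entry in the table.
function tidy_encoding_name(string $charset): ?string
{
    // Assumed, incomplete mapping; extend it for the charsets you need.
    static $map = [
        'utf-8'        => 'utf8',
        'utf8'         => 'utf8',
        'iso-8859-1'   => 'latin1',
        'iso-8859-15'  => 'latin0',
        'windows-1252' => 'win1252',
        'cp1252'       => 'win1252',
        'us-ascii'     => 'ascii',
    ];

    // Labels are matched case-insensitively, ignoring surrounding whitespace.
    $key = strtolower(trim($charset));

    return $map[$key] ?? null;
}

// Usage: keep one notation ("UTF-8") in the calling code.
$charset = 'UTF-8';
$html    = '<p>caf&eacute;';

// html_entity_decode() takes the label as written.
$decoded = html_entity_decode($html, ENT_QUOTES, $charset);

// tidy takes the translated name; skipped when the extension is not loaded.
if (function_exists('tidy_repair_string')) {
    $repaired = tidy_repair_string($html, [], tidy_encoding_name($charset));
}
```

Returning null for an unknown label keeps the decision about a fallback
encoding with the caller instead of silently guessing one.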