Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:59774 Return-Path: Mailing-List: contact internals-help@lists.php.net; run by ezmlm Delivered-To: mailing list internals@lists.php.net Received: (qmail 63936 invoked from network); 12 Apr 2012 01:53:29 -0000 Received: from unknown (HELO lists.php.net) (127.0.0.1) by localhost with SMTP; 12 Apr 2012 01:53:29 -0000 Authentication-Results: pb1.pair.com header.from=john.lesueur@gmail.com; sender-id=pass Authentication-Results: pb1.pair.com smtp.mail=john.lesueur@gmail.com; spf=pass; sender-id=pass Received-SPF: pass (pb1.pair.com: domain gmail.com designates 74.125.82.54 as permitted sender) X-PHP-List-Original-Sender: john.lesueur@gmail.com X-Host-Fingerprint: 74.125.82.54 mail-wg0-f54.google.com Received: from [74.125.82.54] ([74.125.82.54:62987] helo=mail-wg0-f54.google.com) by pb1.pair.com (ecelerity 2.1.1.9-wez r(12769M)) with ESMTP id B8/5D-23245-895368F4 for ; Wed, 11 Apr 2012 21:53:29 -0400 Received: by wgbdq13 with SMTP id dq13so1213879wgb.11 for ; Wed, 11 Apr 2012 18:53:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=YCgitW6ckLYefytcf2ODd+XDJ26Zwyl+KNPMWB29FmU=; b=DhQ7UNiC4BItTgp4EFftWA6fS+kmEyB5y49NeZQqM7P8Y9EGQ0FINnHPJZKV/ysD2R gXjF95lT6gsFgmSCTFnaDRQZAkhuC239jHjkPvJWrOU9HaPpQyTHlEFPRpEHtsgNasd3 CjmYeHPGdRGNJhxjAPn9aBPDGs84YUzejjjXjeAZMZ396wMNYT5JDgGk06pb7axqcvbV VpYfKtNvKZFpGEVNm3qJ/mjUpUk02CSfloH9796Mi03Fl7FvsylJ3oo5YrNtxoxDpAZY WXKBhhMlj6kTa64HZXxoK2zZpwF2Fwz/0TkTU4tzhcwD9iE5spbCIkkWJ9lX7rtm0Gbq rVeQ== MIME-Version: 1.0 Received: by 10.216.135.106 with SMTP id t84mr345421wei.74.1334195605823; Wed, 11 Apr 2012 18:53:25 -0700 (PDT) Received: by 10.216.208.224 with HTTP; Wed, 11 Apr 2012 18:53:25 -0700 (PDT) In-Reply-To: <4F862808.1020405@lerdorf.com> References: <4F850D06.10701@sugarcrm.com> <4F8515AF.8060706@sugarcrm.com> <4F851FE4.7000706@sugarcrm.com> <4F8539E0.1090701@sugarcrm.com> <4F859063.1010401@lerdorf.com> <4F862808.1020405@lerdorf.com> Date: Wed, 11 Apr 2012 19:53:25 -0600 Message-ID: To: Rasmus Lerdorf Cc: John Crenshaw , Stas Malyshev , Yasuo Ohgaki , "internals@lists.php.net" Content-Type: multipart/alternative; boundary=0016e6dee74a25609b04bd719fd8 Subject: Re: [PHP-DEV] Re: Disabling PHP tags by php.ini and CLI options From: john.lesueur@gmail.com (John LeSueur) --0016e6dee74a25609b04bd719fd8 Content-Type: text/plain; charset=ISO-8859-1 On Wed, Apr 11, 2012 at 6:55 PM, Rasmus Lerdorf wrote: > On 04/11/2012 10:38 AM, John Crenshaw wrote: > > From: Rasmus Lerdorf [mailto:rasmus@lerdorf.com] > >> I guess he is saying that it prevents: > >> > >> Random bytes > >> > >> More random bytes > >> > >> Where random bytes might be an image file so finfo_file() might > identify it as a valid image > > > > Right, but anyone can trivially construct a fully valid bitmap with a > starting byte sequence of `42 4D 3B 2F 2A`, which resolves to `BM;/*`. PHP > will decide that BM meant 'BM', effectively skipping it, then the open > comment will slide the PHP interpreter past any remaining header stuff. You > can close the comment and place the actual code payload anywhere in the > image data. The early bytes in other image formats are similarly > exploitable. As far as I can tell there is really no security win here. > > > >> 4. Only protecting against mid-script injections and not top-of-script > injections is a somewhat subtle concept when the real problem is the > vulnerable include $_GET['filename'] hole. If this really is a prevalent > problem, maybe instead of trying to mitigate the symptoms, why don't we try > to attack the actual cause of the problem. I would love to hear some ideas > along those lines that don't fundamentally change the nature of PHP for > somewhat cloudy benefits. > >> > >> -Rasmus > > > > It's disturbingly common. Probably 90% of the automated attacks I see in > the 404 error logs are trying to exploit various inclusion vulnerabilities. > > > > One idea that comes to mind immediately is the old taint RFC: > https://wiki.php.net/rfc/taint. This doesn't actually prevent LFI, but it > (optionally) warns the developer that they did something very bad, > regardless of whether it actually caused a problem with the specific input > data. I'd really love to see that one finalized and implemented. > > > > Another wild alternative could be to have a non-trivial string format > internally, where PHP strings are actually a set of distinct blocks which > each contain encoding information. This would make it possible to > concatenate strings just as always, but since the attributes of each block > are known the entire string contents could be manipulated to an arbitrary > final encoding, (or rejected as impossible to safely convert) when the > string is actually used. In the include case this isn't really very > different from taint, because safe conversion is impossible, but for things > like XSS and SQL injection it could actually *fix* the otherwise vulnerable > code. A simplified example of how this might work: > > I think you may be overthinking it. I was thinking more along the lines > of having some rules for include/require. Something like every > non-relative include/require must start with a const string and any > variable part cannot have '..' in it. As in: > > Say $variable gets set to '/etc/passwd' then > > include $variable; > > would fail because it is an absolute-path include without a leading > const. However, this would work: > > include '/tmp/' . $variable; > > And so would this: > > include INSTALL_PATH . $variable; > > Relative includes including the ones that are relative to include_path > wouldn't change. If the bad guys can write to the doc root or a > directory below the doc root then they don't need LFI, they can just hit > the path directly from their browser. > > Obviously still disruptive and there would be some BC breaks, but I bet > it would be more effective than trying to optionally turn off the parser > in the right places. > > -Rasmus > > If this were in serious consideration, wouldn't it need to be two parameters, so that the engine can check them separately? If we really are concerned about developers creating LFI vulnerabilities[1], I think this is the way to do it. It's a little like setting open_basedir before every include, but requiring the open_basedir to be a constant. Developers can deliberately sidestep this if they need to use a variable, like: define('install_dir', $config['install_dir']); include myPath, $file; As you said, there's still a lot of disruption, so it's only worth it if we really think that preventing developers from writing vulnerable code is required, because education won't work. Overall, I'm ambivalent about allowing some files to omit in those files. If we do so, it feels more like the file itself should somehow communicate that it should be parsed differently[2], rather than making it the responsibility of the including script. Any security benefit is more like a side effect than the motivating factor. Relying on php's failure to parse a file isn't security, it's an accident. Thanks, John [1] there really are a lot out there, and from the little I saw, it's usually in CMS's which are often used by non-php programmers, and the plugins are often written as a first-time, let's try this out kind of thing. [2] starting with "