Newsgroups: php.internals
Path: news.php.net
Xref: news.php.net php.internals:115026
Message-ID: <9B304735-E0AD-4CC0-98BF-AAF4CE5FA52C@koalephant.com>
Content-Type: multipart/alternative;
	boundary="Apple-Mail=_6BE22978-5608-4B69-A63D-974163C1A3F6"
Mime-Version: 1.0 (Mac OS X Mail 14.0 \(3654.100.0.2.22\))
Date: Tue, 22 Jun 2021 21:05:03 +0700
In-Reply-To: <CAFv4g+GJrX6uNSAq=kAXDk2Lx=NLx=GxAw9Qh+cpZwbaVdgtJA@mail.gmail.com>
Cc: Benjamin Morel <benjamin.morel@gmail.com>,
 Derick Rethans <derick@php.net>,
 PHP Internals <internals@lists.php.net>,
 Yasuo Ohgaki <yohgaki@ohgaki.net>
To: Craig Francis <craig@craigfrancis.co.uk>
References: <CAFv4g+GjXqiZkA2fawt7jHDtDAKjB0rv=Lfbcobu7bn8B=wvpw@mail.gmail.com>
 <CAGa2bXbGv33zHfRpu-hN_wNcNKPpFw_ed1ZRnn96tpDf0ROcLg@mail.gmail.com>
 <0CD1762E-6094-4DEB-B1B5-22CFBDAAFF44@php.net>
 <CAG9XoMQUUYGhA6QL3Se4FNJgydjzX0MAeR_CWr5O1SH8xgypLA@mail.gmail.com>
 <CAFv4g+ET4YDpygjKqGbAwxt+jkA8kWTeUrmTcR3hbFGEGNgmDg@mail.gmail.com>
 <BFFD3FBB-721B-491B-ADDB-DE3CA535BB52@koalephant.com>
 <CAFv4g+GJrX6uNSAq=kAXDk2Lx=NLx=GxAw9Qh+cpZwbaVdgtJA@mail.gmail.com>
Subject: Re: [PHP-DEV] [RFC] is_trusted - was is_literal
From: php-lists@koalephant.com (Stephen Reay)

--Apple-Mail=_6BE22978-5608-4B69-A63D-974163C1A3F6
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8


> On 22 Jun 2021, at 20:13, Craig Francis <craig@craigfrancis.co.uk> wrote:
>
> On Tue, 22 Jun 2021 at 09:59, Stephen Reay <php-lists@koalephant.com> wrote:
> So I just want to make sure I understand the progression on this so far.
> It started out with people wanting a way to check that a string was a
> literal string, in code somewhere, and does not come from user input.
> Ok makes sense. The name makes sense too.
>
> The primary reason was never just to define literal strings, the intention
> has always been to create a practical, implementable solution to address
> the issue of Injection Vulnerabilities (SQL/HTML/CLI/etc).
>

Preventing injection vulnerabilities may be your goal, but I’m talking about
the intended behaviour of this one function. Your original email says this:

>> Distinguishing strings from a trusted developer from strings that may be
>> attacker controlled



If you feel that somehow doesn’t mean the same as “check that a string was a
literal string, in code somewhere, and does not come from user input”, then
we need to crack open a dictionary and work out which words one of us
doesn’t know the meaning of.




> The name `is_literal()` has always just been a placeholder, it came up when
> I first started looking at this problem because that was the most obvious
> thing I knew we could anchor around. (Unfortunately I think it was easy to
> make assumptions based solely on that name, rather than focussing on the
> issue it is meant to address).
>
> So, we cannot look for literals only - while it was part of the solution,
> it was very much incomplete. Bearing in mind, there is a considerable
> amount of existing code and tutorials out there which include integers in
> their SQL/HTML/CLI/etc, and they are perfectly safe in doing so. Making a
> solution which does not support integers is not going to be adopted/used
> because the task of rewriting and changing everything, for no benefit, will
> not be considered by developers.
>

There is a considerable amount of existing code that includes strings in SQL
and HTML without danger too. Plenty of string values are fine, and plenty of
integer values are fine. That doesn’t mean we should just blindly trust a
value that came from the user, just because it’s a number.
The saying goes “never trust user input”, not “never trust user input unless
it’s a number”.
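
One way to picture that point is the sketch below. It is only an illustration
under stated assumptions: it uses PDO with an in-memory SQLite database, and
the `invoices` table, its rows, and the `$userSuppliedId` variable are all
hypothetical. The SQL text is a fixed string written by the developer, and the
user-supplied number is bound as data, the same way a user-supplied string
would be.

```php
<?php
declare(strict_types=1);

// Hypothetical schema and data, for illustration only.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$pdo->exec('CREATE TABLE invoices (id INTEGER PRIMARY KEY, owner TEXT)');
$pdo->exec("INSERT INTO invoices (id, owner) VALUES (1, 'alice')");
$pdo->exec("INSERT INTO invoices (id, owner) VALUES (2, 'bob')");

// Stand-in for a value taken from the request (e.g. a query parameter).
// It is an integer, but it is still chosen by the user.
$userSuppliedId = 2;

// The query text contains no user input at all; the number is bound
// as a typed parameter instead of being concatenated into the SQL.
$stmt = $pdo->prepare('SELECT owner FROM invoices WHERE id = :id');
$stmt->bindValue(':id', $userSuppliedId, PDO::PARAM_INT);
$stmt->execute();

$owner = $stmt->fetchColumn();
```

Binding the integer does not make the value trustworthy (the user still picks
which row is read); it only keeps the untrusted value out of the SQL text.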


> Likewise, a lot of code already builds SQL/HTML/CLI/etc strings via
> concatenation and sprintf(), and forcing everyone to use a query builder is
> likely to cause most people to not even consider using this.
>

If they won’t adopt an existing solution to the problem, why would they adopt
this?
You’ve said very recently that this is not intended to be used directly by
most developers, and is instead to be used within libraries and frameworks.
It seems a little weird to then make concessions that will defeat the stated
goal, in the name of adoption.


> It's all well thinking of one thing that might
> theoretically/idealistically solve the issue, but it also needs to have a
> plan on how this will be practically implemented and used by developers
> (which this has done).
>


Having a plan for how to implement something doesn’t help much when the thing
you’re implementing deliberately ignores a specific type of ‘untrusted’
input.





--Apple-Mail=_6BE22978-5608-4B69-A63D-974163C1A3F6--