Newsgroups: php.internals
Path: news.php.net
Xref: news.php.net php.internals:126260
Precedence: bulk
MIME-Version: 1.0
References: <CAGVaEcmDxxbFzNxXnF=jmdUDnQTk55zgorbsqQqyP8w2Z=-iaw@mail.gmail.com>
 <CAEKnhAFeiJESkqJ0o9PuU59VFK1hdfk2xeEDJn_BbsaF0sBy7Q@mail.gmail.com>
 <CAGVaEcnOWGYPPUTYn1OZkxCvLmXSnzrQm73YB=ucRMuOA6DCpA@mail.gmail.com>
 <CAEKnhAHO7z1_fW6Gkko4GDm3GGS+0sD9AEuMyhna+PQ_5Kx+2g@mail.gmail.com> <CAGVaEckDBNijoJ5U+sbmyzfp8rD=SuuDY8p3i5=M26-52qs5YQ@mail.gmail.com>
In-Reply-To: <CAGVaEckDBNijoJ5U+sbmyzfp8rD=SuuDY8p3i5=M26-52qs5YQ@mail.gmail.com>
Date: Fri, 31 Jan 2025 16:30:07 +0100
Message-ID: <CAEKnhAEWupsfksLKoOCOkt5c=MgKWyJhGARB1Spc=VY2tHVs2w@mail.gmail.com>
Subject: Re: [PHP-DEV] [RFC] Introducing pm.max_memory for PHP-FPM
To: Arkadiy Kulev <eth@ethaniel.com>
Cc: internals@lists.php.net
Content-Type: multipart/alternative; boundary="0000000000005a0f9f062d0238e1"
From: bukka@php.net (Jakub Zelenka)

--0000000000005a0f9f062d0238e1
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Fri, Jan 31, 2025 at 4:06=E2=80=AFPM Arkadiy Kulev <eth@ethaniel.com> wr=
ote:

> Hi!
>
>
>> This wouldn't really work because FPM does not control the script during
>>>> execution and would have to check it out after each allocation which i=
s not
>>>> really viable.
>>>>
>>>
>>>>
>>> Thanks for the feedback! I agree that monitoring memory usage after eac=
h
>>> allocation would be infeasible. However, my suggestion was actually to
>>> check memory usage *only once per request*, specifically at request
>>> shutdown, when FPM regains control and before assigning another request=
 to
>>> that worker.
>>>
>>
>>>
>> I think that would require a different name because it would not reflect
>> max memory usage in any way - it would be measured at the time when the
>> memory usage is lowest. We could maybe set some options that would measu=
re
>> the maximum of increase of memory between requests - I mean difference
>> between lowest memory usage (most likely after the first request) and th=
en
>> compare against current usage after the latest request and set limit on
>> this difference. Not sure about the name for that. Maybe something like
>> pm.max_memory_increase or something like that.
>>
>
> I see where you=E2=80=99re coming from, but I believe measuring memory =
=E2=80=9Cdelta=E2=80=9D or
> =E2=80=9Cincrease=E2=80=9D could be confusing for end users. In practice,=
 admins and
> developers often glance at tools like top or ps to see *current* memory
> usage for each FPM worker, spot outliers, and then set a threshold
> accordingly.
>
> If we start talking about =E2=80=9Clowest usage=E2=80=9D vs. =E2=80=9Ccur=
rent usage=E2=80=9D and a =E2=80=9Cmax
> increase,=E2=80=9D that becomes much harder to translate to real-world
> monitoring=E2=80=94and it=E2=80=99s non-intuitive compared to simply read=
ing the value
> right off top and setting a limit. So, from a usability standpoint, I
> think a direct measurement of resident memory (as people see in common
> system tools) would be the most straightforward and least confusing.
>
It's probably less confusing than setting pm.max_memory to some value and
then see that the process allocates much more. We could potentially call it
pm.max_idle_memory or something that clearly shows that it's not a total
max memory.


> Just to be clear, "memory_limit" helps kill runaway scripts mid-request.
>>> By contrast, the newly proposed pm.max_memory is meant to catch process=
es
>>> with a slow leak across multiple requests. We only need to check at the=
 end
>>> of each request, which is presumably when the worker returns control to=
 FPM.
>>>
>>
>> There is one thing to note that memory_limit actually measure only memor=
y
>> allocated through the per request php memory allocator so it's not actua=
lly
>> limit on total usage including the standard allocator memory usage. So
>> there would be still a use case for total limit using cgroups but I agre=
e
>> that the more important use is to catch slow leaks which the above shoul=
d
>> help with in a better way than pm.max_requests.
>>
>
> You=E2=80=99re absolutely right that cgroups handle total memory usage=E2=
=80=94including
> memory outside the PHP allocator=E2=80=94more accurately. But as I=E2=80=
=99ve mentioned
> before, relying on cgroups to limit memory typically means an OOM kill th=
at
> can happen at *any* moment, often right in the middle of a request.
> That=E2=80=99s precisely what I=E2=80=99m trying to avoid.
> The whole idea behind pm.max_memory is to allow a *graceful* check *after=
*
> each request completes, so we can recycle the worker before starting a ne=
w
> request. That way, no request gets abruptly killed. cgroups don=E2=80=99t=
 really
> accommodate that scenario=E2=80=94they=E2=80=99re great for overall resou=
rce control but
> not for per-request, child-level recycling within FPM.
>

Yeah it should really be the last resort. Agreed that for this particular
case, the solution above would be better.

Regards,

Jakub

--0000000000005a0f9f062d0238e1
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote g=
mail_quote_container"><div dir=3D"ltr" class=3D"gmail_attr">On Fri, Jan 31,=
 2025 at 4:06=E2=80=AFPM Arkadiy Kulev &lt;<a href=3D"mailto:eth@ethaniel.c=
om">eth@ethaniel.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quo=
te" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204=
);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote"><div>Hi!</d=
iv><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0=
px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div =
dir=3D"ltr"><div class=3D"gmail_quote"><blockquote class=3D"gmail_quote" st=
yle=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padd=
ing-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote"><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rg=
b(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote=
">This wouldn&#39;t really work because FPM does not control the script dur=
ing execution and would have to check it out after each allocation which is=
 not really viable.</div></div></blockquote></div></div></blockquote></div>=
</div></blockquote><blockquote class=3D"gmail_quote" style=3D"margin:0px 0p=
x 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div d=
ir=3D"ltr"><div class=3D"gmail_quote"><blockquote class=3D"gmail_quote" sty=
le=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);paddi=
ng-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote"><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rg=
b(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote=
">=C2=A0</div></div></blockquote></div></div></blockquote></div></div></blo=
ckquote><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex=
;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr">=
<div class=3D"gmail_quote"><blockquote class=3D"gmail_quote" style=3D"margi=
n:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex=
"><div dir=3D"ltr"><div class=3D"gmail_quote"><div><div>Thanks for the feed=
back! I agree that monitoring memory usage after each allocation would be i=
nfeasible. However, my suggestion was actually to check memory usage <b>onl=
y once per request</b>, specifically at request shutdown, when FPM regains =
control and before assigning another request to that worker.=C2=A0=C2=A0</d=
iv></div></div></div></blockquote></div></div></blockquote><blockquote clas=
s=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid r=
gb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quot=
e"><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bord=
er-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div =
class=3D"gmail_quote"><div><div>=C2=A0</div></div></div></div></blockquote>=
</div></div></blockquote><blockquote class=3D"gmail_quote" style=3D"margin:=
0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">=
<div dir=3D"ltr"><div class=3D"gmail_quote"><div>I think that would require=
 a different name because it would not reflect max memory usage in any way =
- it would be measured at the time when the memory usage is lowest. We coul=
d maybe set some options that would measure the maximum of increase of memo=
ry between requests - I mean difference between lowest memory usage (most l=
ikely after the first request) and then compare against current usage after=
 the latest request and set limit on this difference. Not sure about the na=
me for that. Maybe something like pm.max_memory_increase or something like =
that.</div></div></div></blockquote><div><div>=C2=A0</div><div>I see where =
you=E2=80=99re coming from, but I believe measuring memory =E2=80=9Cdelta=
=E2=80=9D or =E2=80=9Cincrease=E2=80=9D could be confusing for end users. I=
n practice, admins and developers often glance at tools like=C2=A0<code>top=
</code>=C2=A0or=C2=A0<code>ps</code>=C2=A0to see=C2=A0<em>current</em>=C2=
=A0memory usage for each FPM worker, spot outliers, and then set a threshol=
d accordingly.</div><div><p>If we start talking about =E2=80=9Clowest usage=
=E2=80=9D vs. =E2=80=9Ccurrent usage=E2=80=9D and a =E2=80=9Cmax increase,=
=E2=80=9D that becomes much harder to translate to real-world monitoring=E2=
=80=94and it=E2=80=99s non-intuitive compared to simply reading the value r=
ight off=C2=A0<code>top</code>=C2=A0and setting a limit. So, from a usabili=
ty standpoint, I think a direct measurement of resident memory (as people s=
ee in common system tools) would be the most straightforward and least conf=
using.</p></div></div></div></div></blockquote><div>It&#39;s probably less =
confusing than setting pm.max_memory to some value and then see that the pr=
ocess allocates much more. We could potentially call it pm.max_idle_memory =
or something that clearly shows that it&#39;s not a total max memory.</div>=
<div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px =
0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=
=3D"ltr"><div class=3D"gmail_quote"><div></div><blockquote class=3D"gmail_q=
uote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,2=
04);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_quote"><blockquo=
te class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px =
solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gma=
il_quote"><div></div><div>Just to be clear, &quot;memory_limit&quot; helps =
kill runaway scripts mid-request. By contrast, the newly proposed pm.max_me=
mory is meant to catch processes with a slow leak across multiple requests.=
 We only need to check at the end of each request, which is presumably when=
 the worker returns control to FPM.</div></div></div></blockquote><div><br>=
</div><div>There is one thing to note that memory_limit actually measure on=
ly memory allocated through the per request php memory allocator so it&#39;=
s not actually limit on total usage including the standard allocator memory=
 usage. So there would be still a use case for total limit using cgroups bu=
t I agree that the more important use is to catch slow leaks which the abov=
e should help with in a better way than pm.max_requests.</div></div></div><=
/blockquote><div><br><p>You=E2=80=99re absolutely right that cgroups handle=
 total memory usage=E2=80=94including memory outside the PHP allocator=E2=
=80=94more accurately. But as I=E2=80=99ve mentioned before, relying on cgr=
oups to limit memory typically means an OOM kill that can happen at <em>any=
</em> moment, often right in the middle of a request. That=E2=80=99s precis=
ely what I=E2=80=99m trying to avoid.</p>The whole idea behind <code>pm.max=
_memory</code> is to allow a <strong>graceful</strong> check <em>after</em>=
 each request completes, so we can recycle the worker before starting a new=
 request. That way, no request gets abruptly killed. cgroups don=E2=80=99t =
really accommodate that scenario=E2=80=94they=E2=80=99re great for overall =
resource control but not for per-request, child-level recycling within FPM.=
=C2=A0</div></div></div></blockquote><div><br></div><div>Yeah it should rea=
lly be the last resort. Agreed that for this particular case, the solution =
above would be better.</div><div><br></div><div>Regards,</div><div><br></di=
v><div>Jakub</div></div></div>

--0000000000005a0f9f062d0238e1--