Newsgroups: php.internals Path: news.php.net Xref: news.php.net php.internals:125924 X-Original-To: internals@lists.php.net Delivered-To: internals@lists.php.net Received: from php-smtp4.php.net (php-smtp4.php.net [45.112.84.5]) by qa.php.net (Postfix) with ESMTPS id BD0311A00BD for ; Thu, 7 Nov 2024 16:18:31 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=php.net; s=mail; t=1730996464; bh=cT1itJDZanEdcw/QN04lUTeV2I6wHN6wrTrH5KtxbC0=; h=Date:From:To:cc:Subject:In-Reply-To:References:From; b=do0RqijJWrgf0gwHbvVhRlhM5UFIVdY5zRV9ycOD/V5TEaGCeSG40IjpynwT0edHH vz3l9cVeh9MjMqOmLVTrsSqoVJoKs1cMZE90Ppk2r1g37PzGpbdX+eMpFbDvBsWOyC xdgYVeMQ/vGr/cqy+2AJa3EDlhKjdKTDx3zUPKxi4uDaedqkl9BunS2WkUILeErFT0 i2jw4A/bcoqgLdXQmaUG4xA1ZxdugbV4cHHGcLzNYvlrf16euKcWmxl8JMPWE4ct48 oN+W5FO57XDjF3zUz0fYLg58xjxZP/vNvrBqQMkBcAqo2CdVgnDQwxUj4QFc2lSg7N +DiVVVWEysn5A== Received: from php-smtp4.php.net (localhost [127.0.0.1]) by php-smtp4.php.net (Postfix) with ESMTP id F33DF18007C for ; Thu, 7 Nov 2024 16:21:02 +0000 (UTC) X-Spam-Checker-Version: SpamAssassin 4.0.0 (2022-12-13) on php-smtp4.php.net X-Spam-Level: *** X-Spam-Status: No, score=3.6 required=5.0 tests=BAYES_50,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,DMARC_PASS,SPF_HELO_PASS, SPF_SOFTFAIL autolearn=no autolearn_force=no version=4.0.0 X-Spam-Virus: No X-Envelope-From: Received: from xdebug.org (xdebug.org [82.113.146.227]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by php-smtp4.php.net (Postfix) with ESMTPS for ; Thu, 7 Nov 2024 16:21:02 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=php.net; s=mail; t=1730996308; bh=cT1itJDZanEdcw/QN04lUTeV2I6wHN6wrTrH5KtxbC0=; h=Date:From:To:cc:Subject:In-Reply-To:References:From; b=EM0joy4wY9eazQ9vuNHLT+LwCRkkashblCjw51qMG6Ko24ogFrzdj+EEF+EjQ2/pE qFtExUOQmRhUqxclez0Li5uUONGZ/B1evBPUaOtEcMG3aPA8RDu3Rr8B1yOk930Jg9 w8YptoohpjRsSXe2/CLDDjdElFDPumljbPfhGs9UOqbaMAFvSpJIh3zAC80XFjXOhA m7JiLixNkOWLNggiroFODQq8TO+aZgesTYC8md88SKmOqRbdgfxG4/QWQLun6Jjtj4 6T4H9nDJrCaKK/DAkJdUtrHYXlz2WoW74Z6yIM8nV6zYMsAysnAK1tVveycFQ/JBXm zItJKOPEwsc6Q== Received: from localhost (localhost [IPv6:::1]) by xdebug.org (Postfix) with ESMTPS id 709AB10C03E; Thu, 07 Nov 2024 16:18:28 +0000 (GMT) Date: Thu, 7 Nov 2024 16:18:28 +0000 (GMT) To: Larry Garfield cc: php internals Subject: Re: [PHP-DEV] [RFC] PHP.net analytics In-Reply-To: Message-ID: <69954ce9-f4c3-e29c-c2e8-c4f0b3accb5f@php.net> References: <55320aad-758a-4d06-b1bd-3eac2b5a5f71@app.fastmail.com> Precedence: bulk list-help: list-post: List-Id: internals.lists.php.net x-ms-reactions: disallow MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII From: derick@php.net (Derick Rethans) On Tue, 5 Nov 2024, Larry Garfield wrote: > On Fri, Nov 1, 2024, at 6:10 PM, Bob Weinand wrote: > > On 1.11.2024 22:41:29, Larry Garfield wrote: > >> In a similar vein to approving the use of software, Roman Pronskiy > >> asked for my help putting together an RFC on collecting analytics > >> for PHP.net. > >> > >> https://wiki.php.net/rfc/phpnet-analytics > >> > >> Of particular note: > >> > >> * This is self-hosted, first-party only. No third parties get data, so no third parties can do evil things with it. > >> * There is no plan to collect any PII. > >> * The goal is to figure how how to most efficiently spend Foundation money improving php.net, something that is sorely needed. > >> > >> Ideally we'd have this in place by the 8.4 release or shortly > >> thereafter, though I realize that's a little tight on the timeline. > > > > Hey Larry, > > > > I have a couple concerns and questions: > > > > Is there a way to track analytics with only transient data? As in, data > > actually stored is always already anonymized enough that it would be > > unproblematic to share it with everyone? > > Or possibly, is there a retention period for the raw data after which > > only anonymized data remains? > > The plan is to configure Matomo to not collect anything non-anonymous > to begin with, to the extent possible. We're absolutely not talking > about user-stalking like ad companies do, or anything even remotely > close to that. > > I'm not convinced that publishing raw, even anonymized data, is > valuable or responsible. I don't know of any other sites off hand > that publish their raw analytics, and I don't know what purpose that > would serve other than just a principled "radical transparency" > stance, which I generally don't agree with. > > However, having an automated aggregate dashboard similar to > https://analytics.bookstackapp.com/bookstackapp.com (made by a > different tool, but same idea) that we could make public is the goal, > but we don't want to do that until it's been running a while and we're > sure that nothing personally identifiable could leak through that way. I would have no issues publishing this HTML File once a week, created by parsing the log files with: sudo zcat access.log.*.gz | sudo goaccess access.log access.log.1 --anonymize-ip --ignore-crawlers --real-os --max-items=500 --log-format=combined --enable-panel=REQUESTS --output=/tmp/access.html https://derickrethans.nl/files/dump/access.html (for my own site as example). > * How many people are accessing the site from a desktop vs mobile? > * What speed connection do people have? > * How many people are using the in-browser Wasm code runner that is currently being worked on? cf: https://github.com/php/web-php/pull/1097 Server logs contain the browser handle at least. I am not sure whether the Matomo JS tracker actually distinguishes between desktop and mobile, or the other two points you mention here. The connection speed can be stored in server logs (we don't at the moment): https://onecompiler.com/questions/3src8rjkd/how-to-log-application-response-time-with-nginx cheers, Derick -- https://derickrethans.nl | https://xdebug.org | https://dram.io Author of Xdebug. Like it? Consider supporting me: https://xdebug.org/support mastodon: @derickr@phpc.social @xdebug@phpc.social