[Development] Decrease amounth of delivered src packages
Konstantin Tokarev
annulen at yandex.ru
Thu Feb 16 16:53:12 CET 2017
16.02.2017, 02:12, "Mathias Hasselmann" <mathias at taschenorakel.de>:
> Am 15.02.2017 um 13:05 schrieb Dmitry Shachnev:
>> On Wed, Feb 15, 2017 at 12:28:11PM +0100, Kevin Kofler wrote:
>>> I would kindly request you to at least use tar.xz (rather than tar.gz) for
>>> the tarballs. (What you use as the Windows format is something you need to
>>> sort out with the Windows people.) The fact that tar.gz is still the most
>>> downloaded is probably mostly out of habit, or maybe your download site is
>>> directing to them by default (which ought to be fixed anyway, even if you
>>> were to keep both). tar.gz has no advantage over tar.xz, it is just a lot
>>> larger. Switching to the tar.gz tarballs (from the tar.xz tarballs that are
>>> currently used) would increase the size of distributions' source packages
>>> (source RPM etc.) significantly.
>
> If distros care about size they can re-compress.
> Well, and to my experience they do.
>
>>> It is sad that the legacy gzip compression is living a renaissance due to
>>> automatic tarball exports from GitHub and the like producing only that
>>> format. It should finally be retired now that there are algorithms that are
>>> just as open and that compress significantly better. At least for projects
>>> like Qt that produce their own tarballs and are already able to produce xz-
>>> compressed ones, I see no reason whatsoever to switch back to the obsolete
>>> gzip.
>>
>> +1, please leave tar.xz instead of tar.gz.
>>
>> Users of all modern UNIX-like systems are able to decompress tar.xz, so .gz
>> has really no advantage over .xz.
>
> That's a somewhat limited point of view. Yes, xz archives are slightly
> smaller, but to be honest: In the days of 4K video streaming saving
> 100MiB of download size doesn't seem as important as it was.
>
> The actual value of gzip and the reason for its return seems to be its
> significantly lower CPU usage. This is useful to reduce server load on
> heavily utilized services like github. This is useful to reduce
> development roundtrip cycles.
I think nobody here would challenge this statement.
But it has nothing to do with src archives that are compressed once and
decompressed thousands of times.
>
> Just to illustrate this I've collected some numbers on my Thinkpad:
>
> Command | Average | Savings | Archive | Savings
> | CPU time | p. build | size | @100MBit
> ====================================================
> gzip | 00:44.19 | 09:54.58 | 469 MiB | 00:00.00
> gzip -9 | 01:55.58 | 08:43.18 | 465 MiB | 00:00.32
> xz | 08:46.16 | 01:52.60 | 354 MiB | 00:09.20
> xz -9 | 10:38.77 | 00:00.00 | 333 MiB | 00:10.88
You should use pxz instead of xz for fair results.
>
> Is it worth to spend 10 additional minutes per CI cycle just to save our
> users a very few seconds of download time?
>
> Looking at the problems behind providing an official 5.8.1. Looking at
> my personal experience with slow CI systems I clearly vote for speeding
> up the Q/A process and sticking with .zip and .tar.gz.
>
> Besides: Does it really make sense to fully test the expensive .xz and
> .7z builds? In my opinion it would be fully sufficient to only give the
> inexpensive .zip and .gz archives full test coverage. At least the .xz
> could be generated on the fly after uploading to the web server by
> simply decompression:
>
> gunzip < qt-sources.tar.gz | xz -9 > qt-sources.tar.xz
>
> Ciao,
> Mathias
> _______________________________________________
> Development mailing list
> Development at qt-project.org
> http://lists.qt-project.org/mailman/listinfo/development
--
Regards,
Konstantin
More information about the Development
mailing list