[Development] The life of a file name and other possibly mal-encoded strings on non-Windows systems

Thiago Macieira thiago.macieira at intel.com
Thu Oct 9 10:18:17 CEST 2014


On Thursday 09 October 2014 09:55:36 Julien Blanc wrote:
> On 09/10/2014 09:27, Thiago Macieira wrote:
> > The only one that poses trouble are ISO-9660 CD-ROMs that have Rock Ridge
> > extensions for Unix attributes and longer file names. Do people still have
> > CD drives?
> 
> People also have zip files, which unfortunately may have various
> encoding in them, since the only normative encoding for zip files is cp437…

And I've said time and again that the bug lies with the unzipping application. 
Copying CP437 encoded names is nonsense. Besides, CP850 is the best.

> I have a bunch of such zip files, that :
> - cannot be extracted with gui tools
> - will result in bad filenames (including invalid utf-8) when extracted
> using « stupid » command line tools.
> 
> IMHO saying « the problem does not exist » is not a good answer, because
> if it really didn’t this issue would never have been raised. The
> questions to answer are :
> - is it worth breaking lot of code ? (because it will : a good solution
> needs a complete refactor of qt io code, just providing a QFilePath
> class will not be enough)
> - will it be ready before c++ provides a core solution ?
> - is there someone willing to do it ?

I didn't say the problem doesn't exist. It exists.

I'm saying it's smaller than what people are making it to be in this thread.

-- 
Thiago Macieira - thiago.macieira (AT) intel.com
  Software Architect - Intel Open Source Technology Center




More information about the Development mailing list