[Qt-interest] Heuristics for determining text codec of file

Robert Hairgrove evorgriahr at hispeed.ch
Tue Jan 11 12:12:41 CET 2011


On Tue, 2011-01-11 at 15:05 +0900, suzuki toshiya wrote:
> Hi,
> 
> Although I've never tried to build or use it, I have heard that
> the character encoding detection of Mozilla can be built
> as a standalone module:
> 
> Very old description:
> http://www.mozilla.org/projects/intl/detectorsrc.html
> 
> source code:
> http://hg.mozilla.org/mozilla-central/file/3ac595ba8c43/extensions/universalchardet
> 
> If you think Mozilla's detection is sufficient for you,
> please try.

On second look, I would have to modify the Mozilla source code in order to
use it like this. Unfortunately, my app will be LGPLed, so it looks like I
will have to roll my own. Besides, as you have pointed out, it is very old
code... and it looks like Cyrillic isn't handled too well.
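
For what it's worth, a first pass at "rolling my own" could look roughly
like the sketch below: check for a byte-order mark, then test whether the
bytes are well-formed UTF-8, and only fall back to statistical guessing if
both fail. This is only a sketch under those assumptions, not what Mozilla's
detector does. The names detectBom, looksLikeUtf8 and guessEncoding are made
up for illustration and are not part of Qt. The UTF-8 check is simplified
(it does not reject every overlong or surrogate sequence), and the hard part
for Cyrillic, telling legacy 8-bit codecs such as KOI8-R and windows-1251
apart, is deliberately left out.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helpers for illustration only; not Qt or Mozilla API.

// Returns the encoding implied by a leading byte-order mark, or "" if none.
std::string detectBom(const std::string &data)
{
    auto b = [&](std::size_t i) { return static_cast<unsigned char>(data[i]); };
    if (data.size() >= 3 && b(0) == 0xEF && b(1) == 0xBB && b(2) == 0xBF)
        return "UTF-8";
    // The UTF-32LE BOM starts with the UTF-16LE BOM, so test UTF-32 first.
    if (data.size() >= 4 && b(0) == 0xFF && b(1) == 0xFE && b(2) == 0 && b(3) == 0)
        return "UTF-32LE";
    if (data.size() >= 4 && b(0) == 0 && b(1) == 0 && b(2) == 0xFE && b(3) == 0xFF)
        return "UTF-32BE";
    if (data.size() >= 2 && b(0) == 0xFF && b(1) == 0xFE)
        return "UTF-16LE";
    if (data.size() >= 2 && b(0) == 0xFE && b(1) == 0xFF)
        return "UTF-16BE";
    return "";
}

// Simplified structural check: valid lead bytes followed by the right
// number of continuation bytes (10xxxxxx). Pure ASCII also passes.
bool looksLikeUtf8(const std::string &data)
{
    const std::size_t n = data.size();
    std::size_t i = 0;
    while (i < n) {
        const unsigned char c = static_cast<unsigned char>(data[i]);
        std::size_t extra = 0;
        if (c < 0x80)                    extra = 0;
        else if (c >= 0xC2 && c <= 0xDF) extra = 1;
        else if (c >= 0xE0 && c <= 0xEF) extra = 2;
        else if (c >= 0xF0 && c <= 0xF4) extra = 3;
        else return false;               // stray continuation or invalid lead
        if (i + extra >= n)
            return false;                // sequence truncated at end of data
        for (std::size_t k = 1; k <= extra; ++k) {
            const unsigned char t = static_cast<unsigned char>(data[i + k]);
            if ((t & 0xC0) != 0x80)
                return false;
        }
        i += extra + 1;
    }
    return true;
}

// BOM first, then the UTF-8 check; "unknown" means a legacy 8-bit codec
// (or binary data) that would need statistical analysis to identify.
std::string guessEncoding(const std::string &data)
{
    const std::string bom = detectBom(data);
    if (!bom.empty())
        return bom;
    return looksLikeUtf8(data) ? "UTF-8" : "unknown";
}
```

In a Qt app the file contents would presumably come from QFile::readAll(),
and the returned name could be handed to QTextCodec::codecForName(). The
"unknown" branch is where letter-frequency heuristics for the Cyrillic
codecs would have to go.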



