~ubuntu-branches/ubuntu/wily/smplayer/wily

Viewing changes to zlib-1.2.6/doc/txtvsbin.txt

Committer: Package Import Robot
Author(s): Maia Kozheva, Alessio Treglia
Date: 2012-04-14 12:01:57 UTC
mfrom: (1.1.13)
mto: (20.2.1 sid)
mto: This revision was merged to the branch mainline in revision 23.
Revision ID: package-import@ubuntu.com-20120414120157-mndwobcslgisomso

http://bugs.debian.org/638279

[ Maia Kozheva ]
* New upstream release:
  - Changes since 0.7.1:
    + A toolbar editor has been added. Now it's possible to select the
      buttons and controls that you want to appear in the toolbars.
    + New video filters: gradfun, blur and sharpen.
    + Now it's possible to change the GUI (default, mini, mpc) at runtime,
      no restart required.
    + sub files from opensubtitles should work again.
    + (Youtube) Recognize short urls (like this one:
      http://y2u.be/F5OcZBVPwOA)
    + Better support for chapters in video files.
    + Bug fix: remote m3u files work from the favorites menu or command line.
    + Internal changes in the single instance option (switch to
      QtSingleApplication).
  - Fixes since 0.7.0:
    + SMPlayer took more than 10 seconds to show when running for the very
      first time.
    + The links to download subtitles from Opensubtitles were wrong.
    + SMPlayer crashed in the favorite editor when trying to select a file
      if the KDE open dialog was used.
  - Changes since 0.7.0:
    + By default the screenshots are saved in the user's pictures folder
      instead of the SMPlayer's config folder.
    + Now it's possible to change the opensubtitles server.
    + Youtube: seeking is slow with flv videos, so now flv videos have the
      lowest priority.
    + Youtube: now it's possible to search and download videos from youtube.
      This is provided by an external application (on Linux you have to
      install a separate package: smtube).
* debian/copyright:
  - Rewrite according to DEP-5 specification.
* debian/control:
  - Depend on mplayer2 | mplayer. (Closes: #638279)
  - Update Standards-Version to 3.9.3.
* Remove debian/patches/handle_local_urls.diff, merged upstream.

[ Alessio Treglia ]
* Mention smplayer is also a front-end for MPlayer2.
* Fix small typo in the description.

files added:
docs/pt

docs/pt/faq.html

docs/pt/gpl.html

getrev.cmd

os2/smplayer.ico

os2/smplayer_closed.ICO

os2/smplayer_open.ICO

setup/smplayer.nsi

src/chapters.cpp

src/chapters.h

src/editabletoolbar.cpp

src/editabletoolbar.h

src/filehash.cpp

src/filehash.h

src/findsubtitles/fixsubs.cpp

src/findsubtitles/fixsubs.h

src/icons-png/tubebrowser.png

src/myapplication.cpp

src/myapplication.h

src/qtsingleapplication

src/qtsingleapplication/QtLockedFile

src/qtsingleapplication/QtSingleApplication

src/qtsingleapplication/qtlocalpeer.cpp

src/qtsingleapplication/qtlocalpeer.h

src/qtsingleapplication/qtlockedfile.cpp

src/qtsingleapplication/qtlockedfile.h

src/qtsingleapplication/qtlockedfile_unix.cpp

src/qtsingleapplication/qtlockedfile_win.cpp

src/qtsingleapplication/qtsingleapplication.cpp

src/qtsingleapplication/qtsingleapplication.h

src/qtsingleapplication/qtsingleapplication.pri

src/qtsingleapplication/qtsinglecoreapplication.cpp

src/qtsingleapplication/qtsinglecoreapplication.h

src/qtsingleapplication/qtsinglecoreapplication.pri

src/simplehttp.cpp

src/simplehttp.h

src/toolbareditor.ui

zlib-1.2.6

zlib-1.2.6/CMakeLists.txt

zlib-1.2.6/ChangeLog

zlib-1.2.6/FAQ

zlib-1.2.6/INDEX

zlib-1.2.6/Makefile

zlib-1.2.6/Makefile.in

zlib-1.2.6/README

zlib-1.2.6/adler32.c

zlib-1.2.6/amiga

zlib-1.2.6/amiga/Makefile.pup

zlib-1.2.6/amiga/Makefile.sas

zlib-1.2.6/as400

zlib-1.2.6/as400/bndsrc

zlib-1.2.6/as400/compile.clp

zlib-1.2.6/as400/readme.txt

zlib-1.2.6/as400/zlib.inc

zlib-1.2.6/compress.c

zlib-1.2.6/configure

zlib-1.2.6/contrib

zlib-1.2.6/contrib/README.contrib

zlib-1.2.6/contrib/ada

zlib-1.2.6/contrib/ada/buffer_demo.adb

zlib-1.2.6/contrib/ada/mtest.adb

zlib-1.2.6/contrib/ada/read.adb

zlib-1.2.6/contrib/ada/readme.txt

zlib-1.2.6/contrib/ada/test.adb

zlib-1.2.6/contrib/ada/zlib-streams.adb

zlib-1.2.6/contrib/ada/zlib-streams.ads

zlib-1.2.6/contrib/ada/zlib-thin.adb

zlib-1.2.6/contrib/ada/zlib-thin.ads

zlib-1.2.6/contrib/ada/zlib.adb

zlib-1.2.6/contrib/ada/zlib.ads

zlib-1.2.6/contrib/ada/zlib.gpr

zlib-1.2.6/contrib/amd64

zlib-1.2.6/contrib/amd64/amd64-match.S

zlib-1.2.6/contrib/asm686

zlib-1.2.6/contrib/asm686/README.686

zlib-1.2.6/contrib/asm686/match.S

zlib-1.2.6/contrib/blast

zlib-1.2.6/contrib/blast/Makefile

zlib-1.2.6/contrib/blast/README

zlib-1.2.6/contrib/blast/blast.c

zlib-1.2.6/contrib/blast/blast.h

zlib-1.2.6/contrib/blast/test.pk

zlib-1.2.6/contrib/blast/test.txt

zlib-1.2.6/contrib/delphi

zlib-1.2.6/contrib/delphi/ZLib.pas

zlib-1.2.6/contrib/delphi/ZLibConst.pas

zlib-1.2.6/contrib/delphi/readme.txt

zlib-1.2.6/contrib/delphi/zlibd32.mak

zlib-1.2.6/contrib/dotzlib

zlib-1.2.6/contrib/dotzlib/DotZLib

zlib-1.2.6/contrib/dotzlib/DotZLib.build

zlib-1.2.6/contrib/dotzlib/DotZLib.chm

zlib-1.2.6/contrib/dotzlib/DotZLib.sln

zlib-1.2.6/contrib/dotzlib/DotZLib/AssemblyInfo.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/ChecksumImpl.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/CircularBuffer.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/CodecBase.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/Deflater.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/DotZLib.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/DotZLib.csproj

zlib-1.2.6/contrib/dotzlib/DotZLib/GZipStream.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/Inflater.cs

zlib-1.2.6/contrib/dotzlib/DotZLib/UnitTests.cs

zlib-1.2.6/contrib/dotzlib/LICENSE_1_0.txt

zlib-1.2.6/contrib/dotzlib/readme.txt

zlib-1.2.6/contrib/gcc_gvmat64

zlib-1.2.6/contrib/gcc_gvmat64/gvmat64.S

zlib-1.2.6/contrib/infback9

zlib-1.2.6/contrib/infback9/README

zlib-1.2.6/contrib/infback9/infback9.c

zlib-1.2.6/contrib/infback9/infback9.h

zlib-1.2.6/contrib/infback9/inffix9.h

zlib-1.2.6/contrib/infback9/inflate9.h

zlib-1.2.6/contrib/infback9/inftree9.c

zlib-1.2.6/contrib/infback9/inftree9.h

zlib-1.2.6/contrib/inflate86

zlib-1.2.6/contrib/inflate86/inffas86.c

zlib-1.2.6/contrib/inflate86/inffast.S

zlib-1.2.6/contrib/iostream

zlib-1.2.6/contrib/iostream/test.cpp

zlib-1.2.6/contrib/iostream/zfstream.cpp

zlib-1.2.6/contrib/iostream/zfstream.h

zlib-1.2.6/contrib/iostream2

zlib-1.2.6/contrib/iostream2/zstream.h

zlib-1.2.6/contrib/iostream2/zstream_test.cpp

zlib-1.2.6/contrib/iostream3

zlib-1.2.6/contrib/iostream3/README

zlib-1.2.6/contrib/iostream3/TODO

zlib-1.2.6/contrib/iostream3/test.cc

zlib-1.2.6/contrib/iostream3/zfstream.cc

zlib-1.2.6/contrib/iostream3/zfstream.h

zlib-1.2.6/contrib/masmx64

zlib-1.2.6/contrib/masmx64/bld_ml64.bat

zlib-1.2.6/contrib/masmx64/gvmat64.asm

zlib-1.2.6/contrib/masmx64/inffas8664.c

zlib-1.2.6/contrib/masmx64/inffasx64.asm

zlib-1.2.6/contrib/masmx64/readme.txt

zlib-1.2.6/contrib/masmx86

zlib-1.2.6/contrib/masmx86/bld_ml32.bat

zlib-1.2.6/contrib/masmx86/inffas32.asm

zlib-1.2.6/contrib/masmx86/match686.asm

zlib-1.2.6/contrib/masmx86/readme.txt

zlib-1.2.6/contrib/minizip

zlib-1.2.6/contrib/minizip/Makefile

zlib-1.2.6/contrib/minizip/Makefile.am

zlib-1.2.6/contrib/minizip/MiniZip64_Changes.txt

zlib-1.2.6/contrib/minizip/MiniZip64_info.txt

zlib-1.2.6/contrib/minizip/configure.ac

zlib-1.2.6/contrib/minizip/crypt.h

zlib-1.2.6/contrib/minizip/ioapi.c

zlib-1.2.6/contrib/minizip/ioapi.h

zlib-1.2.6/contrib/minizip/iowin32.c

zlib-1.2.6/contrib/minizip/iowin32.h

zlib-1.2.6/contrib/minizip/make_vms.com

zlib-1.2.6/contrib/minizip/miniunz.c

zlib-1.2.6/contrib/minizip/minizip.c

zlib-1.2.6/contrib/minizip/minizip.pc.in

zlib-1.2.6/contrib/minizip/mztools.c

zlib-1.2.6/contrib/minizip/mztools.h

zlib-1.2.6/contrib/minizip/unzip.c

zlib-1.2.6/contrib/minizip/unzip.h

zlib-1.2.6/contrib/minizip/zip.c

zlib-1.2.6/contrib/minizip/zip.h

zlib-1.2.6/contrib/pascal

zlib-1.2.6/contrib/pascal/example.pas

zlib-1.2.6/contrib/pascal/readme.txt

zlib-1.2.6/contrib/pascal/zlibd32.mak

zlib-1.2.6/contrib/pascal/zlibpas.pas

zlib-1.2.6/contrib/puff

zlib-1.2.6/contrib/puff/Makefile

zlib-1.2.6/contrib/puff/README

zlib-1.2.6/contrib/puff/puff.c

zlib-1.2.6/contrib/puff/puff.h

zlib-1.2.6/contrib/puff/pufftest.c

zlib-1.2.6/contrib/puff/zeros.raw

zlib-1.2.6/contrib/testzlib

zlib-1.2.6/contrib/testzlib/testzlib.c

zlib-1.2.6/contrib/testzlib/testzlib.txt

zlib-1.2.6/contrib/untgz

zlib-1.2.6/contrib/untgz/Makefile

zlib-1.2.6/contrib/untgz/Makefile.msc

zlib-1.2.6/contrib/untgz/untgz.c

zlib-1.2.6/contrib/vstudio

zlib-1.2.6/contrib/vstudio/readme.txt

zlib-1.2.6/contrib/vstudio/vc10

zlib-1.2.6/contrib/vstudio/vc10/miniunz.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/miniunz.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/miniunz.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc10/minizip.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/minizip.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/minizip.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc10/testzlib.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/testzlib.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/testzlib.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc10/testzlibdll.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/testzlibdll.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/testzlibdll.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc10/zlib.rc

zlib-1.2.6/contrib/vstudio/vc10/zlibstat.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/zlibstat.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/zlibstat.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc10/zlibvc.def

zlib-1.2.6/contrib/vstudio/vc10/zlibvc.sln

zlib-1.2.6/contrib/vstudio/vc10/zlibvc.vcxproj

zlib-1.2.6/contrib/vstudio/vc10/zlibvc.vcxproj.filters

zlib-1.2.6/contrib/vstudio/vc10/zlibvc.vcxproj.user

zlib-1.2.6/contrib/vstudio/vc9

zlib-1.2.6/contrib/vstudio/vc9/miniunz.vcproj

zlib-1.2.6/contrib/vstudio/vc9/minizip.vcproj

zlib-1.2.6/contrib/vstudio/vc9/testzlib.vcproj

zlib-1.2.6/contrib/vstudio/vc9/testzlibdll.vcproj

zlib-1.2.6/contrib/vstudio/vc9/zlib.rc

zlib-1.2.6/contrib/vstudio/vc9/zlibstat.vcproj

zlib-1.2.6/contrib/vstudio/vc9/zlibvc.def

zlib-1.2.6/contrib/vstudio/vc9/zlibvc.sln

zlib-1.2.6/contrib/vstudio/vc9/zlibvc.vcproj

zlib-1.2.6/crc32.c

zlib-1.2.6/crc32.h

zlib-1.2.6/deflate.c

zlib-1.2.6/deflate.h

zlib-1.2.6/doc

zlib-1.2.6/doc/algorithm.txt

zlib-1.2.6/doc/rfc1950.txt

zlib-1.2.6/doc/rfc1951.txt

zlib-1.2.6/doc/rfc1952.txt

zlib-1.2.6/doc/txtvsbin.txt

zlib-1.2.6/examples

zlib-1.2.6/examples/README.examples

zlib-1.2.6/examples/enough.c

zlib-1.2.6/examples/fitblk.c

zlib-1.2.6/examples/gun.c

zlib-1.2.6/examples/gzappend.c

zlib-1.2.6/examples/gzjoin.c

zlib-1.2.6/examples/gzlog.c

zlib-1.2.6/examples/gzlog.h

zlib-1.2.6/examples/zlib_how.html

zlib-1.2.6/examples/zpipe.c

zlib-1.2.6/examples/zran.c

zlib-1.2.6/gzclose.c

zlib-1.2.6/gzguts.h

zlib-1.2.6/gzlib.c

zlib-1.2.6/gzread.c

zlib-1.2.6/gzwrite.c

zlib-1.2.6/infback.c

zlib-1.2.6/inffast.c

zlib-1.2.6/inffast.h

zlib-1.2.6/inffixed.h

zlib-1.2.6/inflate.c

zlib-1.2.6/inflate.h

zlib-1.2.6/inftrees.c

zlib-1.2.6/inftrees.h

zlib-1.2.6/make_vms.com

zlib-1.2.6/msdos

zlib-1.2.6/msdos/Makefile.bor

zlib-1.2.6/msdos/Makefile.dj2

zlib-1.2.6/msdos/Makefile.emx

zlib-1.2.6/msdos/Makefile.msc

zlib-1.2.6/msdos/Makefile.tc

zlib-1.2.6/nintendods

zlib-1.2.6/nintendods/Makefile

zlib-1.2.6/nintendods/README

zlib-1.2.6/old

zlib-1.2.6/old/Makefile.riscos

zlib-1.2.6/old/README

zlib-1.2.6/old/descrip.mms

zlib-1.2.6/old/os2

zlib-1.2.6/old/os2/Makefile.os2

zlib-1.2.6/old/os2/zlib.def

zlib-1.2.6/old/visual-basic.txt

zlib-1.2.6/qnx

zlib-1.2.6/qnx/package.qpg

zlib-1.2.6/test

zlib-1.2.6/test/example.c

zlib-1.2.6/test/infcover.c

zlib-1.2.6/test/minigzip.c

zlib-1.2.6/treebuild.xml

zlib-1.2.6/trees.c

zlib-1.2.6/trees.h

zlib-1.2.6/uncompr.c

zlib-1.2.6/watcom

zlib-1.2.6/watcom/watcom_f.mak

zlib-1.2.6/watcom/watcom_l.mak

zlib-1.2.6/win32

zlib-1.2.6/win32/DLL_FAQ.txt

zlib-1.2.6/win32/Makefile.bor

zlib-1.2.6/win32/Makefile.emx

zlib-1.2.6/win32/Makefile.gcc

zlib-1.2.6/win32/Makefile.msc

zlib-1.2.6/win32/README-WIN32.txt

zlib-1.2.6/win32/VisualC.txt

zlib-1.2.6/win32/zlib.def

zlib-1.2.6/win32/zlib1.rc

zlib-1.2.6/zconf.h

zlib-1.2.6/zconf.h.cmakein

zlib-1.2.6/zconf.h.in

zlib-1.2.6/zlib.3

zlib-1.2.6/zlib.3.pdf

zlib-1.2.6/zlib.h

zlib-1.2.6/zlib.map

zlib-1.2.6/zlib.pc.in

zlib-1.2.6/zlib2ansi

zlib-1.2.6/zutil.c

zlib-1.2.6/zutil.h

files removed:
.pc

.pc/.version

.pc/applied-patches

.pc/handle_local_urls.diff

.pc/handle_local_urls.diff/src

.pc/handle_local_urls.diff/src/smplayer.cpp

Audio_equalizer.txt

Configuring_the_toolbars.txt

debian/patches

debian/patches/handle_local_urls.diff

debian/patches/series

getrev

getrev/compile.bat

getrev/getrev.pro

getrev/main.cpp

setup/smplayer.win32.nsi

src/findsubtitles/simplehttp.cpp

src/findsubtitles/simplehttp.h

src/myclient.cpp

src/myclient.h

src/myserver.cpp

src/myserver.h

src/qtlockedfile

src/qtlockedfile/QtLockedFile

src/qtlockedfile/qtlockedfile.cpp

src/qtlockedfile/qtlockedfile.h

src/qtlockedfile/qtlockedfile.pri

src/qtlockedfile/qtlockedfile_unix.cpp

src/qtlockedfile/qtlockedfile_win.cpp

files modified:
Changelog

Not_so_obvious_things.txt

Readme.txt

Release_notes.txt

build_os2.cmd

clean_windows.bat

compile_windows.bat

compile_windows_portable.bat

create_deb.sh

debian-rvm/changelog-orig

debian-rvm/control

debian-rvm/docs

debian/changelog

debian/control

debian/copyright

docs/en/faq.html

docs/ja/faq.html

get_svn_revision.sh

os2/liesmich.os2

os2/lisezmoi.os2

os2/readme.os2

setup/scripts/install_smplayer.cmd

setup/scripts/make_pkgs.cmd

setup/translations/basque.nsh

setup/translations/catalan.nsh

setup/translations/croatian.nsh

setup/translations/czech.nsh

setup/translations/danish.nsh

setup/translations/dutch.nsh

setup/translations/english.nsh

setup/translations/finnish.nsh

setup/translations/french.nsh

setup/translations/german.nsh

setup/translations/hebrew.nsh

setup/translations/hungarian.nsh

setup/translations/italian.nsh

setup/translations/japanese.nsh

setup/translations/korean.nsh

setup/translations/norwegian.nsh

setup/translations/polish.nsh

setup/translations/portuguese.nsh

setup/translations/russian.nsh

setup/translations/simpchinese.nsh

setup/translations/slovak.nsh

setup/translations/slovenian.nsh

setup/translations/spanish.nsh

setup/translations/tradchinese.nsh

smplayer.desktop

smplayer.spec

smplayer_enqueue.desktop

src/about.cpp

src/basegui.cpp

src/basegui.h

src/baseguiplus.cpp

src/baseguiplus.h

src/clhelp.cpp

src/config.h

src/core.cpp

src/core.h

src/defaultgui.cpp

src/defaultgui.h

src/favoriteeditor.cpp

src/filechooser.cpp

src/filesettingshash.cpp

src/filters.cpp

src/findsubtitles/filedownloader/filedownloader.cpp

src/findsubtitles/findsubtitles.pro

src/findsubtitles/findsubtitlesconfigdialog.cpp

src/findsubtitles/findsubtitlesconfigdialog.h

src/findsubtitles/findsubtitlesconfigdialog.ui

src/findsubtitles/findsubtitleswindow.cpp

src/findsubtitles/findsubtitleswindow.h

src/findsubtitles/osparser.cpp

src/findsubtitles/osparser.h

src/floatingwidget.cpp

src/floatingwidget.h

src/helper.cpp

src/icons.qrc

src/languages.cpp

src/languages.h

src/main.cpp

src/mediadata.cpp

src/mediadata.h

src/mediasettings.cpp

src/mediasettings.h

src/minigui.cpp

src/minigui.h

src/mpcgui/mpcgui.cpp

src/mpcgui/mpcgui.h

src/mplayerprocess.cpp

src/mplayerprocess.h

src/paths.cpp

src/prefadvanced.cpp

src/prefadvanced.h

src/prefadvanced.ui

src/preferences.cpp

src/preferences.h

src/prefinterface.cpp

src/prefinterface.h

src/prefinterface.ui

src/prefperformance.cpp

src/prefperformance.h

src/selectcolorbutton.cpp

src/selectcolorbutton.h

src/shortcuts/default.keys

src/smplayer.cpp

src/smplayer.h

src/smplayer.pro

src/smplayer.rc

src/smplayer_os2.rc

src/toolbareditor.cpp

src/toolbareditor.h

src/translations/smplayer_ar_SY.ts

src/translations/smplayer_bg.ts

src/translations/smplayer_ca.ts

src/translations/smplayer_cs.ts

src/translations/smplayer_da.ts

src/translations/smplayer_de.ts

src/translations/smplayer_el_GR.ts

src/translations/smplayer_en_US.ts

src/translations/smplayer_es.ts

src/translations/smplayer_et.ts

src/translations/smplayer_eu.ts

src/translations/smplayer_fi.ts

src/translations/smplayer_fr.ts

src/translations/smplayer_gl.ts

src/translations/smplayer_hr.ts

src/translations/smplayer_hu.ts

src/translations/smplayer_it.ts

src/translations/smplayer_ja.ts

src/translations/smplayer_ka.ts

src/translations/smplayer_ko.ts

src/translations/smplayer_ku.ts

src/translations/smplayer_lt.ts

src/translations/smplayer_mk.ts

src/translations/smplayer_nl.ts

src/translations/smplayer_pl.ts

src/translations/smplayer_pt.ts

src/translations/smplayer_pt_BR.ts

src/translations/smplayer_ro_RO.ts

src/translations/smplayer_ru_RU.ts

src/translations/smplayer_sk.ts

src/translations/smplayer_sl_SI.ts

src/translations/smplayer_sr.ts

src/translations/smplayer_sv.ts

src/translations/smplayer_tr.ts

src/translations/smplayer_uk_UA.ts

src/translations/smplayer_vi_VN.ts

src/translations/smplayer_zh_CN.ts

src/translations/smplayer_zh_TW.ts

src/tvlist.cpp

src/version.cpp

src/youtube/retrieveyoutubeurl.cpp

zlib-1.2.6/doc/txtvsbin.txt

A Fast Method for Identifying Plain Text Files
==============================================


Introduction
------------

Given a file coming from an unknown source, it is sometimes desirable
to find out whether the format of that file is plain text. Although
this may appear to be a simple task, a fully accurate detection of the
file type requires heavy-duty semantic analysis of the file contents.
It is, however, possible to obtain satisfactory results by employing
various heuristics.

Previous versions of PKZip and other zip-compatible compression tools
used a crude detection scheme: if more than 80% (4/5) of the bytes
found in a certain buffer are within the range [7..127], the file is
labeled as plain text, otherwise it is labeled as binary. A prominent
limitation of this scheme is the restriction to Latin-based alphabets.
Other alphabets, like Greek, Cyrillic or Asian, make extensive use of
the bytes within the range [128..255], and texts using these alphabets
are most often misidentified by this scheme; in other words, the rate
of false negatives is sometimes too high, which means that the recall
is low. Another weakness of this scheme is a reduced precision, due to
the false positives that may occur when binary files containing large
amounts of textual characters are misidentified as plain text.
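
The ratio test described above can be sketched in a few lines of Python.
This is only an illustration, not code from PKZip or any other tool: the
function name, the handling of an empty buffer and the sample inputs are
assumptions made for this sketch.

```python
def looks_like_text_legacy(buf: bytes, threshold: float = 0.8) -> bool:
    """Ratio heuristic: text if more than `threshold` of the bytes
    in the buffer fall within [7..127]."""
    if not buf:
        return False  # assumption: an empty buffer is not called text
    in_range = sum(1 for b in buf if 7 <= b <= 127)
    return in_range / len(buf) > threshold


# Plain ASCII text passes the test.
latin = b"The quick brown fox"

# "Привет, мир" in KOI8-R: every letter is a byte above 127, so only the
# comma and the space count as textual (a false negative).
cyrillic = "\u041f\u0440\u0438\u0432\u0435\u0442, \u043c\u0438\u0440".encode("koi8_r")

# A binary blob that happens to carry a lot of letters (a false positive).
mostly_letters = b"\x00\x01" + b"A" * 98

for name, buf in [("latin", latin), ("cyrillic", cyrillic),
                  ("mostly_letters", mostly_letters)]:
    print(name, looks_like_text_legacy(buf))
```

The Cyrillic sample shows the low recall and the padded binary blob shows
the reduced precision discussed in the paragraph above.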

In this article we propose a new, simple detection scheme that features
a much increased precision and a near-100% recall. This scheme is
designed to work on ASCII, Unicode and other ASCII-derived alphabets,
and it handles single-byte encodings (ISO-8859, MacRoman, KOI8, etc.)
and variable-sized encodings (ISO-2022, UTF-8, etc.). Wider encodings
(UCS-2/UTF-16 and UCS-4/UTF-32) are not handled, however.


The Algorithm
-------------

The algorithm works by dividing the set of bytecodes [0..255] into three
categories:
- The white list of textual bytecodes:
  9 (TAB), 10 (LF), 13 (CR), 32 (SPACE) to 255.
- The gray list of tolerated bytecodes:
  7 (BEL), 8 (BS), 11 (VT), 12 (FF), 26 (SUB), 27 (ESC).
- The black list of undesired, non-textual bytecodes:
  0 (NUL) to 6, 14 to 25, 28 to 31.

If a file contains at least one byte that belongs to the white list and
no byte that belongs to the black list, then the file is categorized as
plain text; otherwise, it is categorized as binary. (The boundary case,
when the file is empty, automatically falls into the latter category.)
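
As a rough illustration only (not code taken from zlib or any other
tool), the three lists and the decision rule can be sketched in Python.
The names WHITE, GRAY, BLACK and is_plain_text are made up for this
sketch, and the black list is assumed to be every byte value that is on
neither of the other two lists, so 26 (SUB) and 27 (ESC) stay gray.

```python
# White list: TAB, LF, CR and everything from SPACE up to 255.
WHITE = frozenset({9, 10, 13}) | frozenset(range(32, 256))
# Gray list: tolerated, but not enough on their own to call a file text.
GRAY = frozenset({7, 8, 11, 12, 26, 27})
# Black list (assumption): all remaining byte values.
BLACK = frozenset(range(256)) - WHITE - GRAY


def is_plain_text(data: bytes) -> bool:
    """Text if at least one white-listed byte and no black-listed byte."""
    has_white = False
    for b in data:
        if b in BLACK:
            return False      # a single black-listed byte decides "binary"
        if b in WHITE:
            has_white = True  # gray-listed bytes are simply skipped
    return has_white          # empty or gray-only input counts as binary


if __name__ == "__main__":
    for sample in (b"hello\n", b"hello\x00", b"", b"\x07\x1b"):
        print(repr(sample), is_plain_text(sample))
```

The loop stops at the first black-listed byte, so a typical binary file
is rejected after reading only a small prefix.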


Rationale
---------

The idea behind this algorithm relies on two observations.

The first observation is that, although the full range of 7-bit codes
[0..127] is properly specified by the ASCII standard, most control
characters in the range [0..31] are not used in practice. The only
widely-used, almost universally-portable control codes are 9 (TAB),
10 (LF) and 13 (CR). There are a few more control codes that are
recognized on a reduced range of platforms and text viewers/editors:
7 (BEL), 8 (BS), 11 (VT), 12 (FF), 26 (SUB) and 27 (ESC); but these
codes are rarely (if ever) used alone, without being accompanied by
some printable text. Even the newer, portable text formats such as
XML avoid using control characters outside the list mentioned here.

The second observation is that most binary files tend to contain
control characters, especially 0 (NUL). Even though the older text
detection schemes observe the presence of non-ASCII codes from the range
[128..255], the precision rarely has to suffer if this upper range is
labeled as textual, because the files that are genuinely binary tend to
contain both control characters and codes from the upper range. On the
other hand, the upper range needs to be labeled as textual, because it
is used by virtually all ASCII extensions. In particular, this range is
used for encoding non-Latin scripts.

Since there is no counting involved, other than simply observing the
presence or the absence of some byte values, the algorithm produces
consistent results, regardless of which alphabet encoding is being used.
(If counting were involved, it could be possible to obtain different
results on a text encoded, say, using ISO-8859-16 versus UTF-8.)
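
The point about counting can be illustrated with a short Python sketch.
The sample string, the 80% threshold borrowed from the older scheme and
all function names are assumptions chosen for this example; nothing here
is taken from an existing implementation.

```python
TEXTUAL = frozenset({9, 10, 13}) | frozenset(range(32, 256))  # white list
TOLERATED = frozenset({7, 8, 11, 12, 26, 27})                 # gray list


def ratio_in_7_to_127(data: bytes) -> float:
    """Fraction of bytes within [7..127], as a counting scheme would use."""
    return sum(1 for b in data if 7 <= b <= 127) / len(data)


def presence_verdict(data: bytes) -> bool:
    """Presence-based rule: some textual byte, no non-textual byte."""
    seen_textual = False
    for b in data:
        if b in TEXTUAL:
            seen_textual = True
        elif b not in TOLERATED:
            return False
    return seen_textual


# "Bună ziua, țări!": 13 ASCII characters and 3 non-ASCII letters.
sample = "Bun\u0103 ziua, \u021b\u0103ri!"
latin10 = sample.encode("iso8859_16")  # one byte per character
utf8 = sample.encode("utf-8")          # two bytes per non-ASCII letter

for name, data in (("ISO-8859-16", latin10), ("UTF-8", utf8)):
    ratio = ratio_in_7_to_127(data)
    print(name, len(data), round(ratio, 3), ratio > 0.8, presence_verdict(data))
```

The same text yields 13/16 bytes in [7..127] under ISO-8859-16 and 13/19
under UTF-8, so a counting rule with an 80% threshold gives two different
answers, while the presence-based rule gives the same answer for both.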

There is an extra category of plain text files that are "polluted" with
one or more black-listed codes, either by mistake or by peculiar design
considerations. In such cases, a scheme that tolerates a small fraction
of black-listed codes would provide an increased recall (i.e. more true
positives). This, however, incurs a reduced precision overall, since
false positives are more likely to appear in binary files that contain
large chunks of textual data. Furthermore, "polluted" plain text should
be regarded as binary by general-purpose text detection schemes, because
general-purpose text processing algorithms might not be applicable.
Under this premise, it is safe to say that our detection method provides
a near-100% recall.

Experiments have been run on many files coming from various platforms
and applications. We tried plain text files, system logs, source code,
formatted office documents, compiled object code, etc. The results
confirm the optimistic assumptions about the capabilities of this
algorithm.


--
Cosmin Truta
Last updated: 2006-May-28
