1
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
4
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
5
<title>Documentation [Universal Encoding Detector]</title>
6
<link rel="stylesheet" href="css/chardet.css" type="text/css">
7
<link rev="made" href="mailto:mark@diveintomark.org">
8
<meta name="generator" content="DocBook XSL Stylesheets V1.65.1">
9
<meta name="keywords" content="character, set, encoding, detection, Python, XML, feed">
10
<link rel="start" href="index.html" title="Documentation">
11
<link rel="next" href="faq.html" title="Frequently asked questions">
13
<body id="chardet-feedparser-org" class="docs">
14
<div class="z" id="intro"><div class="sectionInner"><div class="sectionInner2">
15
<div class="s" id="pageHeader">
16
<h1><a href="/">Universal Encoding Detector</a></h1>
17
<p>Character encoding auto-detection in Python. As smart as your browser. Open source.</p>
19
<div class="s" id="quickSummary"><ul>
21
<a href="http://chardet.feedparser.org/download/">Download</a> ·</li>
23
<a href="index.html">Documentation</a> ·</li>
24
<li class="li3"><a href="faq.html" title="Frequently Asked Questions">FAQ</a></li>
27
<div id="main"><div id="mainInner">
28
<p id="breadcrumb">You are here: <span class="thispage">Documentation</span></p>
29
<div class="article" lang="en">
30
<div class="titlepage">
36
<span class="section"><a href="faq.html">Frequently asked questions</a></span><ul>
37
<li><span class="section"><a href="faq.html#faq.intro">What is character encoding?</a></span></li>
38
<li><span class="section"><a href="faq.html#faq.what">What is character encoding auto-detection?</a></span></li>
39
<li><span class="section"><a href="faq.html#faq.impossible">Isn't that impossible?</a></span></li>
40
<li><span class="section"><a href="faq.html#faq.who">Who wrote this detection algorithm?</a></span></li>
41
<li><span class="section"><a href="faq.html#faq.yippie">Yippie! Screw the standards, I'll just auto-detect everything!</a></span></li>
42
<li><span class="section"><a href="faq.html#faq.why">Why bother with auto-detection if it's slow, inaccurate, and non-standard?</a></span></li>
45
<li><span class="section"><a href="supported-encodings.html">Supported encodings</a></span></li>
47
<span class="section"><a href="usage.html">Usage</a></span><ul>
48
<li><span class="section"><a href="usage.html#usage.basic">Basic usage</a></span></li>
49
<li><span class="section"><a href="usage.html#usage.advanced">Advanced usage</a></span></li>
53
<span class="section"><a href="how-it-works.html">How it works</a></span><ul>
54
<li><span class="section"><a href="how-it-works.html#how.bom">UTF-n with a BOM</a></span></li>
55
<li><span class="section"><a href="how-it-works.html#how.esc">Escaped encodings</a></span></li>
56
<li><span class="section"><a href="how-it-works.html#how.mb">Multi-byte encodings</a></span></li>
57
<li><span class="section"><a href="how-it-works.html#how.sb">Single-byte encodings</a></span></li>
58
<li><span class="section"><a href="how-it-works.html#how.windows1252">windows-1252</a></span></li>
61
<li><span class="section"><a href="history.html">Revision history</a></span></li>
62
<li><span class="appendix"><a href="license.html">Terms of use</a></span></li>
65
<div class="footernavigation">
66
<div style="float: left"></div>
67
<div style="text-align: right">
68
<a class="NavigationArrow" href="faq.html">Frequently asked questions</a> →</div>
71
<div id="footer"><p class="copyright">Copyright © 2006, 2007, 2008 Mark Pilgrim · <a href="mailto:mark@diveintomark.org">mark@diveintomark.org</a> · <a href="license.html">Terms of use</a></p></div>
1
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
4
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
5
<title>Documentation [Universal Encoding Detector]</title>
6
<link rel="stylesheet" href="css/chardet.css" type="text/css">
7
<link rev="made" href="mailto:mark@diveintomark.org">
8
<meta name="generator" content="DocBook XSL Stylesheets V1.65.1">
9
<meta name="keywords" content="character, set, encoding, detection, Python, XML, feed">
10
<link rel="start" href="index.html" title="Documentation">
11
<link rel="next" href="faq.html" title="Frequently asked questions">
13
<body id="chardet-feedparser-org" class="docs">
14
<div class="z" id="intro"><div class="sectionInner"><div class="sectionInner2">
15
<div class="s" id="pageHeader">
16
<h1><a href="/">Universal Encoding Detector</a></h1>
17
<p>Character encoding auto-detection in Python. As smart as your browser. Open source.</p>
19
<div class="s" id="quickSummary"><ul>
21
<a href="http://chardet.feedparser.org/download/">Download</a> ·</li>
23
<a href="index.html">Documentation</a> ·</li>
24
<li class="li3"><a href="faq.html" title="Frequently Asked Questions">FAQ</a></li>
27
<div id="main"><div id="mainInner">
28
<p id="breadcrumb">You are here: <span class="thispage">Documentation</span></p>
29
<div class="article" lang="en">
30
<div class="titlepage">
36
<span class="section"><a href="faq.html">Frequently asked questions</a></span><ul>
37
<li><span class="section"><a href="faq.html#faq.intro">What is character encoding?</a></span></li>
38
<li><span class="section"><a href="faq.html#faq.what">What is character encoding auto-detection?</a></span></li>
39
<li><span class="section"><a href="faq.html#faq.impossible">Isn’t that impossible?</a></span></li>
40
<li><span class="section"><a href="faq.html#faq.who">Who wrote this detection algorithm?</a></span></li>
41
<li><span class="section"><a href="faq.html#faq.yippie">Yippie! Screw the standards, I’ll just auto-detect everything!</a></span></li>
42
<li><span class="section"><a href="faq.html#faq.why">Why bother with auto-detection if it’s slow, inaccurate, and non-standard?</a></span></li>
45
<li><span class="section"><a href="supported-encodings.html">Supported encodings</a></span></li>
47
<span class="section"><a href="usage.html">Usage</a></span><ul>
48
<li><span class="section"><a href="usage.html#usage.basic">Basic usage</a></span></li>
49
<li><span class="section"><a href="usage.html#usage.advanced">Advanced usage</a></span></li>
53
<span class="section"><a href="how-it-works.html">How it works</a></span><ul>
54
<li><span class="section"><a href="how-it-works.html#how.bom">UTF-n with a BOM</a></span></li>
55
<li><span class="section"><a href="how-it-works.html#how.esc">Escaped encodings</a></span></li>
56
<li><span class="section"><a href="how-it-works.html#how.mb">Multi-byte encodings</a></span></li>
57
<li><span class="section"><a href="how-it-works.html#how.sb">Single-byte encodings</a></span></li>
58
<li><span class="section"><a href="how-it-works.html#how.windows1252">windows-1252</a></span></li>
61
<li><span class="section"><a href="history.html">Revision history</a></span></li>
62
<li><span class="appendix"><a href="license.html">Terms of use</a></span></li>
65
<div class="footernavigation">
66
<div style="float: left"></div>
67
<div style="text-align: right">
68
<a class="NavigationArrow" href="faq.html">Frequently asked questions</a> →</div>
71
<div id="footer"><p class="copyright">Copyright © 2006, 2007, 2008 Mark Pilgrim · <a href="mailto:mark@diveintomark.org">mark@diveintomark.org</a> · <a href="license.html">Terms of use</a></p></div>