← Back to branch summary

~ubuntu-branches/ubuntu/oneiric/lxml/oneiric

~ubuntu-branches/ubuntu/oneiric/lxml/oneiric

« back to all changes in this revision

Viewing changes to doc/elementsoup.txt

Committer: Bazaar Package Importer
Author(s): Matthias Klose
Date: 2009-08-27 09:09:23 UTC
mfrom: (1.3.2 upstream)
Revision ID: james.westby@ubuntu.com-20090827090923-fwhvka191ir73s3x

Tags: 2.2.2-1

http://bugs.debian.org/525961

http://bugs.debian.org/521714

* New upstream version. Closes: #525961.
- Includes html5parser. Closes: #521714.

files added:
IDEAS.txt

buildlibxml.py

doc/html/api/lxml.etree.SerialisationError-class.html

doc/html/api/lxml.etree._XSLTQuotedStringParam-class.html

doc/html/api/lxml.html.html5parser-module.html

doc/html/api/lxml.html.html5parser-pysrc.html

doc/html/api/lxml.html.html5parser.HTMLParser-class.html

doc/html/api/lxml.html.html5parser.XHTMLParser-class.html

doc/html/api/lxml.tests.test_etree.ETreeWriteTestCase-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.ParseAndExtendWorker-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.ParseWorker-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.ReverseWorker-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.RotateWorker-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.SerialiseWorker-class.html

doc/html/api/lxml.tests.test_threading.ThreadPipelineTestCase.Worker-class.html

doc/html/api/lxml.tests.test_xmlschema.ETreeXMLSchemaResolversTestCase-class.html

doc/html/api/lxml.tests.test_xmlschema.ETreeXMLSchemaResolversTestCase.simple_resolver-class.html

doc/html/api/toc-lxml.html.html5parser-module.html

doc/html/changes-2.2.2.html

doc/html/html5parser.html

doc/html/proxies.png

doc/html5parser.txt

doc/pdf

doc/pdf/pubkey.asc

src/lxml/cleanup.pxi

src/lxml/html/_html5builder.py

src/lxml/html/html5parser.py

src/lxml/lxml-version.h

src/lxml/lxml.etree.pxi

src/lxml/tests/test_import.xsd

src/lxml/tests/test_inc.xsd

files removed:
doc/html/api/lxml.tests.test_xmlschema.ETreeXMLSchemaTestCase.simple_resolver-class.html

doc/html/changes-2.1.5.html

files modified:
CHANGES.txt

INSTALL.txt

MANIFEST.in

Makefile

PKG-INFO

TODO.txt

benchmark/bench_etree.py

benchmark/bench_xpath.py

debian/changelog

debian/control

debian/copyright

doc/FAQ.txt

doc/build.txt

doc/docstructure.py

doc/element_classes.txt

doc/elementsoup.txt

doc/html/FAQ.html

doc/html/api.html

doc/html/api/api-objects.txt

doc/html/api/class-tree.html

doc/html/api/deprecated-index.html

doc/html/api/elementtree.ElementTree-module.html

doc/html/api/elementtree.ElementTree-pysrc.html

doc/html/api/elementtree.ElementTree.Element-class.html

doc/html/api/elementtree.ElementTree.ElementTree-class.html

doc/html/api/elementtree.ElementTree.ParseError-class.html

doc/html/api/elementtree.ElementTree.QName-class.html

doc/html/api/elementtree.ElementTree.TreeBuilder-class.html

doc/html/api/elementtree.ElementTree.XMLParser-class.html

doc/html/api/elementtree.ElementTree._IterParseIterator-class.html

doc/html/api/elementtree.ElementTree._SimpleElementPath-class.html

doc/html/api/exceptions.AssertionError-class.html

doc/html/api/help.html

doc/html/api/identifier-index-A.html

doc/html/api/identifier-index-B.html

doc/html/api/identifier-index-C.html

doc/html/api/identifier-index-D.html

doc/html/api/identifier-index-E.html

doc/html/api/identifier-index-F.html

doc/html/api/identifier-index-G.html

doc/html/api/identifier-index-H.html

doc/html/api/identifier-index-I.html

doc/html/api/identifier-index-J.html

doc/html/api/identifier-index-K.html

doc/html/api/identifier-index-L.html

doc/html/api/identifier-index-M.html

doc/html/api/identifier-index-N.html

doc/html/api/identifier-index-O.html

doc/html/api/identifier-index-P.html

doc/html/api/identifier-index-Q.html

doc/html/api/identifier-index-R.html

doc/html/api/identifier-index-S.html

doc/html/api/identifier-index-T.html

doc/html/api/identifier-index-U.html

doc/html/api/identifier-index-V.html

doc/html/api/identifier-index-W.html

doc/html/api/identifier-index-X.html

doc/html/api/identifier-index-Y.html

doc/html/api/identifier-index-Z.html

doc/html/api/identifier-index-_.html

doc/html/api/identifier-index.html

doc/html/api/lxml-module.html

doc/html/api/lxml-pysrc.html

doc/html/api/lxml.ElementInclude-module.html

doc/html/api/lxml.ElementInclude-pysrc.html

doc/html/api/lxml.ElementInclude.FatalIncludeError-class.html

doc/html/api/lxml.builder-module.html

doc/html/api/lxml.builder-pysrc.html

doc/html/api/lxml.builder.ElementMaker-class.html

doc/html/api/lxml.cssselect-module.html

doc/html/api/lxml.cssselect-pysrc.html

doc/html/api/lxml.cssselect.Attrib-class.html

doc/html/api/lxml.cssselect.CSSSelector-class.html

doc/html/api/lxml.cssselect.Class-class.html

doc/html/api/lxml.cssselect.CombinedSelector-class.html

doc/html/api/lxml.cssselect.Element-class.html

doc/html/api/lxml.cssselect.ExpressionError-class.html

doc/html/api/lxml.cssselect.Function-class.html

doc/html/api/lxml.cssselect.Hash-class.html

doc/html/api/lxml.cssselect.Or-class.html

doc/html/api/lxml.cssselect.Pseudo-class.html

doc/html/api/lxml.cssselect.SelectorSyntaxError-class.html

doc/html/api/lxml.cssselect.String-class.html

doc/html/api/lxml.cssselect.Symbol-class.html

doc/html/api/lxml.cssselect.Token-class.html

doc/html/api/lxml.cssselect.TokenStream-class.html

doc/html/api/lxml.cssselect.XPathExpr-class.html

doc/html/api/lxml.cssselect.XPathExprOr-class.html

doc/html/api/lxml.cssselect._UniToken-class.html

doc/html/api/lxml.doctestcompare-module.html

doc/html/api/lxml.doctestcompare-pysrc.html

doc/html/api/lxml.doctestcompare.LHTMLOutputChecker-class.html

doc/html/api/lxml.doctestcompare.LXMLOutputChecker-class.html

doc/html/api/lxml.doctestcompare._RestoreChecker-class.html

doc/html/api/lxml.etree-module.html

doc/html/api/lxml.etree.AncestorsIterator-class.html

doc/html/api/lxml.etree.AttributeBasedElementClassLookup-class.html

doc/html/api/lxml.etree.C14NError-class.html

doc/html/api/lxml.etree.CDATA-class.html

doc/html/api/lxml.etree.CommentBase-class.html

doc/html/api/lxml.etree.CustomElementClassLookup-class.html

doc/html/api/lxml.etree.DTD-class.html

doc/html/api/lxml.etree.DTDError-class.html

doc/html/api/lxml.etree.DTDParseError-class.html

doc/html/api/lxml.etree.DTDValidateError-class.html

doc/html/api/lxml.etree.DocInfo-class.html

doc/html/api/lxml.etree.DocumentInvalid-class.html

doc/html/api/lxml.etree.ETCompatXMLParser-class.html

doc/html/api/lxml.etree.ETXPath-class.html

doc/html/api/lxml.etree.ElementBase-class.html

doc/html/api/lxml.etree.ElementChildIterator-class.html

doc/html/api/lxml.etree.ElementClassLookup-class.html

doc/html/api/lxml.etree.ElementDefaultClassLookup-class.html

doc/html/api/lxml.etree.ElementDepthFirstIterator-class.html

doc/html/api/lxml.etree.ElementNamespaceClassLookup-class.html

doc/html/api/lxml.etree.ElementTextIterator-class.html

doc/html/api/lxml.etree.EntityBase-class.html

doc/html/api/lxml.etree.Error-class.html

doc/html/api/lxml.etree.ErrorDomains-class.html

doc/html/api/lxml.etree.ErrorLevels-class.html

doc/html/api/lxml.etree.ErrorTypes-class.html

doc/html/api/lxml.etree.FallbackElementClassLookup-class.html

doc/html/api/lxml.etree.HTMLParser-class.html

doc/html/api/lxml.etree.LxmlError-class.html

doc/html/api/lxml.etree.LxmlRegistryError-class.html

doc/html/api/lxml.etree.LxmlSyntaxError-class.html

doc/html/api/lxml.etree.NamespaceRegistryError-class.html

doc/html/api/lxml.etree.PIBase-class.html

doc/html/api/lxml.etree.ParseError-class.html

doc/html/api/lxml.etree.ParserBasedElementClassLookup-class.html

doc/html/api/lxml.etree.ParserError-class.html

doc/html/api/lxml.etree.PyErrorLog-class.html

doc/html/api/lxml.etree.PythonElementClassLookup-class.html

doc/html/api/lxml.etree.QName-class.html

doc/html/api/lxml.etree.RelaxNG-class.html

doc/html/api/lxml.etree.RelaxNGError-class.html

doc/html/api/lxml.etree.RelaxNGErrorTypes-class.html

doc/html/api/lxml.etree.RelaxNGParseError-class.html

doc/html/api/lxml.etree.RelaxNGValidateError-class.html

doc/html/api/lxml.etree.Resolver-class.html

doc/html/api/lxml.etree.Schematron-class.html

doc/html/api/lxml.etree.SchematronError-class.html

doc/html/api/lxml.etree.SchematronParseError-class.html

doc/html/api/lxml.etree.SchematronValidateError-class.html

doc/html/api/lxml.etree.SiblingsIterator-class.html

doc/html/api/lxml.etree.TreeBuilder-class.html

doc/html/api/lxml.etree.XInclude-class.html

doc/html/api/lxml.etree.XIncludeError-class.html

doc/html/api/lxml.etree.XMLParser-class.html

doc/html/api/lxml.etree.XMLSchema-class.html

doc/html/api/lxml.etree.XMLSchemaError-class.html

doc/html/api/lxml.etree.XMLSchemaParseError-class.html

doc/html/api/lxml.etree.XMLSchemaValidateError-class.html

doc/html/api/lxml.etree.XMLSyntaxError-class.html

doc/html/api/lxml.etree.XPath-class.html

doc/html/api/lxml.etree.XPathDocumentEvaluator-class.html

doc/html/api/lxml.etree.XPathElementEvaluator-class.html

doc/html/api/lxml.etree.XPathError-class.html

doc/html/api/lxml.etree.XPathEvalError-class.html

doc/html/api/lxml.etree.XPathFunctionError-class.html

doc/html/api/lxml.etree.XPathResultError-class.html

doc/html/api/lxml.etree.XPathSyntaxError-class.html

doc/html/api/lxml.etree.XSLT-class.html

doc/html/api/lxml.etree.XSLTAccessControl-class.html

doc/html/api/lxml.etree.XSLTApplyError-class.html

doc/html/api/lxml.etree.XSLTError-class.html

doc/html/api/lxml.etree.XSLTExtension-class.html

doc/html/api/lxml.etree.XSLTExtensionError-class.html

doc/html/api/lxml.etree.XSLTParseError-class.html

doc/html/api/lxml.etree.XSLTSaveError-class.html

doc/html/api/lxml.etree._AppendOnlyElementProxy-class.html

doc/html/api/lxml.etree._Attrib-class.html

doc/html/api/lxml.etree._AttribIterator-class.html

doc/html/api/lxml.etree._BaseContext-class.html

doc/html/api/lxml.etree._BaseErrorLog-class.html

doc/html/api/lxml.etree._BaseParser-class.html

doc/html/api/lxml.etree._ClassNamespaceRegistry-class.html

doc/html/api/lxml.etree._Comment-class.html

doc/html/api/lxml.etree._Document-class.html

doc/html/api/lxml.etree._DomainErrorLog-class.html

doc/html/api/lxml.etree._Element-class.html

doc/html/api/lxml.etree._ElementIterator-class.html

doc/html/api/lxml.etree._ElementStringResult-class.html

doc/html/api/lxml.etree._ElementTagMatcher-class.html

doc/html/api/lxml.etree._ElementTree-class.html

doc/html/api/lxml.etree._ElementUnicodeResult-class.html

doc/html/api/lxml.etree._Entity-class.html

doc/html/api/lxml.etree._ErrorLog-class.html

doc/html/api/lxml.etree._ExceptionContext-class.html

doc/html/api/lxml.etree._ExsltRegExp-class.html

doc/html/api/lxml.etree._FeedParser-class.html

doc/html/api/lxml.etree._FileReaderContext-class.html

doc/html/api/lxml.etree._FilelikeWriter-class.html

doc/html/api/lxml.etree._FunctionNamespaceRegistry-class.html

doc/html/api/lxml.etree._IDDict-class.html

doc/html/api/lxml.etree._InputDocument-class.html

doc/html/api/lxml.etree._IterparseContext-class.html

doc/html/api/lxml.etree._ListErrorLog-class.html

doc/html/api/lxml.etree._LogEntry-class.html

doc/html/api/lxml.etree._NamespaceRegistry-class.html

doc/html/api/lxml.etree._ParserContext-class.html

doc/html/api/lxml.etree._ParserDictionaryContext-class.html

doc/html/api/lxml.etree._ParserSchemaValidationContext-class.html

doc/html/api/lxml.etree._ProcessingInstruction-class.html

doc/html/api/lxml.etree._PythonSaxParserTarget-class.html

doc/html/api/lxml.etree._ReadOnlyElementProxy-class.html

doc/html/api/lxml.etree._ResolverContext-class.html

doc/html/api/lxml.etree._ResolverRegistry-class.html

doc/html/api/lxml.etree._RotatingErrorLog-class.html

doc/html/api/lxml.etree._SaxParserContext-class.html

doc/html/api/lxml.etree._SaxParserTarget-class.html

doc/html/api/lxml.etree._TargetParserContext-class.html

doc/html/api/lxml.etree._TargetParserResult-class.html

doc/html/api/lxml.etree._TempStore-class.html

doc/html/api/lxml.etree._Validator-class.html

doc/html/api/lxml.etree._XPathContext-class.html

doc/html/api/lxml.etree._XPathEvaluatorBase-class.html

doc/html/api/lxml.etree._XPathFunctionNamespaceRegistry-class.html

doc/html/api/lxml.etree._XSLTContext-class.html

doc/html/api/lxml.etree._XSLTProcessingInstruction-class.html

doc/html/api/lxml.etree._XSLTResolverContext-class.html

doc/html/api/lxml.etree._XSLTResultTree-class.html

doc/html/api/lxml.etree.__ContentOnlyElement-class.html

doc/html/api/lxml.etree.iterparse-class.html

doc/html/api/lxml.etree.iterwalk-class.html

doc/html/api/lxml.html-module.html

doc/html/api/lxml.html-pysrc.html

doc/html/api/lxml.html.CheckboxGroup-class.html

doc/html/api/lxml.html.CheckboxValues-class.html

doc/html/api/lxml.html.ElementSoup-module.html

doc/html/api/lxml.html.ElementSoup-pysrc.html

doc/html/api/lxml.html.FieldsDict-class.html

doc/html/api/lxml.html.FormElement-class.html

doc/html/api/lxml.html.HTMLParser-class.html

doc/html/api/lxml.html.HtmlComment-class.html

doc/html/api/lxml.html.HtmlElement-class.html

doc/html/api/lxml.html.HtmlElementClassLookup-class.html

doc/html/api/lxml.html.HtmlEntity-class.html

doc/html/api/lxml.html.HtmlMixin-class.html

doc/html/api/lxml.html.HtmlProcessingInstruction-class.html

doc/html/api/lxml.html.InputElement-class.html

doc/html/api/lxml.html.InputGetter-class.html

doc/html/api/lxml.html.InputMixin-class.html

doc/html/api/lxml.html.LabelElement-class.html

doc/html/api/lxml.html.MultipleSelectOptions-class.html

doc/html/api/lxml.html.RadioGroup-class.html

doc/html/api/lxml.html.SelectElement-class.html

doc/html/api/lxml.html.TextareaElement-class.html

doc/html/api/lxml.html.XHTMLParser-class.html

doc/html/api/lxml.html._MethodFunc-class.html

doc/html/api/lxml.html.builder-module.html

doc/html/api/lxml.html.builder-pysrc.html

doc/html/api/lxml.html.clean-module.html

doc/html/api/lxml.html.clean-pysrc.html

doc/html/api/lxml.html.clean.Cleaner-class.html

doc/html/api/lxml.html.defs-module.html

doc/html/api/lxml.html.defs-pysrc.html

doc/html/api/lxml.html.diff-module.html

doc/html/api/lxml.html.diff-pysrc.html

doc/html/api/lxml.html.diff.DEL_END-class.html

doc/html/api/lxml.html.diff.DEL_START-class.html

doc/html/api/lxml.html.diff.InsensitiveSequenceMatcher-class.html

doc/html/api/lxml.html.diff.NoDeletes-class.html

doc/html/api/lxml.html.diff.href_token-class.html

doc/html/api/lxml.html.diff.tag_token-class.html

doc/html/api/lxml.html.diff.token-class.html

doc/html/api/lxml.html.formfill-module.html

doc/html/api/lxml.html.formfill-pysrc.html

doc/html/api/lxml.html.formfill.DefaultErrorCreator-class.html

doc/html/api/lxml.html.formfill.FormNotFound-class.html

doc/html/api/lxml.html.soupparser-module.html

doc/html/api/lxml.html.soupparser-pysrc.html

doc/html/api/lxml.html.usedoctest-module.html

doc/html/api/lxml.html.usedoctest-pysrc.html

doc/html/api/lxml.objectify-module.html

doc/html/api/lxml.objectify.BoolElement-class.html

doc/html/api/lxml.objectify.ElementMaker-class.html

doc/html/api/lxml.objectify.FloatElement-class.html

doc/html/api/lxml.objectify.IntElement-class.html

doc/html/api/lxml.objectify.LongElement-class.html

doc/html/api/lxml.objectify.NoneElement-class.html

doc/html/api/lxml.objectify.NumberElement-class.html

doc/html/api/lxml.objectify.ObjectPath-class.html

doc/html/api/lxml.objectify.ObjectifiedDataElement-class.html

doc/html/api/lxml.objectify.ObjectifiedElement-class.html

doc/html/api/lxml.objectify.ObjectifyElementClassLookup-class.html

doc/html/api/lxml.objectify.PyType-class.html

doc/html/api/lxml.objectify.StringElement-class.html

doc/html/api/lxml.objectify._ObjectifyElementMakerCaller-class.html

doc/html/api/lxml.pyclasslookup-module.html

doc/html/api/lxml.pyclasslookup-pysrc.html

doc/html/api/lxml.sax-module.html

doc/html/api/lxml.sax-pysrc.html

doc/html/api/lxml.sax.ElementTreeContentHandler-class.html

doc/html/api/lxml.sax.ElementTreeProducer-class.html

doc/html/api/lxml.sax.SaxError-class.html

doc/html/api/lxml.tests-module.html

doc/html/api/lxml.tests-pysrc.html

doc/html/api/lxml.tests.common_imports-module.html

doc/html/api/lxml.tests.common_imports-pysrc.html

doc/html/api/lxml.tests.common_imports.HelperTestCase-class.html

doc/html/api/lxml.tests.common_imports.LargeFileLike-class.html

doc/html/api/lxml.tests.common_imports.LargeFileLikeUnicode-class.html

doc/html/api/lxml.tests.common_imports.SillyFileLike-class.html

doc/html/api/lxml.tests.test_classlookup-module.html

doc/html/api/lxml.tests.test_classlookup-pysrc.html

doc/html/api/lxml.tests.test_classlookup.ClassLookupTestCase-class.html

doc/html/api/lxml.tests.test_css-module.html

doc/html/api/lxml.tests.test_css-pysrc.html

doc/html/api/lxml.tests.test_css.CSSTestCase-class.html

doc/html/api/lxml.tests.test_dtd-module.html

doc/html/api/lxml.tests.test_dtd-pysrc.html

doc/html/api/lxml.tests.test_dtd.ETreeDtdTestCase-class.html

doc/html/api/lxml.tests.test_elementtree-module.html

doc/html/api/lxml.tests.test_elementtree-pysrc.html

doc/html/api/lxml.tests.test_elementtree.CElementTreeTestCase-class.html

doc/html/api/lxml.tests.test_elementtree.ETreeTestCase-class.html

doc/html/api/lxml.tests.test_elementtree.ETreeTestCaseBase-class.html

doc/html/api/lxml.tests.test_elementtree.ElementTreeTestCase-class.html

doc/html/api/lxml.tests.test_errors-module.html

doc/html/api/lxml.tests.test_errors-pysrc.html

doc/html/api/lxml.tests.test_errors.ErrorTestCase-class.html

doc/html/api/lxml.tests.test_etree-module.html

doc/html/api/lxml.tests.test_etree-pysrc.html

doc/html/api/lxml.tests.test_etree.ETreeC14NTestCase-class.html

doc/html/api/lxml.tests.test_etree.ETreeOnlyTestCase-class.html

doc/html/api/lxml.tests.test_etree.ETreeXIncludeTestCase-class.html

doc/html/api/lxml.tests.test_etree.ElementIncludeTestCase-class.html

doc/html/api/lxml.tests.test_etree.XIncludeTestCase-class.html

doc/html/api/lxml.tests.test_htmlparser-module.html

doc/html/api/lxml.tests.test_htmlparser-pysrc.html

doc/html/api/lxml.tests.test_htmlparser.HtmlParserTestCase-class.html

doc/html/api/lxml.tests.test_io-module.html

doc/html/api/lxml.tests.test_io-pysrc.html

doc/html/api/lxml.tests.test_io.ETreeIOTestCase-class.html

doc/html/api/lxml.tests.test_io.ElementTreeIOTestCase-class.html

doc/html/api/lxml.tests.test_io.IOTestCaseBase-class.html

doc/html/api/lxml.tests.test_nsclasses-module.html

doc/html/api/lxml.tests.test_nsclasses-pysrc.html

doc/html/api/lxml.tests.test_nsclasses.ETreeNamespaceClassesTestCase-class.html

doc/html/api/lxml.tests.test_nsclasses.ETreeNamespaceClassesTestCase.bluff_class-class.html

doc/html/api/lxml.tests.test_nsclasses.ETreeNamespaceClassesTestCase.default_class-class.html

doc/html/api/lxml.tests.test_nsclasses.ETreeNamespaceClassesTestCase.maeh_class-class.html

doc/html/api/lxml.tests.test_objectify-module.html

doc/html/api/lxml.tests.test_objectify-pysrc.html

doc/html/api/lxml.tests.test_objectify.ObjectifyTestCase-class.html

doc/html/api/lxml.tests.test_pyclasslookup-module.html

doc/html/api/lxml.tests.test_pyclasslookup-pysrc.html

doc/html/api/lxml.tests.test_pyclasslookup.PyClassLookupTestCase-class.html

doc/html/api/lxml.tests.test_relaxng-module.html

doc/html/api/lxml.tests.test_relaxng-pysrc.html

doc/html/api/lxml.tests.test_relaxng.ETreeRelaxNGTestCase-class.html

doc/html/api/lxml.tests.test_sax-module.html

doc/html/api/lxml.tests.test_sax-pysrc.html

doc/html/api/lxml.tests.test_sax.ETreeSaxTestCase-class.html

doc/html/api/lxml.tests.test_schematron-module.html

doc/html/api/lxml.tests.test_schematron-pysrc.html

doc/html/api/lxml.tests.test_schematron.ETreeSchematronTestCase-class.html

doc/html/api/lxml.tests.test_threading-module.html

doc/html/api/lxml.tests.test_threading-pysrc.html

doc/html/api/lxml.tests.test_threading.ThreadingTestCase-class.html

doc/html/api/lxml.tests.test_unicode-module.html

doc/html/api/lxml.tests.test_unicode-pysrc.html

doc/html/api/lxml.tests.test_unicode.UnicodeTestCase-class.html

doc/html/api/lxml.tests.test_xmlschema-module.html

doc/html/api/lxml.tests.test_xmlschema-pysrc.html

doc/html/api/lxml.tests.test_xmlschema.ETreeXMLSchemaTestCase-class.html

doc/html/api/lxml.tests.test_xpathevaluator-module.html

doc/html/api/lxml.tests.test_xpathevaluator-pysrc.html

doc/html/api/lxml.tests.test_xpathevaluator.ETreeETXPathClassTestCase-class.html

doc/html/api/lxml.tests.test_xpathevaluator.ETreeXPathClassTestCase-class.html

doc/html/api/lxml.tests.test_xpathevaluator.ETreeXPathTestCase-class.html

doc/html/api/lxml.tests.test_xslt-module.html

doc/html/api/lxml.tests.test_xslt-pysrc.html

doc/html/api/lxml.tests.test_xslt.ETreeXSLTTestCase-class.html

doc/html/api/lxml.tests.test_xslt.Py3XSLTTestCase-class.html

doc/html/api/lxml.usedoctest-module.html

doc/html/api/lxml.usedoctest-pysrc.html

doc/html/api/module-tree.html

doc/html/api/redirect.html

doc/html/api/str-class.html

doc/html/api/toc-everything.html

doc/html/api/toc-lxml.etree-module.html

doc/html/api/toc-lxml.html-module.html

doc/html/api/toc-lxml.tests.test_etree-module.html

doc/html/api/toc-lxml.tests.test_threading-module.html

doc/html/api/toc-lxml.tests.test_xmlschema-module.html

doc/html/api/toc.html

doc/html/build.html

doc/html/capi.html

doc/html/compatibility.html

doc/html/credits.html

doc/html/cssselect.html

doc/html/element_classes.html

doc/html/elementsoup.html

doc/html/extensions.html

doc/html/index.html

doc/html/installation.html

doc/html/intro.html

doc/html/lxml-source-howto.html

doc/html/lxml2.html

doc/html/lxmlhtml.html

doc/html/objectify.html

doc/html/parsing.html

doc/html/performance.html

doc/html/resolvers.html

doc/html/sax.html

doc/html/style.css

doc/html/tutorial.html

doc/html/validation.html

doc/html/xpathxslt.html

doc/main.txt

doc/mkhtml.py

doc/mklatex.py

doc/objectify.txt

doc/parsing.txt

doc/performance.txt

doc/s5/lxml-ep2008.html

doc/tutorial.txt

doc/xpathxslt.txt

ez_setup.py

setup.py

setupinfo.py

src/lxml.egg-info/PKG-INFO

src/lxml.egg-info/SOURCES.txt

src/lxml/_elementpath.py

src/lxml/apihelpers.pxi

src/lxml/classlookup.pxi

src/lxml/cstd.pxd

src/lxml/docloader.pxi

src/lxml/etree_defs.h

src/lxml/extensions.pxi

src/lxml/html/__init__.py

src/lxml/html/clean.py

src/lxml/html/diff.py

src/lxml/html/soupparser.py

src/lxml/html/tests/hackers-org-data/style-url-js.data

src/lxml/html/tests/test_autolink.txt

src/lxml/html/tests/test_clean_embed.txt

src/lxml/html/tests/test_diff.txt

src/lxml/html/tests/test_elementsoup.py

src/lxml/html/tests/test_rewritelinks.txt

src/lxml/iterparse.pxi

src/lxml/lxml.etree.c

src/lxml/lxml.etree.h

src/lxml/lxml.etree.pyx

src/lxml/lxml.etree_api.h

src/lxml/lxml.objectify.c

src/lxml/lxml.objectify.pyx

src/lxml/nsclasses.pxi

src/lxml/objectpath.pxi

src/lxml/parser.pxi

src/lxml/parsertarget.pxi

src/lxml/proxy.pxi

src/lxml/python.pxd

src/lxml/readonlytree.pxi

src/lxml/saxparser.pxi

src/lxml/serializer.pxi

src/lxml/tests/test_elementtree.py

src/lxml/tests/test_etree.py

src/lxml/tests/test_io.py

src/lxml/tests/test_nsclasses.py

src/lxml/tests/test_objectify.py

src/lxml/tests/test_threading.py

src/lxml/tests/test_xmlschema.py

src/lxml/tests/test_xpathevaluator.py

src/lxml/tests/test_xslt.py

src/lxml/tree.pxd

src/lxml/xmlerror.pxd

src/lxml/xmlerror.pxi

src/lxml/xmlid.pxi

src/lxml/xmlparser.pxd

src/lxml/xmlschema.pxd

src/lxml/xmlschema.pxi

src/lxml/xpath.pxi

src/lxml/xslt.pxd

src/lxml/xslt.pxi

src/lxml/xsltext.pxi

version.txt

versioninfo.py

Show diffs side-by-side

added added

removed removed

doc/elementsoup.txt

2

2

BeautifulSoup Parser

3

3

====================

4

4

5

BeautifulSoup_ is a Python package that parses broken HTML. While libxml2

6

(and thus lxml) can also parse broken HTML, BeautifulSoup is a bit more

7

forgiving and has superiour `support for encoding detection`_.

5

BeautifulSoup_ is a Python package that parses broken HTML, just like

6

lxml supports it based on the parser of libxml2. BeautifulSoup uses a

7

different parsing approach. It is not a real HTML parser but uses

8

regular expressions to dive through tag soup. It is therefore more

9

forgiving in some cases and less good in others. It is not uncommon

10

that lxml/libxml2 parses and fixes broken HTML better, but

11

BeautifulSoup has superiour `support for encoding detection`_. It

12

very much depends on the input which parser works better.

8

13

9

14

.. _BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/

10

15

.. _`support for encoding detection`: http://www.crummy.com/software/BeautifulSoup/documentation.html#Beautiful%20Soup%20Gives%20You%20Unicode%2C%20Dammit

11

16

.. _ElementSoup: http://effbot.org/zone/element-soup.htm

12

17

13

lxml can benefit from the parsing capabilities of BeautifulSoup

14

through the ``lxml.html.soupparser`` module. It provides three main

15

functions: ``fromstring()`` and ``parse()`` to parse a string or file

16

using BeautifulSoup, and ``convert_tree()`` to convert an existing

17

BeautifulSoup tree into a list of top-level Elements.

18

To prevent users from having to choose their parser library in

19

advance, lxml can interface to the parsing capabilities of

20

BeautifulSoup through the ``lxml.html.soupparser`` module. It

21

provides three main functions: ``fromstring()`` and ``parse()`` to

22

parse a string or file using BeautifulSoup into an ``lxml.html``

23

document, and ``convert_tree()`` to convert an existing BeautifulSoup

24

tree into a list of top-level Elements.

18

25

19

26

20

27

Parsing with the soupparser

Older »