tagsoup-0.7: Parsing and extracting information from (possibly malformed) HTML documentsContentsIndex
tagsoup-0.7: Parsing and extracting information from (possibly malformed) HTML documents
TagSoup is a library for extracting information out of unstructured HTML code, sometimes known as tag-soup. The HTML does not have to be well formed, or render properly within any particular framework. This library is for situations where the author of the HTML is not cooperating with the person trying to extract the information, but is also not trying to hide the information.
Modules
show/hideText
show/hideHTML
Text.HTML.Download
show/hideText.HTML.TagSoup
Text.HTML.TagSoup.Entity
Text.HTML.TagSoup.Match
Text.HTML.TagSoup.Parser
Text.HTML.TagSoup.Render
Text.HTML.TagSoup.Tree
Text.HTML.TagSoup.Type
Produced by Haddock version 0.8