Class HTMLPurifier_Lexer_PH5P
| Inheritance | HTMLPurifier_Lexer_PH5P » HTMLPurifier_Lexer_DOMLex » HTMLPurifier_Lexer |
|---|
Experimental HTML5-based parser using Jeroen van der Meer's PH5P library.
Occupies space in the HTML5 pseudo-namespace, which may cause conflicts.
Public Properties
| Property | Type | Description | Defined By |
|---|---|---|---|
| $tracksLineNumbers | Whether or not this lexer implements line-number/column-number tracking. | HTMLPurifier_Lexer |
Protected Properties
| Property | Type | Description | Defined By |
|---|---|---|---|
| $_special_entity2str | Most common entity to raw value conversion table for special entities. | HTMLPurifier_Lexer |
Public Methods
| Method | Description | Defined By |
|---|---|---|
| __construct() | HTMLPurifier_Lexer_DOMLex | |
| callbackArmorCommentEntities() | Callback function that entity-izes ampersands in comments so that callbackUndoCommentSubst doesn't clobber them | HTMLPurifier_Lexer_DOMLex |
| callbackUndoCommentSubst() | Callback function for undoing escaping of stray angled brackets in comments | HTMLPurifier_Lexer_DOMLex |
| create() | Retrieves or sets the default Lexer as a Prototype Factory. | HTMLPurifier_Lexer |
| extractBody() | Takes a string of HTML (fragment or document) and returns the content | HTMLPurifier_Lexer |
| muteErrorHandler() | An error handler that mutes all errors | HTMLPurifier_Lexer_DOMLex |
| normalize() | Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff. | HTMLPurifier_Lexer |
| parseData() | Parses special entities into the proper characters. | HTMLPurifier_Lexer |
| tokenizeHTML() | HTMLPurifier_Lexer_PH5P |
Protected Methods
| Method | Description | Defined By |
|---|---|---|
| CDATACallback() | Callback function for escapeCDATA() that does the work. | HTMLPurifier_Lexer |
| createEndNode() | HTMLPurifier_Lexer_DOMLex | |
| createStartNode() | HTMLPurifier_Lexer_DOMLex | |
| escapeCDATA() | Translates CDATA sections into regular sections (through escaping). | HTMLPurifier_Lexer |
| escapeCommentedCDATA() | Special CDATA case that is especially convoluted for | HTMLPurifier_Lexer |
| removeIEConditional() | Special Internet Explorer conditional comments should be removed. | HTMLPurifier_Lexer |
| tokenizeDOM() | Iterative function that tokenizes a node, putting it into an accumulator. | HTMLPurifier_Lexer_DOMLex |
| transformAttrToAssoc() | Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array. | HTMLPurifier_Lexer_DOMLex |
| wrapHTML() | Wraps an HTML fragment in the necessary HTML | HTMLPurifier_Lexer_DOMLex |
Method Details
| public HTMLPurifier_Token[] tokenizeHTML ( $html, $config, $context ) | ||
| $html | string | |
| $config | HTMLPurifier_Config | |
| $context | HTMLPurifier_Context | |