Class HTMLPurifier_Lexer_PH5P
Inheritance | HTMLPurifier_Lexer_PH5P » HTMLPurifier_Lexer_DOMLex » HTMLPurifier_Lexer |
---|---|
Experimental HTML5-based parser using Jeroen van der Meer's PH5P library.
Occupies space in the HTML5 pseudo-namespace, which may cause conflicts.
Public Properties
Property | Type | Description | Defined By |
---|---|---|---|
$tracksLineNumbers | | Whether or not this lexer implements line-number/column-number tracking. | HTMLPurifier_Lexer |
Protected Properties
Property | Type | Description | Defined By |
---|---|---|---|
$_special_entity2str | | Most common entity to raw value conversion table for special entities. | HTMLPurifier_Lexer |
Public Methods
Method | Description | Defined By |
---|---|---|
__construct() | | HTMLPurifier_Lexer_DOMLex |
callbackArmorCommentEntities() | Callback function that entity-izes ampersands in comments so that callbackUndoCommentSubst doesn't clobber them | HTMLPurifier_Lexer_DOMLex |
callbackUndoCommentSubst() | Callback function for undoing escaping of stray angled brackets in comments | HTMLPurifier_Lexer_DOMLex |
create() | Retrieves or sets the default Lexer as a Prototype Factory. | HTMLPurifier_Lexer |
extractBody() | Takes a string of HTML (fragment or document) and returns the content | HTMLPurifier_Lexer |
muteErrorHandler() | An error handler that mutes all errors | HTMLPurifier_Lexer_DOMLex |
normalize() | Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff. | HTMLPurifier_Lexer |
parseData() | Parses special entities into the proper characters. | HTMLPurifier_Lexer |
tokenizeHTML() | Tokenizes a string of HTML into an array of HTMLPurifier_Token objects. | HTMLPurifier_Lexer_PH5P |
Protected Methods
Method | Description | Defined By |
---|---|---|
CDATACallback() | Callback function for escapeCDATA() that does the work. | HTMLPurifier_Lexer |
createEndNode() | | HTMLPurifier_Lexer_DOMLex |
createStartNode() | | HTMLPurifier_Lexer_DOMLex |
escapeCDATA() | Translates CDATA sections into regular sections (through escaping). | HTMLPurifier_Lexer |
escapeCommentedCDATA() | Special CDATA case that is especially convoluted for script elements. | HTMLPurifier_Lexer |
removeIEConditional() | Special Internet Explorer conditional comments should be removed. | HTMLPurifier_Lexer |
tokenizeDOM() | Iterative function that tokenizes a node, putting it into an accumulator. | HTMLPurifier_Lexer_DOMLex |
transformAttrToAssoc() | Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array. | HTMLPurifier_Lexer_DOMLex |
wrapHTML() | Wraps an HTML fragment in the necessary HTML | HTMLPurifier_Lexer_DOMLex |
Method Details
public HTMLPurifier_Token[] tokenizeHTML ( $html, $config, $context )
Parameter | Type | Description |
---|---|---|
$html | string | |
$config | HTMLPurifier_Config | |
$context | HTMLPurifier_Context | |
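
The signature above can be exercised with a minimal sketch like the following. It assumes HTML Purifier is installed and that `HTMLPurifier.auto.php` is reachable on the include path (the path is an assumption), and it selects this lexer through the `Core.LexerImpl` configuration directive. The input fragment is hypothetical, and because the lexer is experimental, its behavior on a given PHP version should be verified before relying on it.

```php
<?php
// Assumption: HTML Purifier is installed and this path resolves to its autoloader.
require_once 'HTMLPurifier.auto.php';

$config = HTMLPurifier_Config::createDefault();
// Ask for the experimental PH5P lexer rather than the default implementation.
$config->set('Core.LexerImpl', 'PH5P');

$context = new HTMLPurifier_Context();

// create() returns the lexer implementation named by the configuration.
$lexer = HTMLPurifier_Lexer::create($config);

// Hypothetical input fragment, chosen only for illustration.
$html = '<p>Hello <b>world</b></p>';
$tokens = $lexer->tokenizeHTML($html, $config, $context);

// Walk the token stream: text tokens carry $data, tag tokens carry $name.
foreach ($tokens as $token) {
    if ($token instanceof HTMLPurifier_Token_Text) {
        echo 'text: ', $token->data, PHP_EOL;
    } elseif ($token instanceof HTMLPurifier_Token_Tag) {
        echo get_class($token), ': ', $token->name, PHP_EOL;
    }
}
```

Going through `HTMLPurifier_Lexer::create()` instead of instantiating the class directly keeps the choice of lexer in configuration, so switching back to another implementation only requires changing the `Core.LexerImpl` value.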