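A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is "head"

Insert an HTML element for the token.

Set the head element pointer to the newly created head element.

Switch the insertion mode to "in head".

An end tag whose tag name is one of: "head", "body", "html", "br"

Act as described in the "anything else" entry below.

Any other end tag

Parse error. Ignore the token.

Anything else

Insert an HTML element for a "head" start tag token with no attributes.

Set the head element pointer to the newly created head element.

Switch the insertion mode to "in head".

Reprocess the current token.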

13.2.6.4.4 The "in head" insertion mode

When the user agent is to apply the rules for the "in head" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is one of: "base", "basefont", "bgsound", "link"

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

A start tag whose tag name is "meta"

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

If the element has a charset attribute, and getting an encoding from its value results in an encoding, and the confidence is currently tentative, then change the encoding to the resulting encoding.

Otherwise, if the element has an http-equiv attribute whose value is an ASCII case-insensitive match for the string "Content-Type", and the element has a content attribute, and applying the algorithm for extracting a character encoding from a meta element to that attribute's value returns an encoding, and the confidence is currently tentative, then change the encoding to the extracted encoding.
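
The second branch above depends on extracting a character encoding from the content attribute's value. The following is a minimal Python sketch of that extraction, not a definitive implementation. It assumes the caller only needs the raw label, so the subsequent "getting an encoding" lookup is left out, and the name `extract_charset_label` is hypothetical.

```python
import string

# Map only A-Z to a-z so indices stay aligned with the original string.
ASCII_LOWER = {ord(c): ord(c.lower()) for c in string.ascii_uppercase}
ASCII_WHITESPACE = "\t\n\x0c\r "


def extract_charset_label(content):
    """Return the charset label found in a meta content value, or None.

    Simplification (assumption): the label is returned as-is; mapping it
    to an actual encoding is not part of this sketch.
    """
    folded = content.translate(ASCII_LOWER)
    position = 0
    while True:
        found = folded.find("charset", position)
        if found == -1:
            return None
        position = found + len("charset")
        # Skip ASCII whitespace after the word "charset".
        while position < len(content) and content[position] in ASCII_WHITESPACE:
            position += 1
        if position < len(content) and content[position] == "=":
            position += 1
            break
        # Not followed by "=": resume the search from this position.
    # Skip ASCII whitespace after the equals sign.
    while position < len(content) and content[position] in ASCII_WHITESPACE:
        position += 1
    if position >= len(content):
        return None
    quote = content[position]
    if quote in "\"'":
        end = content.find(quote, position + 1)
        # An unmatched quote yields no encoding.
        return content[position + 1:end] if end != -1 else None
    end = position
    while end < len(content) and content[end] not in ASCII_WHITESPACE + ";":
        end += 1
    return content[position:end]
```

In this sketch the result would only be applied when the confidence is still tentative, as the entry above requires.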

A start tag whose tag name is "title"

Follow the generic RCDATA element parsing algorithm.

A start tag whose tag name is "noscript", if the scripting flag is enabled

A start tag whose tag name is one of: "noframes", "style"

Follow the generic raw text element parsing algorithm.

A start tag whose tag name is "noscript", if the scripting flag is disabled

Insert an HTML element for the token.

Switch the insertion mode to "in head noscript".

A start tag whose tag name is "script"

Run these steps:

Let the adjusted insertion location be the appropriate place for inserting a node.
Create an element for the token in the HTML namespace, with the intended parent being the element in which the adjusted insertion location finds itself.
Set the element's parser document to the Document, and unset the element's "non-blocking" flag.

This ensures that, if the script is external, any document.write() calls in the script will execute in-line, instead of blowing the document away, as would happen in most other cases. It also prevents the script from executing until the end tag is seen.
If the parser was created as part of the HTML fragment parsing algorithm, then mark the script element as "already started". (fragment case)
If the parser was invoked via the document.write() or document.writeln() methods, then optionally mark the script element as "already started". (For example, the user agent might use this clause to prevent execution of cross-origin scripts inserted via document.write() under slow network conditions, or when the page has already taken a long time to load.)
Insert the newly created element at the adjusted insertion location.
Push the element onto the stack of open elements so that it is the new current node.
Switch the tokenizer to the script data state.
Let the original insertion mode be the current insertion mode.
Switch the insertion mode to "text".

An end tag whose tag name is "head"

Pop the current node (which will be the head element) off the stack of open elements.

Switch the insertion mode to "after head".

An end tag whose tag name is one of: "body", "html", "br"

Act as described in the "anything else" entry below.

A start tag whose tag name is "template"

Insert an HTML element for the token.

Insert a marker at the end of the list of active formatting elements.

Set the frameset-ok flag to "not ok".

Switch the insertion mode to "in template".

Push "in template" onto the stack of template insertion modes so that it is the new current template insertion mode.

An end tag whose tag name is "template"

If there is no template element on the stack of open elements, then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate all implied end tags thoroughly.
If the current node is not a template element, then this is a parse error.
Pop elements from the stack of open elements until a template element has been popped from the stack.
Clear the list of active formatting elements up to the last marker.
Pop the current template insertion mode off the stack of template insertion modes.
Reset the insertion mode appropriately.

A start tag whose tag name is "head"

Any other end tag

Parse error. Ignore the token.

Anything else

Pop the current node (which will be the head element) off the stack of open elements.

Switch the insertion mode to "after head".

Reprocess the token.

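13.2.6.4.5 The "in head noscript" insertion mode

When the user agent is to apply the rules for the "in head noscript" insertion mode, the user agent must handle the token as follows:

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

An end tag whose tag name is "noscript"

Pop the current node (which will be a noscript element) from the stack of open elements; the new current node will be a head element.

Switch the insertion mode to "in head".

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

A comment token

A start tag whose tag name is one of: "basefont", "bgsound", "link", "meta", "noframes", "style"

Process the token using the rules for the "in head" insertion mode.

An end tag whose tag name is "br"

Act as described in the "anything else" entry below.

A start tag whose tag name is one of: "head", "noscript"

Any other end tag

Parse error. Ignore the token.

Anything else

Parse error.

Pop the current node (which will be a noscript element) from the stack of open elements; the new current node will be a head element.

Switch the insertion mode to "in head".

Reprocess the token.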

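13.2.6.4.6 The "after head" insertion mode

When the user agent is to apply the rules for the "after head" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.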

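A start tag whose tag name is "body"

Insert an HTML element for the token.

Set the frameset-ok flag to "not ok".

Switch the insertion mode to "in body".

A start tag whose tag name is "frameset"

Insert an HTML element for the token.

Switch the insertion mode to "in frameset".

A start tag whose tag name is one of: "base", "basefont", "bgsound", "link", "meta", "noframes", "script", "style", "template", "title"

Parse error.

Push the node pointed to by the head element pointer onto the stack of open elements.

Process the token using the rules for the "in head" insertion mode.

Remove the node pointed to by the head element pointer from the stack of open elements. (It might not be the current node at this point.)

The head element pointer cannot be null at this point.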

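An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.

An end tag whose tag name is one of: "body", "html", "br"

Act as described in the "anything else" entry below.

A start tag whose tag name is "head"

Any other end tag

Parse error. Ignore the token.

Anything else

Insert an HTML element for a "body" start tag token with no attributes.

Switch the insertion mode to "in body".

Reprocess the current token.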

13.2.6.4.7 The "in body" insertion mode

When the user agent is to apply the rules for the "in body" insertion mode, the user agent must handle the token as follows:

A character token that is U+0000 NULL

Parse error. Ignore the token.

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Reconstruct the active formatting elements, if any.

Insert the token's character.

Any other character token

Reconstruct the active formatting elements, if any.

Insert the token's character.

Set the frameset-ok flag to "not ok".

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Parse error.

If there is a template element on the stack of open elements, then ignore the token.

Otherwise, for each attribute on the token, check to see if the attribute is already present on the top element of the stack of open elements. If it is not, add the attribute and its corresponding value to that element.
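
The attribute merge described above can be sketched in Python as follows. This is an illustration only: it assumes attributes are modelled as plain dicts from name to value, and the helper name `merge_missing_attributes` is hypothetical.

```python
def merge_missing_attributes(token_attributes, element_attributes):
    """Copy attributes from the token that the element does not already have.

    Attributes already present on the element keep their existing values.
    """
    for name, value in token_attributes.items():
        if name not in element_attributes:
            element_attributes[name] = value
    return element_attributes
```

For example, merging a token carrying `lang="fr"` and `class="x"` into an element that already has `lang="en"` leaves `lang` untouched and adds only `class`. The same pattern fits the "body" start tag entry, which merges into the body element instead.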

A start tag whose tag name is one of: "base", "basefont", "bgsound", "link", "meta", "noframes", "script", "style", "template", "title"

An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.

A start tag whose tag name is "body"

Parse error.

If the second element on the stack of open elements is not a body element, if the stack of open elements has only one node on it, or if there is a template element on the stack of open elements, then ignore the token. (fragment case)

Otherwise, set the frameset-ok flag to "not ok"; then, for each attribute on the token, check to see if the attribute is already present on the body element (the second element) on the stack of open elements, and if it is not, add the attribute and its corresponding value to that element.

A start tag whose tag name is "frameset"

Parse error.

If the stack of open elements has only one node on it, or if the second element on the stack of open elements is not a body element, then ignore the token. (fragment case)

If the frameset-ok flag is set to "not ok", ignore the token.

Otherwise, run the following steps:

Remove the second element on the stack of open elements from its parent node, if it has one.
Pop all the nodes from the bottom of the stack of open elements, from the current node up to, but not including, the root html element.
Insert an HTML element for the token.
Switch the insertion mode to "in frameset".

An end-of-file token

If the stack of template insertion modes is not empty, then process the token using the rules for the "in template" insertion mode.

Otherwise, follow these steps:

If there is a node in the stack of open elements that is not either a dd element, a dt element, an li element, an optgroup element, an option element, a p element, an rb element, an rp element, an rt element, an rtc element, a tbody element, a td element, a tfoot element, a th element, a thead element, a tr element, the body element, or the html element, then this is a parse error.
Stop parsing.
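
The end-of-file check above amounts to a membership test over the stack of open elements. A minimal Python sketch, assuming the stack is modelled as a list of tag-name strings and ignoring namespaces (the names `ALLOWED_AT_EOF` and `has_unclosed_element_error` are hypothetical):

```python
# Elements that may remain open at end-of-file without a parse error.
ALLOWED_AT_EOF = {
    "dd", "dt", "li", "optgroup", "option", "p", "rb", "rp", "rt", "rtc",
    "tbody", "td", "tfoot", "th", "thead", "tr", "body", "html",
}


def has_unclosed_element_error(stack):
    """Return True if any open element is outside the allowed set."""
    return any(tag not in ALLOWED_AT_EOF for tag in stack)
```

The same test applies to the "body" end tag entry, which uses an identical list of elements.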

An end tag whose tag name is "body"

If the stack of open elements does not have a body element in scope, this is a parse error; ignore the token.

Otherwise, if there is a node in the stack of open elements that is not either a dd element, a dt element, an li element, an optgroup element, an option element, a p element, an rb element, an rp element, an rt element, an rtc element, a tbody element, a td element, a tfoot element, a th element, a thead element, a tr element, the body element, or the html element, then this is a parse error.

Switch the insertion mode to "after body".

An end tag whose tag name is "html"

If the stack of open elements does not have a body element in scope, this is a parse error; ignore the token.

Switch the insertion mode to "after body".

Reprocess the token.

A start tag whose tag name is one of: "address", "article", "aside", "blockquote", "center", "details", "dialog", "dir", "div", "dl", "fieldset", "figcaption", "figure", "footer", "header", "hgroup", "main", "menu", "nav", "ol", "p", "section", "summary", "ul"

If the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token.

A start tag whose tag name is one of: "h1", "h2", "h3", "h4", "h5", "h6"

If the stack of open elements has a p element in button scope, then close a p element.

If the current node is an HTML element whose tag name is one of "h1", "h2", "h3", "h4", "h5", or "h6", then this is a parse error; pop the current node off the stack of open elements.

Insert an HTML element for the token.

A start tag whose tag name is one of: "pre", "listing"

If the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token.

If the next token is a U+000A LINE FEED (LF) character token, then ignore that token and move on to the next one. (Newlines at the start of pre blocks are ignored as an authoring convenience.)

Set the frameset-ok flag to "not ok".

A start tag whose tag name is "form"

If the form element pointer is not null, and there is no template element on the stack of open elements, then this is a parse error; ignore the token.

Otherwise:

If the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token, and, if there is no template element on the stack of open elements, set the form element pointer to point to the element created.

A start tag whose tag name is "li"

Run these steps:

Set the frameset-ok flag to "not ok".
Initialize node to be the current node (the bottommost node of the stack).
Loop: If node is an li element, then run these substeps:
1. Generate implied end tags, except for li elements.
2. If the current node is not an li element, then this is a parse error.
3. Pop elements from the stack of open elements until an li element has been popped from the stack.
4. Jump to the step labeled done below.
If node is in the special category, but is not an address, div, or p element, then jump to the step labeled done below.
Otherwise, set node to the previous entry in the stack of open elements and return to the step labeled loop.
Done: If the stack of open elements has a p element in button scope, then close a p element.
Finally, insert an HTML element for the token.
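
The steps above can be sketched in Python as shown below. This is a simplified illustration, not a conforming implementation: the stack of open elements is modelled as a list of tag names with the current node last, `SPECIAL_SUBSET` is only a small assumed subset of the special category, `BUTTON_SCOPE_BOUNDARIES` lists HTML-namespace boundaries only, and all function names are hypothetical.

```python
IMPLIED_END_TAGS = {"dd", "dt", "li", "optgroup", "option", "p", "rb", "rp", "rt", "rtc"}
# Assumption: a small subset of the "special" category, enough for the example.
SPECIAL_SUBSET = {"address", "body", "div", "html", "li", "ol", "p", "table", "td", "ul"}
# Assumption: only the HTML-namespace boundaries of button scope.
BUTTON_SCOPE_BOUNDARIES = {
    "applet", "button", "caption", "html", "marquee",
    "object", "table", "td", "template", "th",
}


def has_p_in_button_scope(stack):
    """Walk from the current node upward until a p or a scope boundary."""
    for tag in reversed(stack):
        if tag == "p":
            return True
        if tag in BUTTON_SCOPE_BOUNDARIES:
            return False
    return False


def pop_until(stack, tag):
    """Pop entries until one with the given tag name has been popped."""
    while stack and stack.pop() != tag:
        pass


def generate_implied_end_tags(stack, except_for):
    while stack and stack[-1] in IMPLIED_END_TAGS and stack[-1] != except_for:
        stack.pop()


def handle_li_start_tag(stack):
    """Mutate the stack for an "li" start tag; return the parse errors seen."""
    errors = []
    # (The frameset-ok flag would be set to "not ok" here.)
    for tag in reversed(list(stack)):
        if tag == "li":
            generate_implied_end_tags(stack, except_for="li")
            if stack[-1] != "li":
                errors.append("current node is not an li element")
            pop_until(stack, "li")
            break
        if tag in SPECIAL_SUBSET and tag not in {"address", "div", "p"}:
            break
    # Done: close an open p element, if one is in button scope.
    if has_p_in_button_scope(stack):
        generate_implied_end_tags(stack, except_for="p")
        if stack[-1] != "p":
            errors.append("current node is not a p element")
        pop_until(stack, "p")
    stack.append("li")
    return errors
```

With a stack of `html, body, ul, li, p`, a new "li" start tag closes both the p and the open li before inserting the new item, whereas with `html, body, ul, li, ul` the inner ul stops the loop and the new li nests inside it.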

A start tag whose tag name is one of: "dd", "dt"

Run these steps:

Set the frameset-ok flag to "not ok".
Initialize node to be the current node (the bottommost node of the stack).
Loop: If node is a dd element, then run these substeps:
1. Generate implied end tags, except for dd elements.
2. If the current node is not a dd element, then this is a parse error.
3. Pop elements from the stack of open elements until a dd element has been popped from the stack.
4. Jump to the step labeled done below.
If node is a dt element, then run these substeps:
1. Generate implied end tags, except for dt elements.
2. If the current node is not a dt element, then this is a parse error.
3. Pop elements from the stack of open elements until a dt element has been popped from the stack.
4. Jump to the step labeled done below.
If node is in the special category, but is not an address, div, or p element, then jump to the step labeled done below.
Otherwise, set node to the previous entry in the stack of open elements and return to the step labeled loop.
Done: If the stack of open elements has a p element in button scope, then close a p element.
Finally, insert an HTML element for the token.

A start tag whose tag name is "plaintext"

If the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token.

Switch the tokenizer to the PLAINTEXT state.

Once a start tag with the tag name "plaintext" has been seen, that will be the last token ever seen other than character tokens (and the end-of-file token), because there is no way to switch out of the PLAINTEXT state.

A start tag whose tag name is "button"

If the stack of open elements has a button element in scope, then run these substeps:
1. Parse error.
2. Generate implied end tags.
3. Pop elements from the stack of open elements until a button element has been popped from the stack.
Reconstruct the active formatting elements, if any.
Insert an HTML element for the token.
Set the frameset-ok flag to "not ok".

An end tag whose tag name is one of: "address", "article", "aside", "blockquote", "button", "center", "details", "dialog", "dir", "div", "dl", "fieldset", "figcaption", "figure", "footer", "header", "hgroup", "listing", "main", "menu", "nav", "ol", "pre", "section", "summary", "ul"

If the stack of open elements does not have an element in scope that is an HTML element with the same tag name as that of the token, then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate implied end tags.
If the current node is not an HTML element with the same tag name as that of the token, then this is a parse error.
Pop elements from the stack of open elements until an HTML element with the same tag name as the token has been popped from the stack.

An end tag whose tag name is "form"

If there is no template element on the stack of open elements, then run these substeps:

Let node be the element that the form element pointer is set to, or null if it is not set to an element.
Set the form element pointer to null.
If node is null or if the stack of open elements does not have node in scope, then this is a parse error; return and ignore the token.
Generate implied end tags.
If the current node is not node, then this is a parse error.
Remove node from the stack of open elements.

If there is a template element on the stack of open elements, then run these substeps instead:

If the stack of open elements does not have a form element in scope, then this is a parse error; return and ignore the token.
Generate implied end tags.
If the current node is not a form element, then this is a parse error.
Pop elements from the stack of open elements until a form element has been popped from the stack.

An end tag whose tag name is "p"

If the stack of open elements does not have a p element in button scope, then this is a parse error; insert an HTML element for a "p" start tag token with no attributes.

Close a p element.

An end tag whose tag name is "li"

If the stack of open elements does not have an li element in list item scope, then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate implied end tags, except for li elements.
If the current node is not an li element, then this is a parse error.
Pop elements from the stack of open elements until an li element has been popped from the stack.

An end tag whose tag name is one of: "dd", "dt"

If the stack of open elements does not have an element in scope that is an HTML element with the same tag name as that of the token, then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate implied end tags, except for HTML elements with the same tag name as the token.
If the current node is not an HTML element with the same tag name as that of the token, then this is a parse error.
Pop elements from the stack of open elements until an HTML element with the same tag name as the token has been popped from the stack.

An end tag whose tag name is one of: "h1", "h2", "h3", "h4", "h5", "h6"

If the stack of open elements does not have an element in scope that is an HTML element and whose tag name is one of "h1", "h2", "h3", "h4", "h5", or "h6", then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate implied end tags.
If the current node is not an HTML element with the same tag name as that of the token, then this is a parse error.
Pop elements from the stack of open elements until an HTML element whose tag name is one of "h1", "h2", "h3", "h4", "h5", or "h6" has been popped from the stack.

An end tag whose tag name is "sarcasm"

Take a deep breath, then act as described in the "any other end tag" entry below.

A start tag whose tag name is "a"

If the list of active formatting elements contains an a element between the end of the list and the last marker on the list (or the start of the list if there is no marker on the list), then this is a parse error; run the adoption agency algorithm for the token, then remove that element from the list of active formatting elements and the stack of open elements if the adoption agency algorithm didn't already remove it (it might not have if the element is not in table scope).

In the non-conforming stream <a href="a">a<table><a href="b">b</table>x, the first a element would be closed upon seeing the second one, and the "x" character would be inside a link to "b", not to "a". This is despite the fact that the outer a element is not in table scope (meaning that a regular </a> end tag at the start of the table wouldn't close the outer a element). The result is that the two a elements are indirectly nested inside each other — non-conforming markup will often result in non-conforming DOMs when parsed.

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token. Push onto the list of active formatting elements that element.

A start tag whose tag name is one of: "b", "big", "code", "em", "font", "i", "s", "small", "strike", "strong", "tt", "u"

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token. Push onto the list of active formatting elements that element.

A start tag whose tag name is "nobr"

Reconstruct the active formatting elements, if any.

If the stack of open elements has a nobr element in scope, then this is a parse error; run the adoption agency algorithm for the token, then once again reconstruct the active formatting elements, if any.

Insert an HTML element for the token. Push onto the list of active formatting elements that element.

An end tag whose tag name is one of: "a", "b", "big", "code", "em", "font", "i", "nobr", "s", "small", "strike", "strong", "tt", "u"

Run the adoption agency algorithm for the token.

A start tag whose tag name is one of: "applet", "marquee", "object"

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token.

Insert a marker at the end of the list of active formatting elements.

Set the frameset-ok flag to "not ok".

An end tag token whose tag name is one of: "applet", "marquee", "object"

If the stack of open elements does not have an element in scope that is an HTML element with the same tag name as that of the token, then this is a parse error; ignore the token.

Otherwise, run these steps:

Generate implied end tags.
If the current node is not an HTML element with the same tag name as that of the token, then this is a parse error.
Pop elements from the stack of open elements until an HTML element with the same tag name as the token has been popped from the stack.
Clear the list of active formatting elements up to the last marker.

A start tag whose tag name is "table"

If the Document is not set to quirks mode, and the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token.

Set the frameset-ok flag to "not ok".

Switch the insertion mode to "in table".

An end tag whose tag name is "br"

Parse error. Drop the attributes from the token, and act as described in the next entry; i.e. act as if this was a "br" start tag token with no attributes, rather than the end tag token that it actually is.

A start tag whose tag name is one of: "area", "br", "embed", "img", "keygen", "wbr"

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

Set the frameset-ok flag to "not ok".

A start tag whose tag name is "input"

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

If the token does not have an attribute with the name "type", or if it does, but that attribute's value is not an ASCII case-insensitive match for the string "hidden", then: set the frameset-ok flag to "not ok".
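
The condition above can be expressed as a small predicate. The Python sketch below assumes the token's attributes are available as a dict; `ascii_lower` and `input_clears_frameset_ok` are hypothetical names. Only A to Z are folded, so the comparison is ASCII case-insensitive rather than Unicode case-insensitive.

```python
def ascii_lower(text):
    """Lowercase ASCII letters only, leaving every other character as-is."""
    return "".join(chr(ord(c) + 32) if "A" <= c <= "Z" else c for c in text)


def input_clears_frameset_ok(attributes):
    """Return True when an "input" start tag must set frameset-ok to "not ok"."""
    value = attributes.get("type")
    return value is None or ascii_lower(value) != "hidden"
```

A hidden input therefore leaves the frameset-ok flag alone, while every other input (including one with no type attribute) sets it to "not ok".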

A start tag whose tag name is one of: "param", "source", "track"

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

A start tag whose tag name is "hr"

If the stack of open elements has a p element in button scope, then close a p element.

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

Set the frameset-ok flag to "not ok".

A start tag whose tag name is "image"

Parse error. Change the token's tag name to "img" and reprocess it. (Don't ask.)

A start tag whose tag name is "textarea"

Run these steps:

Insert an HTML element for the token.
If the next token is a U+000A LINE FEED (LF) character token, then ignore that token and move on to the next one. (Newlines at the start of textarea elements are ignored as an authoring convenience.)
Switch the tokenizer to the RCDATA state.
Let the original insertion mode be the current insertion mode.
Set the frameset-ok flag to "not ok".
Switch the insertion mode to "text".

A start tag whose tag name is "xmp"

If the stack of open elements has a p element in button scope, then close a p element.

Reconstruct the active formatting elements, if any.

Set the frameset-ok flag to "not ok".

Follow the generic raw text element parsing algorithm.

A start tag whose tag name is "iframe"

Set the frameset-ok flag to "not ok".

Follow the generic raw text element parsing algorithm.

A start tag whose tag name is "noembed"

A start tag whose tag name is "noscript", if the scripting flag is enabled

Follow the generic raw text element parsing algorithm.

A start tag whose tag name is "select"

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token.

Set the frameset-ok flag to "not ok".

If the insertion mode is one of "in table", "in caption", "in table body", "in row", or "in cell", then switch the insertion mode to "in select in table". Otherwise, switch the insertion mode to "in select".

A start tag whose tag name is one of: "optgroup", "option"

If the current node is an option element, then pop the current node off the stack of open elements.

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token.

A start tag whose tag name is one of: "rb", "rtc"

If the stack of open elements has a ruby element in scope, then generate implied end tags. If the current node is not now a ruby element, this is a parse error.

Insert an HTML element for the token.

A start tag whose tag name is one of: "rp", "rt"

If the stack of open elements has a ruby element in scope, then generate implied end tags, except for rtc elements. If the current node is not now a rtc element or a ruby element, this is a parse error.

Insert an HTML element for the token.

A start tag whose tag name is "math"

Reconstruct the active formatting elements, if any.

Adjust MathML attributes for the token. (This fixes the case of MathML attributes that are not all lowercase.)

Adjust foreign attributes for the token. (This fixes the use of namespaced attributes, in particular XLink.)

Insert a foreign element for the token, in the MathML namespace.

If the token has its self-closing flag set, pop the current node off the stack of open elements and acknowledge the token's self-closing flag.

A start tag whose tag name is "svg"

Reconstruct the active formatting elements, if any.

Adjust SVG attributes for the token. (This fixes the case of SVG attributes that are not all lowercase.)

Adjust foreign attributes for the token. (This fixes the use of namespaced attributes, in particular XLink in SVG.)

Insert a foreign element for the token, in the SVG namespace.

If the token has its self-closing flag set, pop the current node off the stack of open elements and acknowledge the token's self-closing flag.

A start tag whose tag name is one of: "caption", "col", "colgroup", "frame", "head", "tbody", "td", "tfoot", "th", "thead", "tr"

Parse error. Ignore the token.

Any other start tag

Reconstruct the active formatting elements, if any.

Insert an HTML element for the token.

This element will be an ordinary element.

Any other end tag

Run these steps:

Initialize node to be the current node (the bottommost node of the stack).
Loop: If node is an HTML element with the same tag name as the token, then:
1. Generate implied end tags, except for HTML elements with the same tag name as the token.
2. If node is not the current node, then this is a parse error.
3. Pop all the nodes from the current node up to node, including node, then stop these steps.
Otherwise, if node is in the special category, then this is a parse error; ignore the token, and return.
Set node to the previous entry in the stack of open elements.
Return to the step labeled loop.
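
The loop above can be sketched in Python as follows. As a simplification, the stack of open elements is a list of tag names with the current node last, and `SPECIAL_SUBSET` is an assumed, incomplete stand-in for the special category; `handle_any_other_end_tag` is a hypothetical name.

```python
IMPLIED_END_TAGS = {"dd", "dt", "li", "optgroup", "option", "p", "rb", "rp", "rt", "rtc"}
# Assumption: a small subset of the "special" category, enough for the example.
SPECIAL_SUBSET = {"address", "body", "div", "html", "li", "p", "table", "td", "ul"}


def handle_any_other_end_tag(stack, tag):
    """Mutate the stack for an end tag; return the parse errors seen."""
    errors = []
    # Walk from the current node (last entry) toward the root.
    for index in range(len(stack) - 1, -1, -1):
        node = stack[index]
        if node == tag:
            # Generate implied end tags, except for elements named like the token.
            while stack[-1] in IMPLIED_END_TAGS and stack[-1] != tag:
                stack.pop()
            if index != len(stack) - 1:
                errors.append("node is not the current node")
            # Pop everything from the current node up to and including node.
            del stack[index:]
            return errors
        if node in SPECIAL_SUBSET:
            errors.append("special element reached; token ignored")
            return errors
    return errors
```

Because the root html element is in the special category, the walk always ends either at a matching element or at a special one.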

When the steps above say the user agent is to close a p element, it means that the user agent must run the following steps:

Generate implied end tags, except for p elements.
If the current node is not a p element, then this is a parse error.
Pop elements from the stack of open elements until a p element has been popped from the stack.
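
A minimal Python sketch of these three steps, assuming the stack of open elements is a list of tag names with the current node last, and that the caller has already checked that a p element is in button scope (the function names are hypothetical):

```python
IMPLIED_END_TAGS = {"dd", "dt", "li", "optgroup", "option", "p", "rb", "rp", "rt", "rtc"}


def generate_implied_end_tags(stack, except_for=None):
    """Pop implied-end-tag elements off the stack, sparing one tag name."""
    while stack and stack[-1] in IMPLIED_END_TAGS and stack[-1] != except_for:
        stack.pop()


def close_p_element(stack):
    """Close the nearest open p element; return the parse errors seen."""
    errors = []
    generate_implied_end_tags(stack, except_for="p")
    if not stack or stack[-1] != "p":
        errors.append("current node is not a p element")
    # Pop until a p element has been popped.
    while stack:
        if stack.pop() == "p":
            break
    return errors
```

An open rt or li directly inside the p is closed silently, while an open b triggers the parse error before being popped along with the p.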

The adoption agency algorithm, which takes as its only argument a token token for which the algorithm is being run, consists of the following steps:

Let subject be token's tag name.
If the current node is an HTML element whose tag name is subject, and the current node is not in the list of active formatting elements, then pop the current node off the stack of open elements, and return.
Let outer loop counter be zero.
Outer loop: If outer loop counter is greater than or equal to eight, then return.
Increment outer loop counter by one.
Let formatting element be the last element in the list of active formatting elements that:
- is between the end of the list and the last marker in the list, if any, or the start of the list otherwise, and
- has the tag name subject.
If there is no such element, then return and instead act as described in the "any other end tag" entry above.
If formatting element is not in the stack of open elements, then this is a parse error; remove the element from the list, and return.
If formatting element is in the stack of open elements, but the element is not in scope, then this is a parse error; return.
If formatting element is not the current node, this is a parse error. (But do not return.)
Let furthest block be the topmost node in the stack of open elements that is lower in the stack than formatting element, and is an element in the special category. There might not be one.
If there is no furthest block, then the UA must first pop all the nodes from the bottom of the stack of open elements, from the current node up to and including formatting element, then remove formatting element from the list of active formatting elements, and finally return.
Let common ancestor be the element immediately above formatting element in the stack of open elements.
Let a bookmark note the position of formatting element in the list of active formatting elements relative to the elements on either side of it in the list.
Let node and last node be furthest block. Follow these steps:
1. Let inner loop counter be zero.
2. Inner loop: Increment inner loop counter by one.
3. Let node be the element immediately above node in the stack of open elements, or if node is no longer in the stack of open elements (e.g. because it got removed by this algorithm), the element that was immediately above node in the stack of open elements before node was removed.
4. If node is formatting element, then go to the next step in the overall algorithm.
5. If inner loop counter is greater than three and node is in the list of active formatting elements, then remove node from the list of active formatting elements.
6. If node is not in the list of active formatting elements, then remove node from the stack of open elements and then go back to the step labeled inner loop.
7. Create an element for the token for which the element node was created, in the HTML namespace, with common ancestor as the intended parent; replace the entry for node in the list of active formatting elements with an entry for the new element, replace the entry for node in the stack of open elements with an entry for the new element, and let node be the new element.
8. If last node is furthest block, then move the aforementioned bookmark to be immediately after the new node in the list of active formatting elements.
9. Insert last node into node, first removing it from its previous parent node if any.
10. Let last node be node.
11. Return to the step labeled inner loop.
Insert whatever last node ended up being in the previous step at the appropriate place for inserting a node, but using common ancestor as the override target.
Create an element for the token for which formatting element was created, in the HTML namespace, with furthest block as the intended parent.
Take all of the child nodes of furthest block and append them to the element created in the last step.
Append that new element to furthest block.
Remove formatting element from the list of active formatting elements, and insert the new element into the list of active formatting elements at the position of the aforementioned bookmark.
Remove formatting element from the stack of open elements, and insert the new element into the stack of open elements immediately below the position of furthest block in that stack.
Jump back to the step labeled outer loop.

This algorithm's name, the "adoption agency algorithm", comes from the way it causes elements to change parents, and is in contrast with other possible algorithms for dealing with misnested content.
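The early-exit case above (no furthest block) is the simplest branch of the algorithm and can be sketched in a few lines. This is only an illustration, assuming the stack of open elements and the list of active formatting elements are plain Python lists whose last entry is the bottom of the stack; `Element` and `finish_without_furthest_block` are hypothetical names, not part of the standard.

```python
class Element:
    """Hypothetical stand-in for a DOM element; only the tag name is kept."""

    def __init__(self, name):
        self.name = name


def finish_without_furthest_block(open_elements, active_formatting, formatting_element):
    # Pop from the current node (the last entry) up to and including
    # the formatting element.
    while open_elements:
        if open_elements.pop() is formatting_element:
            break
    # Then remove the formatting element from the list of active
    # formatting elements; the caller returns after this.
    active_formatting.remove(formatting_element)
```

For a stack of html, body, b, i with b as the formatting element, this leaves html and body on the stack and only i in the active formatting list.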

13.2.6.4.8 The "text" insertion mode

When the user agent is to apply the rules for the "text" insertion mode, the user agent must handle the token as follows:

A character token

Insert the token's character.

This can never be a U+0000 NULL character; the tokenizer converts those to U+FFFD REPLACEMENT CHARACTER characters.

An end-of-file token

Parse error.

If the current node is a script element, mark the script element as "already started".

Pop the current node off the stack of open elements.

Switch the insertion mode to the original insertion mode and reprocess the token.

An end tag whose tag name is "script"

If the JavaScript execution context stack is empty, perform a microtask checkpoint.

Let script be the current node (which will be a script element).

Pop the current node off the stack of open elements.

Switch the insertion mode to the original insertion mode.

Let the old insertion point have the same value as the current insertion point. Let the insertion point be just before the next input character.

Increment the parser's script nesting level by one.

Prepare the script. This might cause some script to execute, which might cause new characters to be inserted into the tokenizer, and might cause the tokenizer to output more tokens, resulting in a reentrant invocation of the parser.

Decrement the parser's script nesting level by one. If the parser's script nesting level is zero, then set the parser pause flag to false.

Let the insertion point have the value of the old insertion point. (In other words, restore the insertion point to its previous value. This value might be the "undefined" value.)

At this stage, if there is a pending parsing-blocking script, then:

If the script nesting level is not zero:

Set the parser pause flag to true, and abort the processing of any nested invocations of the tokenizer, yielding control back to the caller. (Tokenization will resume when the caller returns to the "outer" tree construction stage.)

The tree construction stage of this particular parser is being called reentrantly, say from a call to document.write().

Otherwise:

Run these steps:

1. Let the script be the pending parsing-blocking script. There is no longer a pending parsing-blocking script.
2. Block the tokenizer for this instance of the HTML parser, such that the event loop will not run tasks that invoke the tokenizer.
3. If the parser's Document has a style sheet that is blocking scripts or the script's "ready to be parser-executed" flag is not set: spin the event loop until the parser's Document has no style sheet that is blocking scripts and the script's "ready to be parser-executed" flag is set.
4. If this parser has been aborted in the meantime, return.

This could happen if, e.g., while the spin the event loop algorithm is running, the browsing context gets closed, or the document.open() method gets invoked on the Document.
5. Unblock the tokenizer for this instance of the HTML parser, such that tasks that invoke the tokenizer can again be run.
6. Let the insertion point be just before the next input character.
7. Increment the parser's script nesting level by one (it should be zero before this step, so this sets it to one).
8. Execute the script.
9. Decrement the parser's script nesting level by one. If the parser's script nesting level is zero (which it always should be at this point), then set the parser pause flag to false.
10. Let the insertion point be undefined again.
11. If there is once again a pending parsing-blocking script, then repeat these steps from step 1.

Any other end tag

Pop the current node off the stack of open elements.

Switch the insertion mode to the original insertion mode.
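The two simplest entries of the "text" insertion mode (end-of-file and any other end tag) share the same pop-and-restore shape. The sketch below is a rough illustration only: `TextModeParser` and its fields are hypothetical names, elements are represented by their tag names, and parse-error reporting and script execution are left out.

```python
from dataclasses import dataclass, field


@dataclass
class TextModeParser:
    """Hypothetical parser state; elements are represented by tag name."""

    open_elements: list = field(default_factory=list)
    insertion_mode: str = "text"
    original_insertion_mode: str = "in head"
    script_already_started: bool = False

    def on_end_of_file(self) -> bool:
        # Mark an unfinished script as "already started" so it never runs.
        if self.open_elements[-1] == "script":
            self.script_already_started = True
        self.open_elements.pop()
        self.insertion_mode = self.original_insertion_mode
        return True  # the caller must reprocess the token in the restored mode

    def on_other_end_tag(self) -> bool:
        self.open_elements.pop()
        self.insertion_mode = self.original_insertion_mode
        return False  # the token is consumed
```

Returning a boolean for "reprocess the token" is one possible design; an implementation could equally loop inside a dispatcher.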

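13.2.6.4.9 The "in table" insertion mode

When the user agent is to apply the rules for the "in table" insertion mode, the user agent must handle the token as follows:

A character token, if the current node is a table, tbody, tfoot, thead, or tr element

Let the pending table character tokens be an empty list of tokens.

Let the original insertion mode be the current insertion mode.

Switch the insertion mode to "in table text" and reprocess the token.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.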

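A start tag whose tag name is "caption"

Clear the stack back to a table context. (See below.)

Insert a marker at the end of the list of active formatting elements.

Insert an HTML element for the token, then switch the insertion mode to "in caption".

A start tag whose tag name is "colgroup"

Clear the stack back to a table context. (See below.)

Insert an HTML element for the token, then switch the insertion mode to "in column group".

A start tag whose tag name is "col"

Clear the stack back to a table context. (See below.)

Insert an HTML element for a "colgroup" start tag token with no attributes, then switch the insertion mode to "in column group".

Reprocess the current token.

A start tag whose tag name is one of: "tbody", "tfoot", "thead"

Clear the stack back to a table context. (See below.)

Insert an HTML element for the token, then switch the insertion mode to "in table body".

A start tag whose tag name is one of: "td", "th", "tr"

Clear the stack back to a table context. (See below.)

Insert an HTML element for a "tbody" start tag token with no attributes, then switch the insertion mode to "in table body".

Reprocess the current token.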

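A start tag whose tag name is "table"

Parse error.

If the stack of open elements does not have a table element in table scope, ignore the token.

Otherwise:

Pop elements from this stack until a table element has been popped from the stack.

Reset the insertion mode appropriately.

Reprocess the token.

An end tag whose tag name is "table"

If the stack of open elements does not have a table element in table scope, this is a parse error; ignore the token.

Otherwise:

Pop elements from this stack until a table element has been popped from the stack.

Reset the insertion mode appropriately.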

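An end tag whose tag name is one of: "body", "caption", "col", "colgroup", "html", "tbody", "td", "tfoot", "th", "thead", "tr"

Parse error. Ignore the token.

A start tag whose tag name is one of: "style", "script", "template"

An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.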

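A start tag whose tag name is "input"

If the token does not have an attribute with the name "type", or if it does, but that attribute's value is not an ASCII case-insensitive match for the string "hidden", then act as described in the "anything else" entry below.

Otherwise:

Parse error.

Insert an HTML element for the token.

Pop that input element off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

A start tag whose tag name is "form"

Parse error.

If there is a template element on the stack of open elements, or if the form element pointer is not null, ignore the token.

Otherwise:

Insert an HTML element for the token, and set the form element pointer to point to the element created.

Pop that form element off the stack of open elements.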

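An end-of-file token

Process the token using the rules for the "in body" insertion mode.

Anything else

Parse error. Enable foster parenting, process the token using the rules for the "in body" insertion mode, and then disable foster parenting.

When the steps above require the UA to clear the stack back to a table context, it means that the UA must, while the current node is not a table, template, or html element, pop elements from the stack of open elements.

This is the same list of elements as used in the has an element in table scope steps.

The current node being an html element after this process is a fragment case.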
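Clearing the stack back to a table context, as described above, amounts to a short loop. The following is a minimal sketch, assuming the stack of open elements is a Python list of tag names with the current node last; the function name is hypothetical.

```python
TABLE_CONTEXT = {"table", "template", "html"}


def clear_stack_back_to_table_context(open_elements):
    # Pop until the current node (last entry) is table, template, or html.
    while open_elements[-1] not in TABLE_CONTEXT:
        open_elements.pop()
    return open_elements
```

Because the html element is always the first entry on the stack, the loop terminates without an explicit emptiness check.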

13.2.6.4.10 The "in table text" insertion mode

When the user agent is to apply the rules for the "in table text" insertion mode, the user agent must handle the token as follows:

A character token that is U+0000 NULL

Parse error. Ignore the token.

Any other character token

Append the character token to the pending table character tokens list.

Anything else

If any of the tokens in the pending table character tokens list are character tokens that are not ASCII whitespace, then this is a parse error: reprocess the character tokens in the pending table character tokens list using the rules given in the "anything else" entry in the "in table" insertion mode.

Otherwise, insert the characters given by the pending table character tokens list.

Switch the insertion mode to the original insertion mode and reprocess the token.
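The decision made in the "anything else" entry above can be sketched as a small function that inspects the pending table character tokens. This is an illustration under simplifying assumptions: each token is a one-character string, and the function only reports which path applies instead of performing the insertion; the name `flush_pending_table_characters` and the returned labels are hypothetical.

```python
ASCII_WHITESPACE = "\t\n\f\r "


def flush_pending_table_characters(pending):
    """Return (path, text) for the pending table character tokens."""
    text = "".join(pending)
    if any(ch not in ASCII_WHITESPACE for ch in text):
        # Parse error: the characters are reprocessed through the
        # "anything else" entry of "in table" (foster parenting).
        return ("anything else", text)
    # Whitespace only: the characters are inserted normally.
    return ("insert", text)
```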

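13.2.6.4.11 The "in caption" insertion mode

When the user agent is to apply the rules for the "in caption" insertion mode, the user agent must handle the token as follows:

An end tag whose tag name is "caption"

If the stack of open elements does not have a caption element in table scope, this is a parse error; ignore the token. (fragment case)

Otherwise:

Generate implied end tags.

Now, if the current node is not a caption element, then this is a parse error.

Pop elements from this stack until a caption element has been popped from the stack.

Clear the list of active formatting elements up to the last marker.

Switch the insertion mode to "in table".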

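A start tag whose tag name is one of: "caption", "col", "colgroup", "tbody", "td", "tfoot", "th", "thead", "tr"

An end tag whose tag name is "table"

If the stack of open elements does not have a caption element in table scope, this is a parse error; ignore the token. (fragment case)

Otherwise:

Generate implied end tags.

Now, if the current node is not a caption element, then this is a parse error.

Pop elements from this stack until a caption element has been popped from the stack.

Clear the list of active formatting elements up to the last marker.

Switch the insertion mode to "in table".

Reprocess the token.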

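An end tag whose tag name is one of: "body", "col", "colgroup", "html", "tbody", "td", "tfoot", "th", "thead", "tr"

Parse error. Ignore the token.

Anything else

Process the token using the rules for the "in body" insertion mode.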
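Several entries above depend on whether the stack of open elements has a given element in table scope. The check itself is defined elsewhere in the standard; the sketch below is an approximation that assumes tag-name strings on a Python list and uses the boundary set html, table, template. The function name is hypothetical.

```python
TABLE_SCOPE_BOUNDARY = {"html", "table", "template"}


def has_element_in_table_scope(open_elements, target):
    # Walk from the current node (last entry) toward the root.
    for name in reversed(open_elements):
        if name == target:
            return True
        if name in TABLE_SCOPE_BOUNDARY:
            # A scope boundary was reached before the target was found.
            return False
    return False
```

With a caption nested inside an outer table, a later inner table hides it: the walk stops at the inner table before reaching the caption.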

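13.2.6.4.12 The "in column group" insertion mode

When the user agent is to apply the rules for the "in column group" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.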

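A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is "col"

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

An end tag whose tag name is "colgroup"

If the current node is not a colgroup element, then this is a parse error; ignore the token.

Otherwise, pop the current node from the stack of open elements. Switch the insertion mode to "in table".

An end tag whose tag name is "col"

Parse error. Ignore the token.

A start tag whose tag name is "template"

An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.

An end-of-file token

Process the token using the rules for the "in body" insertion mode.

Anything else

If the current node is not a colgroup element, then this is a parse error; ignore the token.

Otherwise, pop the current node from the stack of open elements.

Switch the insertion mode to "in table".

Reprocess the token.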

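13.2.6.4.13 The "in table body" insertion mode

When the user agent is to apply the rules for the "in table body" insertion mode, the user agent must handle the token as follows:

A start tag whose tag name is "tr"

Clear the stack back to a table body context. (See below.)

Insert an HTML element for the token, then switch the insertion mode to "in row".

A start tag whose tag name is one of: "th", "td"

Parse error.

Clear the stack back to a table body context. (See below.)

Insert an HTML element for a "tr" start tag token with no attributes, then switch the insertion mode to "in row".

Reprocess the current token.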

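An end tag whose tag name is one of: "tbody", "tfoot", "thead"

If the stack of open elements does not have an element in table scope that is an HTML element with the same tag name as the token, this is a parse error; ignore the token.

Otherwise:

Clear the stack back to a table body context. (See below.)

Pop the current node from the stack of open elements. Switch the insertion mode to "in table".

A start tag whose tag name is one of: "caption", "col", "colgroup", "tbody", "tfoot", "thead"

An end tag whose tag name is "table"

If the stack of open elements does not have a tbody, thead, or tfoot element in table scope, this is a parse error; ignore the token.

Otherwise:

Clear the stack back to a table body context. (See below.)

Pop the current node from the stack of open elements. Switch the insertion mode to "in table".

Reprocess the token.

An end tag whose tag name is one of: "body", "caption", "col", "colgroup", "html", "td", "th", "tr"

Parse error. Ignore the token.

Anything else

Process the token using the rules for the "in table" insertion mode.

When the steps above require the UA to clear the stack back to a table body context, it means that the UA must, while the current node is not a tbody, tfoot, thead, template, or html element, pop elements from the stack of open elements.

The current node being an html element after this process is a fragment case.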

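13.2.6.4.14 The "in row" insertion mode

When the user agent is to apply the rules for the "in row" insertion mode, the user agent must handle the token as follows:

A start tag whose tag name is one of: "th", "td"

Clear the stack back to a table row context. (See below.)

Insert an HTML element for the token, then switch the insertion mode to "in cell".

Insert a marker at the end of the list of active formatting elements.

An end tag whose tag name is "tr"

If the stack of open elements does not have a tr element in table scope, this is a parse error; ignore the token.

Otherwise:

Clear the stack back to a table row context. (See below.)

Pop the current node (which will be a tr element) from the stack of open elements. Switch the insertion mode to "in table body".

A start tag whose tag name is one of: "caption", "col", "colgroup", "tbody", "tfoot", "thead", "tr"

An end tag whose tag name is "table"

If the stack of open elements does not have a tr element in table scope, this is a parse error; ignore the token.

Otherwise:

Clear the stack back to a table row context. (See below.)

Pop the current node (which will be a tr element) from the stack of open elements. Switch the insertion mode to "in table body".

Reprocess the token.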

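An end tag whose tag name is one of: "tbody", "tfoot", "thead"

If the stack of open elements does not have an element in table scope that is an HTML element with the same tag name as the token, this is a parse error; ignore the token.

If the stack of open elements does not have a tr element in table scope, ignore the token.

Otherwise:

Clear the stack back to a table row context. (See below.)

Pop the current node (which will be a tr element) from the stack of open elements. Switch the insertion mode to "in table body".

Reprocess the token.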

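An end tag whose tag name is one of: "body", "caption", "col", "colgroup", "html", "td", "th"

Parse error. Ignore the token.

Anything else

Process the token using the rules for the "in table" insertion mode.

When the steps above require the UA to clear the stack back to a table row context, it means that the UA must, while the current node is not a tr, template, or html element, pop elements from the stack of open elements.

The current node being an html element after this process is a fragment case.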
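Putting the pieces of the "tr" end tag entry together (scope check, clearing back to a table row context, popping the row) might look like the sketch below. It assumes tag-name strings on a Python list and omits parse-error reporting; `in_table_scope` and `handle_tr_end_tag` are hypothetical helper names, and the scope boundary set is taken from the table-scope definition elsewhere in the standard.

```python
ROW_CONTEXT = {"tr", "template", "html"}
TABLE_SCOPE_BOUNDARY = {"html", "table", "template"}


def in_table_scope(open_elements, target):
    for name in reversed(open_elements):
        if name == target:
            return True
        if name in TABLE_SCOPE_BOUNDARY:
            return False
    return False


def handle_tr_end_tag(open_elements, insertion_mode):
    """Return the insertion mode that applies after the token is handled."""
    if not in_table_scope(open_elements, "tr"):
        return insertion_mode  # token ignored; nothing changes
    # Clear the stack back to a table row context.
    while open_elements[-1] not in ROW_CONTEXT:
        open_elements.pop()
    open_elements.pop()  # the current node is now the tr element
    return "in table body"
```

The ignored branch corresponds to the fragment case, where the stack may hold nothing but the html element.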

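13.2.6.4.15 The "in cell" insertion mode

When the user agent is to apply the rules for the "in cell" insertion mode, the user agent must handle the token as follows:

An end tag whose tag name is one of: "td", "th"

If the stack of open elements does not have an element in table scope that is an HTML element with the same tag name as that of the token, then this is a parse error; ignore the token.

Otherwise:

Generate implied end tags.

Now, if the current node is not an HTML element with the same tag name as the token, then this is a parse error.

Pop elements from the stack of open elements until an HTML element with the same tag name as the token has been popped from the stack.

Clear the list of active formatting elements up to the last marker.

Switch the insertion mode to "in row".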

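A start tag whose tag name is one of: "caption", "col", "colgroup", "tbody", "td", "tfoot", "th", "thead", "tr"

If the stack of open elements does not have a td or th element in table scope, then this is a parse error; ignore the token. (fragment case)

Otherwise, close the cell (see below) and reprocess the token.

An end tag whose tag name is one of: "body", "caption", "col", "colgroup", "html"

Parse error. Ignore the token.

An end tag whose tag name is one of: "table", "tbody", "tfoot", "thead", "tr"

If the stack of open elements does not have an element in table scope that is an HTML element with the same tag name as that of the token, then this is a parse error; ignore the token.

Otherwise, close the cell (see below) and reprocess the token.

Anything else

Process the token using the rules for the "in body" insertion mode.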

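Where the steps above say to close the cell, they mean to run the following algorithm:

Generate implied end tags.
If the current node is not now a td element or a th element, then this is a parse error.
Pop elements from the stack of open elements until a td element or a th element has been popped from the stack.
Clear the list of active formatting elements up to the last marker.
Switch the insertion mode to "in row".

The stack of open elements cannot have both a td and a th element in table scope at the same time, nor can it have neither when the close the cell algorithm is invoked.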
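The close the cell algorithm above can be sketched as follows. This is an illustration only: elements are tag-name strings, `MARKER` is a hypothetical sentinel standing in for a marker entry in the list of active formatting elements, and the set of tags closed by "generate implied end tags" is an assumption taken from that algorithm's definition elsewhere in the standard.

```python
# Assumed set for "generate implied end tags" (defined outside this section).
IMPLIED_END_TAGS = {"dd", "dt", "li", "optgroup", "option", "p", "rb", "rp", "rt", "rtc"}
MARKER = object()  # hypothetical sentinel for a marker entry


def close_the_cell(open_elements, active_formatting):
    """Return (new insertion mode, number of parse errors)."""
    errors = 0
    # Generate implied end tags.
    while open_elements[-1] in IMPLIED_END_TAGS:
        open_elements.pop()
    if open_elements[-1] not in ("td", "th"):
        errors += 1
    # Pop until a td or th element has been popped.
    while open_elements.pop() not in ("td", "th"):
        pass
    # Clear the list of active formatting elements up to the last marker.
    while active_formatting:
        if active_formatting.pop() is MARKER:
            break
    return "in row", errors
```

The sketch relies on the invariant stated above: a td or th element is always in scope when the algorithm runs, so the popping loop cannot empty the stack.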

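13.2.6.4.16 The "in select" insertion mode

When the user agent is to apply the rules for the "in select" insertion mode, the user agent must handle the token as follows:

A character token that is U+0000 NULL

Parse error. Ignore the token.

Any other character token

Insert the token's character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is "option"

If the current node is an option element, pop that node from the stack of open elements.

Insert an HTML element for the token.

A start tag whose tag name is "optgroup"

If the current node is an option element, pop that node from the stack of open elements.

If the current node is an optgroup element, pop that node from the stack of open elements.

Insert an HTML element for the token.

An end tag whose tag name is "optgroup"

First, if the current node is an option element, and the node immediately before it in the stack of open elements is an optgroup element, then pop the current node from the stack of open elements.

If the current node is an optgroup element, then pop that node from the stack of open elements. Otherwise, this is a parse error; ignore the token.

An end tag whose tag name is "option"

If the current node is an option element, then pop that node from the stack of open elements. Otherwise, this is a parse error; ignore the token.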

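An end tag whose tag name is "select"

If the stack of open elements does not have a select element in select scope, this is a parse error; ignore the token. (fragment case)

Otherwise:

Pop elements from the stack of open elements until a select element has been popped from the stack.

Reset the insertion mode appropriately.

A start tag whose tag name is "select"

Parse error.

If the stack of open elements does not have a select element in select scope, ignore the token. (fragment case)

Otherwise:

Pop elements from the stack of open elements until a select element has been popped from the stack.

Reset the insertion mode appropriately.

It just gets treated like an end tag.

A start tag whose tag name is one of: "input", "keygen", "textarea"

Parse error.

If the stack of open elements does not have a select element in select scope, ignore the token. (fragment case)

Otherwise:

Pop elements from the stack of open elements until a select element has been popped from the stack.

Reset the insertion mode appropriately.

Reprocess the token.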

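A start tag whose tag name is one of: "script", "template"

An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.

An end-of-file token

Process the token using the rules for the "in body" insertion mode.

Anything else

Parse error. Ignore the token.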
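The "option" and "optgroup" start tag entries above implicitly close any open option (and, for optgroup, any open optgroup) before inserting the new element. A minimal sketch, assuming the stack of open elements is a Python list of tag names and that inserting an element simply pushes its name; both function names are hypothetical.

```python
def on_option_start_tag(open_elements):
    # An open option is implicitly closed by the next option.
    if open_elements[-1] == "option":
        open_elements.pop()
    open_elements.append("option")


def on_optgroup_start_tag(open_elements):
    # Close an open option first, then an open optgroup.
    if open_elements[-1] == "option":
        open_elements.pop()
    if open_elements[-1] == "optgroup":
        open_elements.pop()
    open_elements.append("optgroup")
```

As a result, option elements never nest inside one another and optgroup elements never nest inside optgroup elements while in this mode.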

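13.2.6.4.17 The "in select in table" insertion mode

When the user agent is to apply the rules for the "in select in table" insertion mode, the user agent must handle the token as follows:

A start tag whose tag name is one of: "caption", "table", "tbody", "tfoot", "thead", "tr", "td", "th"

Parse error.

Pop elements from the stack of open elements until a select element has been popped from the stack.

Reset the insertion mode appropriately.

Reprocess the token.

An end tag whose tag name is one of: "caption", "table", "tbody", "tfoot", "thead", "tr", "td", "th"

Parse error.

If the stack of open elements does not have an element in table scope that is an HTML element with the same tag name as that of the token, then ignore the token.

Otherwise:

Pop elements from the stack of open elements until a select element has been popped from the stack.

Reset the insertion mode appropriately.

Reprocess the token.

Anything else

Process the token using the rules for the "in select" insertion mode.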

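13.2.6.4.18 The "in template" insertion mode

When the user agent is to apply the rules for the "in template" insertion mode, the user agent must handle the token as follows:

A character token

A comment token

A DOCTYPE token

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is one of: "base", "basefont", "bgsound", "link", "meta", "noframes", "script", "style", "template", "title"

An end tag whose tag name is "template"

Process the token using the rules for the "in head" insertion mode.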

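A start tag whose tag name is one of: "caption", "colgroup", "tbody", "tfoot", "thead"

Pop the current template insertion mode off the stack of template insertion modes.

Push "in table" onto the stack of template insertion modes so that it is the new current template insertion mode.

Switch the insertion mode to "in table", and reprocess the token.

A start tag whose tag name is "col"

Pop the current template insertion mode off the stack of template insertion modes.

Push "in column group" onto the stack of template insertion modes so that it is the new current template insertion mode.

Switch the insertion mode to "in column group", and reprocess the token.

A start tag whose tag name is "tr"

Pop the current template insertion mode off the stack of template insertion modes.

Push "in table body" onto the stack of template insertion modes so that it is the new current template insertion mode.

Switch the insertion mode to "in table body", and reprocess the token.

A start tag whose tag name is one of: "td", "th"

Pop the current template insertion mode off the stack of template insertion modes.

Push "in row" onto the stack of template insertion modes so that it is the new current template insertion mode.

Switch the insertion mode to "in row", and reprocess the token.

Any other start tag

Pop the current template insertion mode off the stack of template insertion modes.

Push "in body" onto the stack of template insertion modes so that it is the new current template insertion mode.

Switch the insertion mode to "in body", and reprocess the token.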

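Any other end tag

Parse error. Ignore the token.

An end-of-file token

If there is no template element on the stack of open elements, then stop parsing. (fragment case)

Otherwise, this is a parse error.

Pop elements from the stack of open elements until a template element has been popped from the stack.

Clear the list of active formatting elements up to the last marker.

Pop the current template insertion mode off the stack of template insertion modes.

Reset the insertion mode appropriately.

Reprocess the token.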
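The start tag entries of the "in template" insertion mode boil down to a lookup from tag name to the insertion mode that takes over. The sketch below shows only that mapping, not the manipulation of the stack of template insertion modes; `template_mode_for_start_tag` is a hypothetical name, and returning `None` for tags handled by the "in head" rules is a choice made for this illustration.

```python
HEAD_HANDLED = {"base", "basefont", "bgsound", "link", "meta",
                "noframes", "script", "style", "template", "title"}


def template_mode_for_start_tag(tag):
    """Return the insertion mode to switch to, or None if "in head" handles the tag."""
    if tag in HEAD_HANDLED:
        return None  # processed using the "in head" rules; no mode switch
    if tag in {"caption", "colgroup", "tbody", "tfoot", "thead"}:
        return "in table"
    if tag == "col":
        return "in column group"
    if tag == "tr":
        return "in table body"
    if tag in {"td", "th"}:
        return "in row"
    return "in body"  # any other start tag
```

This is what lets a template hold table fragments such as a bare tr without a surrounding table: the first start tag decides which mode parses the rest of the content.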

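13.2.6.4.19 The "after body" insertion mode

When the user agent is to apply the rules for the "after body" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Process the token using the rules for the "in body" insertion mode.

A comment token

Insert a comment as the last child of the first element in the stack of open elements (the html element).

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

An end tag whose tag name is "html"

If the parser was created as part of the HTML fragment parsing algorithm, this is a parse error; ignore the token. (fragment case)

Otherwise, switch the insertion mode to "after after body".

An end-of-file token

Stop parsing.

Anything else

Parse error. Switch the insertion mode to "in body" and reprocess the token.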

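13.2.6.4.20 The "in frameset" insertion mode

When the user agent is to apply the rules for the "in frameset" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.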

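A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

A start tag whose tag name is "frameset"

Insert an HTML element for the token.

An end tag whose tag name is "frameset"

If the current node is the root html element, then this is a parse error; ignore the token. (fragment case)

Otherwise, pop the current node from the stack of open elements.

If the parser was not created as part of the HTML fragment parsing algorithm (fragment case), and the current node is no longer a frameset element, then switch the insertion mode to "after frameset".

A start tag whose tag name is "frame"

Insert an HTML element for the token. Immediately pop the current node off the stack of open elements.

Acknowledge the token's self-closing flag, if it is set.

A start tag whose tag name is "noframes"

Process the token using the rules for the "in head" insertion mode.

An end-of-file token

If the current node is not the root html element, then this is a parse error.

The current node can only be the root html element in the fragment case.

Stop parsing.

Anything else

Parse error. Ignore the token.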
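The "frameset" end tag entry above has two conditions that are easy to mix up: the token is ignored when the current node is the root html element, and the mode changes only outside the fragment case once no frameset remains open. A rough sketch, assuming tag-name strings on a Python list where the root html element is the only entry when it is the current node; the function and parameter names are hypothetical.

```python
def on_frameset_end_tag(open_elements, insertion_mode, fragment_case=False):
    """Return the insertion mode that applies after the token is handled."""
    if len(open_elements) == 1:
        # The current node is the root html element: token ignored.
        return insertion_mode
    open_elements.pop()
    if not fragment_case and open_elements[-1] != "frameset":
        return "after frameset"
    return insertion_mode
```

Nested framesets therefore stay in "in frameset" until the outermost one is closed.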

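13.2.6.4.21 The "after frameset" insertion mode

When the user agent is to apply the rules for the "after frameset" insertion mode, the user agent must handle the token as follows:

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the character.

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

An end tag whose tag name is "html"

Switch the insertion mode to "after after frameset".

A start tag whose tag name is "noframes"

Process the token using the rules for the "in head" insertion mode.

An end-of-file token

Stop parsing.

Anything else

Parse error. Ignore the token.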

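13.2.6.4.22 The "after after body" insertion mode

When the user agent is to apply the rules for the "after after body" insertion mode, the user agent must handle the token as follows:

A comment token

Insert a comment as the last child of the Document object.

A DOCTYPE token

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

An end-of-file token

Stop parsing.

Anything else

Parse error. Switch the insertion mode to "in body" and reprocess the token.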

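13.2.6.4.23 The "after after frameset" insertion mode

When the user agent is to apply the rules for the "after after frameset" insertion mode, the user agent must handle the token as follows:

A comment token

Insert a comment as the last child of the Document object.

A DOCTYPE token

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

A start tag whose tag name is "html"

Process the token using the rules for the "in body" insertion mode.

An end-of-file token

Stop parsing.

A start tag whose tag name is "noframes"

Process the token using the rules for the "in head" insertion mode.

Anything else

Parse error. Ignore the token.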

13.2.6.5 The rules for parsing tokens in foreign content

When the user agent is to apply the rules for parsing tokens in foreign content, the user agent must handle the token as follows:

A character token that is U+0000 NULL

Parse error. Insert a U+FFFD REPLACEMENT CHARACTER character.

A character token that is one of U+0009 CHARACTER TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE

Insert the token's character.

Any other character token

Insert the token's character.

Set the frameset-ok flag to "not ok".

A comment token

Insert a comment.

A DOCTYPE token

Parse error. Ignore the token.

A start tag whose tag name is one of: "b", "big", "blockquote", "body", "br", "center", "code", "dd", "div", "dl", "dt", "em", "embed", "h1", "h2", "h3", "h4", "h5", "h6", "head", "hr", "i", "img", "li", "listing", "menu", "meta", "nobr", "ol", "p", "pre", "ruby", "s", "small", "span", "strong", "strike", "sub", "sup", "table", "tt", "u", "ul", "var"

A start tag whose tag name is "font", if the token has any attributes named "color", "face", or "size"