hast utility to get the plain-text value of a node.
This package is a utility that takes a hast node and gets its plain-text value. It is like the DOMs Node#innerText
, which is a bit nicer than Node#textContent
, because this turns <br>
elements into line breaks and uses '\t'
(tabs) between table cells.
There are some small deviations from the spec, because the DOM has knowledge of associated CSS, and can take into account that elements have display: none
or text-transform
association with them, and this utility can’t do that.
This is a small utility that is useful when you want a plain-text version of a node that is close to how it’s “visible” to users.
This utility is similar to hast-util-to-string
, which is simpler, and more like the Node#textContent
algorithm discussed above.
There is also a package hast-util-from-text
, which sort of does the inverse: it takes a string, sets that as text on the node, while turning line endings into <br>
s
This package is ESM only. In Node.js (version 16+), install with npm:
npm install hast-util-to-text
In Deno with esm.sh
:
import {toText} from 'https://esm.sh/hast-util-to-text@4'
In browsers with esm.sh
:
<script type="module"> import {toText} from 'https://esm.sh/hast-util-to-text@4?bundle' </script>
import {h} from 'hastscript' import {toText} from 'hast-util-to-text' const tree = h('div', [ h('h1', {hidden: true}, 'Alpha.'), h('article', [ h('p', ['Bravo', h('br'), 'charlie.']), h('p', 'Delta echo \t foxtrot.') ]) ]) console.log(toText(tree))
Yields:
Bravo charlie. Delta echo foxtrot.
This package exports the identifier toText
. There is no default export.
Get the plain-text value of a node.
Serialized tree
(string
).
tree
is a comment, returns its value
tree
is a text, applies normal whitespace collapsing to its value
, as defined by the CSS Text spectree
is a root or element, applies an algorithm similar to the innerText
getter as defined by HTML👉 Note: the algorithm acts as if
tree
is being rendered, and as if we’re a CSS-supporting user agent, with scripting enabled.
tree
is an element that is not displayed (such as a head
), we’ll still use the innerText
algorithm instead of switching to textContent
tree
are elements that are not displayed, they are ignoredaudio
) are treated like non-replaced elementsConfiguration (TypeScript type).
whitespace
(Whitespace
, default: 'normal'
) — default whitespace setting to useValid and useful whitespace values (from CSS) (TypeScript type).
type Whitespace = 'normal' | 'nowrap' | 'pre' | 'pre-wrap'
This package is fully typed with TypeScript. It exports the additional types Options
and Whitespace
.
Projects maintained by the unified collective are compatible with maintained versions of Node.js.
When we cut a new major release, we drop support for unmaintained versions of Node. This means we try to keep the current release line, hast-util-to-text@^4
, compatible with Node.js 16.
hast-util-to-text
does not change the syntax tree so there are no openings for cross-site scripting (XSS) attacks.
hast-util-to-string
— get the plain-text value (textContent
)hast-util-from-text
— set the plain-text value (innerText
)hast-util-from-string
— set the plain-text value (textContent
)See contributing.md
in syntax-tree/.github
for ways to get started. See support.md
for ways to get help.
This project has a code of conduct. By interacting with this repository, organization, or community you agree to abide by its terms.
RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4