Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

mdast utility to serialize markdown

License

NotificationsYou must be signed in to change notification settings

syntax-tree/mdast-util-to-markdown

BuildCoverageDownloadsSizeSponsorsBackersChat

mdast utility that turns a syntax tree into markdown.

Contents

What is this?

This package is a utility that takes anmdastsyntax tree as input and turns it into serialized markdown.

This utility is a low level project.It’s used inremark-stringify,which focusses on making it easier to transform content by abstractingthese internals away.

When should I use this?

If you want to handle syntax trees manually, use this.For an easier time processing content, use theremarkecosystem instead.

You can combine this utility with other utilities to add syntax extensions.Notable examples that deeply integrate with it aremdast-util-gfm,mdast-util-mdx,mdast-util-frontmatter,mdast-util-math, andmdast-util-directive.

Install

This package isESM only.In Node.js (version 16+), install withnpm:

npm install mdast-util-to-markdown

In Deno withesm.sh:

import{toMarkdown}from'https://esm.sh/mdast-util-to-markdown@2'

In browsers withesm.sh:

<scripttype="module">import{toMarkdown}from'https://esm.sh/mdast-util-to-markdown@2?bundle'</script>

Use

Say our moduleexample.js looks as follows:

/** *@import {Root} from 'mdast' */import{toMarkdown}from'mdast-util-to-markdown'/**@type {Root} */consttree={type:'root',children:[{type:'blockquote',children:[{type:'thematicBreak'},{type:'paragraph',children:[{type:'text',value:'- a\nb !'},{type:'link',url:'example.com',children:[{type:'text',value:'d'}]}]}]}]}console.log(toMarkdown(tree))

…now runningnode example.js yields:

>***>>\- a>b\![d](example.com)

👉Note: observe the properly escaped characters which would otherwiseturn into a list and image respectively.

API

This package exports the identifiersdefaultHandlersandtoMarkdown.There is no default export.

toMarkdown(tree[, options])

Turn anmdast syntax tree into markdown.

Parameters
  • tree (Node)— tree to serialize
  • options (Options, optional)— configuration
Returns

Serialized markdown representingtree (string).

defaultHandlers

Default (CommonMark) handlers (Handlers).

ConstructName

Construct names for things generated bymdast-util-to-markdown (TypeScripttype).

This is an enum of strings, each being a semantic label, useful to know whenserializing whether we’re for example in a double (") or single (') quotedtitle.

Type
typeConstructName=ConstructNameMap[keyofConstructNameMap]

ConstructNameMap

Interface of registered constructs (TypeScript type).

Type
interfaceConstructNameMap{/* see code */}

When working on extensions that use new constructs, extend the correspondinginterface to register its name:

declare module'mdast-util-to-markdown'{interfaceConstructNameMap{// Register a new construct name (value is used, key should match it).gfmStrikethrough:'gfmStrikethrough'}}

Handle

Handle a particular node (TypeScript type).

Parameters
  • node (any)— expected mdast node
  • parent (Node, optional)— parent ofnode
  • state (State)— info passed around about the current state
  • info (Info)— info on the surrounding of the node that is serialized
Returns

Serialized markdown representingnode (string).

Handlers

Handle particular nodes (TypeScript type).

Each key is a node type (Node['type']), each value its corresponding handler(Handle).

Type
typeHandlers=Record<Node['type'],Handle>

Info

Info on the surrounding of the node that is serialized (TypeScript type).

Fields
  • now (Point)— current point
  • lineShift (number)— number of columns each line will be shifted by wrapping nodes
  • before (string)— characters before this (guaranteed to be one, can be more)
  • after (string)— characters after this (guaranteed to be one, can be more)

Join

How to join two blocks (TypeScript type).

“Blocks” are typically joined by one blank line.Sometimes it’s nicer to have them flush next to each other, yet other timesthey cannot occur together at all.

Join functions receive two adjacent siblings and their parent and what theyreturn defines how many blank lines to use between them.

Parameters
  • left (Node)— first of two adjacent siblings
  • right (Node)— second of two adjacent siblings
  • parent (Node)— parent of the two siblings
  • state (State)— info passed around about the current state
Returns

How many blank lines to use between the siblings (boolean,number,optional).

Wheretrue is as passing1 andfalse means the nodes cannot bejoined by a blank line, such as two adjacent block quotes or indented codeafter a list, in which case a comment will be injected to break them up:

>Quote 1<!---->>Quote 2

👉Note: abusing this feature will break markdown.One such example is when returning0 for two paragraphs, which will resultin the text running together, and in the future to be seen as one paragraph.

Map

Map function to pad a single line (TypeScript type).

Parameters
  • value (string)— a single line of serialized markdown
  • line (number)— line number relative to the fragment
  • blank (boolean)— whether the line is considered blank in markdown
Returns

Padded line (string).

Options

Configuration (TypeScript type).

Fields

The following fields influence how markdown is serialized.

options.bullet

Marker to use for bullets of items in unordered lists ('*','+', or'-',default:'*').

There are three cases where the primary bullet cannot be used:

  • when three or more list items are on their own, the last one is empty, andbullet is also a validrule:* - +; this would turn into a thematicbreak if serialized with three primary bullets;bulletOther is used forthe last item
  • when a thematic break is the first child of a list item andbullet is thesame character asrule:- ***; this would turn into a single thematicbreak if serialized with primary bullets;bulletOther is used for theitem
  • when two unordered lists appear next to each other:* a\n- b;bulletOther is used for such lists
options.bulletOther

Marker to use in certain cases where the primary bullet doesn’t work ('*','+', or'-', default:'-' whenbullet is'*','*' otherwise).

Cannot be equal tobullet.

options.bulletOrdered

Marker to use for bullets of items in ordered lists ('.' or')', default:'.').

There is one case where the primary bullet for ordered items cannot be used:

  • when two ordered lists appear next to each other:1. a\n2) b; to solvethat,'.' will be used whenbulletOrdered is')', and'.' otherwise
options.closeAtx

Whether to add the same number of number signs (#) at the end of an ATXheading as the opening sequence (boolean, default:false).

options.emphasis

Marker to use for emphasis ('*' or'_', default:'*').

options.fence

Marker to use for fenced code ('`' or'~', default:'`').

options.fences

Whether to use fenced code always (boolean, default:true).The default is to use fenced code if there is a language defined, if the code isempty, or if it starts or ends in blank lines.

options.incrementListMarker

Whether to increment the counter of ordered lists items (boolean, default:true).

options.listItemIndent

How to indent the content of list items ('mixed','one', or'tab',default:'one').Either with the size of the bullet plus one space (when'one'), a tab stop('tab'), or depending on the item and its parent list ('mixed', uses'one'if the item and list are tight and'tab' otherwise).

options.quote

Marker to use for titles ('"' or"'", default:'"').

options.resourceLink

Whether to always use resource links (boolean, default:false).The default is to use autolinks (<https://example.com>) when possibleand resource links ([text](url)) otherwise.

options.rule

Marker to use for thematic breaks ('*','-', or'_', default:'*').

options.ruleRepetition

Number of markers to use for thematic breaks (number, default:3, min:3).

options.ruleSpaces

Whether to add spaces between markers in thematic breaks (boolean, default:false).

options.setext

Whether to use setext headings when possible (boolean, default:false).The default is to always use ATX headings (# heading) instead of setextheadings (heading\n=======).Setext headings cannot be used for empty headings or headings with a rank ofthree or more.

options.strong

Marker to use for strong ('*' or'_', default:'*').

options.tightDefinitions

Whether to join definitions without a blank line (boolean, default:false).

The default is to add blank lines between any flow (“block”) construct.Turning this option on is a shortcut for aJoin function like so:

functionjoinTightDefinitions(left,right){if(left.type==='definition'&&right.type==='definition'){return0}}
options.handlers

Handle particular nodes (Handlers, optional).

options.join

How to join blocks (Array<Join>, optional).

options.unsafe

Schemas that define when characters cannot occur(Array<Unsafe>, optional).

options.extensions

List of extensions (Array<Options>, default:[]).Each extension is an object with the same interface asOptions itself.

SafeConfig

Configuration passed tostate.safe (TypeScript type).

Fields
  • before (string)— characters before this (guaranteed to be one, can be more)
  • after (string)— characters after this (guaranteed to be one, can be more)
  • encode (Array<string>, optional)— extra characters thatmust be encoded (as character references) insteadof escaped (character escapes).Only ASCII punctuation will use character escapes, so you never need topass non-ASCII-punctuation here

State

Info passed around about the current state (TypeScript type).

Fields
  • stack (Array<ConstructName>)— stack of constructs we’re in
  • indexStack (Array<number>)— positions of child nodes in their parents
  • associationId ((node: Association) => string)— get an identifier from an association to match it to others (seeAssociation)
  • enter ((construct: ConstructName) => () => undefined)— enter a construct (returns a corresponding exit function)(seeConstructName)
  • indentLines ((value: string, map: Map) => string)— pad serialized markdown (seeMap)
  • compilePattern ((pattern: Unsafe) => RegExp)— compile an unsafe pattern to a regex (seeUnsafe)
  • containerFlow ((parent: Node, info: Info) => string)— serialize flow children (seeInfo)
  • containerPhrasing ((parent: Node, info: Info) => string)— serialize phrasing children (seeInfo)
  • createTracker ((info: Info) => Tracker)— track positional info in the output (seeInfo,Tracker)
  • safe ((value: string, config: SafeConfig) => string)— make a string safe for embedding (seeSafeConfig)
  • options (Options)— applied user configuration
  • unsafe (Array<Unsafe>)— applied unsafe patterns
  • join (Array<Join>)— applied join handlers
  • handle (Handle)— call the configured handler for the given node
  • handlers (Handlers)— applied handlers
  • bulletCurrent (string orundefined)— list marker currently in use
  • bulletLastUsed (string orundefined)— list marker previously in use

Tracker

Track positional info in the output (TypeScript type).

This info isn’t used yet but such functionality will allow line wrapping,source maps, etc.

Fields
  • current (() => Info)— get current tracked info
  • shift ((value: number) => undefined)— define a relative increased line shift (the typical indent for lines)
  • move ((value: string) => string)— move past some generated markdown

Unsafe

Schema that defines when a character cannot occur (TypeScript type).

Fields
  • character (string)— single unsafe character
  • inConstruct (Array<ConstructName>,ConstructName, optional)— constructs where this is bad
  • notInConstruct (Array<ConstructName>,ConstructName, optional)— constructs where this is fine again
  • before (string, optional)—character is bad when this is before it (cannot be used together withatBreak)
  • after (string, optional)—character is bad when this is after it
  • atBreak (boolean, optional)—character is bad at a break (cannot be used together withbefore)

List of extensions

Syntax

Markdown is serialized according to CommonMark but care is taken to format insuch a way that the resulting markdown should work with most markdown parsers.Extensions can add support for custom syntax.

Syntax tree

The syntax tree ismdast.

Types

This package is fully typed withTypeScript.It exports the additional typesConstructName,ConstructNameMap,Handle,Handlers,Info,Join,Map,Options,SafeConfig,State, andUnsafe.

Compatibility

Projects maintained by the unified collective are compatible with maintainedversions of Node.js.

When we cut a new major release, we drop support for unmaintained versions ofNode.This means we try to keep the current release line,mdast-util-to-markdown@^2,compatible with Node.js 16.

Security

mdast-util-to-markdown will do its best to serialize markdown to match thesyntax tree, but there are several cases where that is impossible.It’ll do its best, but complete roundtripping is impossible given that any valuecould be injected into the tree.

As markdown is sometimes used for HTML, and improper use of HTML can open you upto across-site scripting (XSS) attack,use ofmdast-util-to-markdownand parsing it again later could potentially be unsafe.When parsing markdown afterwards and then going to HTML, use something likehast-util-sanitize to make the tree safe.

Related

Contribute

Seecontributing.md insyntax-tree/.github forways to get started.Seesupport.md for ways to get help.

This project has acode of conduct.By interacting with this repository, organization, or community you agree toabide by its terms.

License

MIT ©Titus Wormer


[8]ページ先頭

©2009-2025 Movatter.jp