marked/docs/USING_PRO.md

## Extending Marked

To champion the single-responsibility and open/closed principles, we have tried to make it relatively painless to extend marked. If you are looking to add custom functionality, this is the place to start.

<h2 id="renderer">The renderer</h2>

The renderer defines the output of the parser.

**Example:** Overriding default heading token by adding an embedded anchor tag like on GitHub.

```js
// Create reference instance
const marked = require('marked');

// Override function
const renderer = {
  heading(text, level) {
    const escapedText = text.toLowerCase().replace(/[^\w]+/g, '-');

    return `
            <h${level}>
              <a name="${escapedText}" class="anchor" href="#${escapedText}">
                <span class="header-link"></span>
              </a>
              ${text}
            </h${level}>`;
  }
};

marked.use({ renderer });

// Run marked
console.log(marked('# heading+'));
```

**Output:**

```html
<h1>
  <a name="heading-" class="anchor" href="#heading-">
    <span class="header-link"></span>
  </a>
  heading+
</h1>
```

### Block level renderer methods

- code(*string* code, *string* infostring, *boolean* escaped)
- blockquote(*string* quote)
- html(*string* html)
- heading(*string* text, *number* level, *string* raw, *Slugger* slugger)
- hr()
- list(*string* body, *boolean* ordered, *number* start)
- listitem(*string* text, *boolean* task, *boolean* checked)
- checkbox(*boolean* checked)
- paragraph(*string* text)
- table(*string* header, *string* body)
- tablerow(*string* content)
- tablecell(*string* content, *object* flags)

`slugger` has the `slug` method to create a unique id from value:

```js
slugger.slug('foo')   // foo
slugger.slug('foo')   // foo-1
slugger.slug('foo')   // foo-2
slugger.slug('foo 1') // foo-1-1
slugger.slug('foo-1') // foo-1-2
...
```

`flags` has the following properties:

```js
{
    header: true || false,
    align: 'center' || 'left' || 'right'
}
```

### Inline level renderer methods

- strong(*string* text)
- em(*string* text)
- codespan(*string* code)
- br()
- del(*string* text)
- link(*string* href, *string* title, *string* text)
- image(*string* href, *string* title, *string* text)
- text(*string* text)

<h2 id="tokenizer">The tokenizer</h2>

The tokenizer defines how to turn markdown text into tokens.

**Example:** Overriding default `codespan` tokenizer to include LaTeX.

```js
// Create reference instance
const marked = require('marked');

// Override function
const tokenizer = {
  codespan(src) {
    const match = src.match(/\$+([^\$\n]+?)\$+/);
    if (match) {
      return {
        type: 'codespan',
        raw: match[0],
        text: match[1].trim()
      };
    }

    // return false to use original codespan tokenizer
    return false;
  }
};

marked.use({ tokenizer });

// Run marked
console.log(marked('$ latex code $\n\n` other code `'));
```

**Output:**

```html
<p><code>latex code</code></p>
<p><code>other code</code></p>
```

### Block level tokenizer methods

- space(*string* src)
- code(*string* src, *array* tokens)
- fences(*string* src)
- heading(*string* src)
- nptable(*string* src)
- hr(*string* src)
- blockquote(*string* src)
- list(*string* src)
- html(*string* src)
- def(*string* src)
- table(*string* src)
- lheading(*string* src)
- paragraph(*string* src)
- text(*string* src)

### Inline level tokenizer methods

- escape(*string* src)
- tag(*string* src, *bool* inLink, *bool* inRawBlock)
- link(*string* src)
- reflink(*string* src, *object* links)
- strong(*string* src)
- em(*string* src)
- codespan(*string* src)
- br(*string* src)
- del(*string* src)
- autolink(*string* src, *function* mangle)
- url(*string* src, *function* mangle)
- inlineText(*string* src, *bool* inRawBlock, *function* smartypants)

`mangle` is a method that changes text to HTML character references:

```js
mangle('test@example.com')
// "&#x74;&#101;&#x73;&#116;&#x40;&#101;&#120;&#x61;&#x6d;&#112;&#108;&#101;&#46;&#x63;&#111;&#x6d;"
```

`smartypants` is a method that translates plain ASCII punctuation characters into “smart” typographic punctuation HTML entities:

https://daringfireball.net/projects/smartypants/

```js
smartypants('"this ... string"')
// "“this … string”"
```

<h2 id="lexer">The lexer</h2>

The lexer takes a markdown string and calls the tokenizer functions.

<h2 id="parser">The parser</h2>

The parser takes tokens as input and calls the renderer functions.

***

<h2 id="extend">Access to lexer and parser</h2>

You also have direct access to the lexer and parser if you so desire.

``` js
const tokens = marked.lexer(markdown, options);
console.log(marked.parser(tokens, options));
```

``` js
const lexer = new marked.Lexer(options);
const tokens = lexer.lex(markdown);
console.log(tokens);
console.log(lexer.tokenizer.rules.block); // block level rules used
console.log(lexer.tokenizer.rules.inline); // inline level rules used
console.log(marked.Lexer.rules.block); // all block level rules
console.log(marked.Lexer.rules.inline); // all inline level rules
```

``` bash
$ node
> require('marked').lexer('> I am using marked.')
[
  {
    type: "blockquote",
    raw: "> I am using marked.",
    tokens: [
      {
        type: "paragraph",
        raw: "I am using marked.",
        text: "I am using marked.",
        tokens: [
          {
            type: "text",
            raw: "I am using marked.",
            text: "I am using marked."
          }
        ]
      }
    ]
  },
  links: {}
]
```

The Lexer builds an array of tokens, which will be passed to the Parser.
The Parser processes each token in the token array:

``` js
const marked = require('marked');

const md = `
  # heading

  [link][1]

  [1]: #heading "heading"
`;

const tokens = marked.lexer(md);
console.log(tokens);

const html = marked.parser(tokens);
console.log(html);
```

``` bash
[
  {
    type: "heading",
    raw: "  # heading\n\n",
    depth: 1,
    text: "heading",
    tokens: [
      {
        type: "text",
        raw: "heading",
        text: "heading"
      }
    ]
  },
  {
    type: "paragraph",
    raw: "  [link][1]",
    text: "  [link][1]",
    tokens: [
      {
        type: "text",
        raw: "  ",
        text: "  "
      },
      {
        type: "link",
        raw: "[link][1]",
        text: "link",
        href: "#heading",
        title: "heading",
        tokens: [
          {
            type: "text",
            raw: "link",
            text: "link"
          }
        ]
      }
    ]
  },
  {
    type: "space",
    raw: "\n\n"
  },
  links: {
    "1": {
      href: "#heading",
      title: "heading"
    }
  }
]
<h1 id="heading">heading</h1>
<p>  <a href="#heading" title="heading">link</a></p>
```
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00			`## Extending Marked`
Base README 2018-02-25 16:01:22 -05:00
update examples with es6+ 2019-07-08 09:13:53 -05:00			`To champion the single-responsibility and open/closed principles, we have tried to make it relatively painless to extend marked. If you are looking to add custom functionality, this is the place to start.`
Base README 2018-02-25 16:01:22 -05:00
Typos and grammar 2018-02-25 21:56:00 -05:00			`<h2 id="renderer">The renderer</h2>`
Base README 2018-02-25 16:01:22 -05:00
update docs 2020-04-01 21:08:44 -05:00			`The renderer defines the output of the parser.`
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00
			`Example: Overriding default heading token by adding an embedded anchor tag like on GitHub.`

			```js
			`// Create reference instance`
update examples with es6+ 2019-07-08 09:13:53 -05:00			`const marked = require('marked');`
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00
			`// Override function`
update docs with marked.use 2020-04-19 00:38:19 -05:00			`const renderer = {`
			`heading(text, level) {`
			`const escapedText = text.toLowerCase().replace(/[^\w]+/g, '-');`

			return `
			`<h${level}>`
			`<a name="${escapedText}" class="anchor" href="#${escapedText}">`
			`<span class="header-link"></span>`
			`</a>`
			`${text}`
			</h${level}>`;
			`}`
Base README 2018-02-25 16:01:22 -05:00			`};`

update docs with marked.use 2020-04-19 00:38:19 -05:00			`marked.use({ renderer });`

Renderer, lexer, parser 2018-02-25 16:37:11 -05:00			`// Run marked`
update docs with marked.use 2020-04-19 00:38:19 -05:00			`console.log(marked('# heading+'));`
Base README 2018-02-25 16:01:22 -05:00			```
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00
			`Output:`

Base README 2018-02-25 16:01:22 -05:00			```html
			`<h1>`
			`<a name="heading-" class="anchor" href="#heading-">`
			`<span class="header-link"></span>`
			`</a>`
			`heading+`
			`</h1>`
			```

Renderer, lexer, parser 2018-02-25 16:37:11 -05:00			`### Block level renderer methods`
Base README 2018-02-25 16:01:22 -05:00
update document about extending renderer 2019-02-13 14:42:33 +09:00			`- code(string code, string infostring, boolean escaped)`
Base README 2018-02-25 16:01:22 -05:00			`- blockquote(string quote)`
			`- html(string html)`
update document about extending renderer 2019-02-13 14:42:33 +09:00			`- heading(string text, number level, string raw, Slugger slugger)`
Base README 2018-02-25 16:01:22 -05:00			`- hr()`
Add missing parameters for renderer methods Some renderer methods have more parameters than documented. Let’s update them. 2018-07-29 00:20:04 +09:00			`- list(string body, boolean ordered, number start)`
Update list item documentation 2019-03-08 17:28:01 -06:00			`- listitem(string text, boolean task, boolean checked)`
update document about extending renderer 2019-02-13 14:42:33 +09:00			`- checkbox(boolean checked)`
Base README 2018-02-25 16:01:22 -05:00			`- paragraph(string text)`
			`- table(string header, string body)`
			`- tablerow(string content)`
			`- tablecell(string content, object flags)`

update docs 2020-04-07 11:43:11 -05:00			`slugger` has the `slug` method to create a unique id from value:
update document about extending renderer 2019-02-13 14:42:33 +09:00
			```js
			`slugger.slug('foo') // foo`
			`slugger.slug('foo') // foo-1`
			`slugger.slug('foo') // foo-2`
			`slugger.slug('foo 1') // foo-1-1`
			`slugger.slug('foo-1') // foo-1-2`
			`...`
			```

Base README 2018-02-25 16:01:22 -05:00			`flags` has the following properties:

			```js
			`{`
			`header: true \|\| false,`
			`align: 'center' \|\| 'left' \|\| 'right'`
			`}`
			```

Renderer, lexer, parser 2018-02-25 16:37:11 -05:00			`### Inline level renderer methods`
Base README 2018-02-25 16:01:22 -05:00
			`- strong(string text)`
			`- em(string text)`
			`- codespan(string code)`
			`- br()`
			`- del(string text)`
			`- link(string href, string title, string text)`
			`- image(string href, string title, string text)`
			`- text(string text)`

update docs 2020-04-07 11:43:11 -05:00			`<h2 id="tokenizer">The tokenizer</h2>`

			`The tokenizer defines how to turn markdown text into tokens.`

Update docs/USING_PRO.md Co-Authored-By: Steven <steven@ceriously.com> 2020-04-14 19:52:44 -05:00			Example: Overriding default `codespan` tokenizer to include LaTeX.
update docs 2020-04-07 11:43:11 -05:00
			```js
			`// Create reference instance`
			`const marked = require('marked');`

			`// Override function`
update docs with marked.use 2020-04-19 00:38:19 -05:00			`const tokenizer = {`
			`codespan(src) {`
			`const match = src.match(/\$+([^\$\n]+?)\$+/);`
			`if (match) {`
			`return {`
			`type: 'codespan',`
			`raw: match[0],`
			`text: match[1].trim()`
			`};`
			`}`
return false to use last renderer/tokenizer 2020-04-19 01:08:54 -05:00
Fix docs 2020-04-19 01:38:41 -05:00			`// return false to use original codespan tokenizer`
return false to use last renderer/tokenizer 2020-04-19 01:08:54 -05:00			`return false;`
update docs 2020-04-07 11:43:11 -05:00			`}`
			`};`

update docs with marked.use 2020-04-19 00:38:19 -05:00			`marked.use({ tokenizer });`

update docs 2020-04-07 11:43:11 -05:00			`// Run marked`
update docs with marked.use 2020-04-19 00:38:19 -05:00			console.log(marked('$ latex code $\n\n` other code `'));
update docs 2020-04-07 11:43:11 -05:00			```

			`Output:`

			```html
update docs with marked.use 2020-04-19 00:38:19 -05:00			`<p><code>latex code</code></p>`
			`<p><code>other code</code></p>`
update docs 2020-04-07 11:43:11 -05:00			```

			`### Block level tokenizer methods`

only send needed variables to tokenizer 2020-04-14 13:23:41 -05:00			`- space(string src)`
			`- code(string src, array tokens)`
			`- fences(string src)`
			`- heading(string src)`
			`- nptable(string src)`
			`- hr(string src)`
			`- blockquote(string src)`
			`- list(string src)`
			`- html(string src)`
			`- def(string src)`
			`- table(string src)`
			`- lheading(string src)`
			`- paragraph(string src)`
			`- text(string src)`
update docs 2020-04-07 11:43:11 -05:00
			`### Inline level tokenizer methods`

only send needed variables to tokenizer 2020-04-14 13:23:41 -05:00			`- escape(string src)`
			`- tag(string src, bool inLink, bool inRawBlock)`
			`- link(string src)`
			`- reflink(string src, object links)`
			`- strong(string src)`
			`- em(string src)`
			`- codespan(string src)`
			`- br(string src)`
			`- del(string src)`
move smartypants, mangle, and rules to lexer 2020-04-14 16:40:06 -05:00			`- autolink(string src, function mangle)`
			`- url(string src, function mangle)`
			`- inlineText(string src, bool inRawBlock, function smartypants)`
update docs 2020-04-07 11:43:11 -05:00
add mangle smartypants examples 2020-04-14 16:53:06 -05:00			`mangle` is a method that changes text to HTML character references:

			```js
			`mangle('test@example.com')`
			`// "test@example.com"`
			```

			`smartypants` is a method that translates plain ASCII punctuation characters into “smart” typographic punctuation HTML entities:

			`https://daringfireball.net/projects/smartypants/`

			```js
			`smartypants('"this ... string"')`
			`// "“this … string”"`
			```

Renderer, lexer, parser 2018-02-25 16:37:11 -05:00			`<h2 id="lexer">The lexer</h2>`

update docs 2020-04-07 11:43:11 -05:00			`The lexer takes a markdown string and calls the tokenizer functions.`
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00
			`<h2 id="parser">The parser</h2>`

update docs 2020-04-02 01:05:04 -05:00			`The parser takes tokens as input and calls the renderer functions.`
Renderer, lexer, parser 2018-02-25 16:37:11 -05:00
			`***`

Base README 2018-02-25 16:01:22 -05:00			`<h2 id="extend">Access to lexer and parser</h2>`

update docs 2020-04-02 01:05:04 -05:00			`You also have direct access to the lexer and parser if you so desire.`
Base README 2018-02-25 16:01:22 -05:00
			``` js
update docs 2020-04-02 01:05:04 -05:00			`const tokens = marked.lexer(markdown, options);`
update docs 2019-11-06 16:03:39 -06:00			`console.log(marked.parser(tokens, options));`
Base README 2018-02-25 16:01:22 -05:00			```

			``` js
update examples with es6+ 2019-07-08 09:13:53 -05:00			`const lexer = new marked.Lexer(options);`
update docs 2020-04-02 01:05:04 -05:00			`const tokens = lexer.lex(markdown);`
Base README 2018-02-25 16:01:22 -05:00			`console.log(tokens);`
fix docs 2020-04-15 00:02:18 -05:00			`console.log(lexer.tokenizer.rules.block); // block level rules used`
			`console.log(lexer.tokenizer.rules.inline); // inline level rules used`
			`console.log(marked.Lexer.rules.block); // all block level rules`
			`console.log(marked.Lexer.rules.inline); // all inline level rules`
Base README 2018-02-25 16:01:22 -05:00			```

			``` bash
			`$ node`
update docs 2020-04-02 01:05:04 -05:00			`> require('marked').lexer('> I am using marked.')`
			`[`
			`{`
			`type: "blockquote",`
			`raw: "> I am using marked.",`
			`tokens: [`
			`{`
			`type: "paragraph",`
			`raw: "I am using marked.",`
			`text: "I am using marked.",`
			`tokens: [`
			`{`
			`type: "text",`
			`raw: "I am using marked.",`
			`text: "I am using marked."`
			`}`
			`]`
			`}`
			`]`
			`},`
			`links: {}`
			`]`
Remove duplicate part in docs/USING_PRO.md 2018-03-28 00:54:11 +03:00			```
Document mutability of tokens argument in parser 2018-03-28 01:10:10 +03:00
update docs 2020-04-02 01:05:04 -05:00			`The Lexer builds an array of tokens, which will be passed to the Parser.`
			`The Parser processes each token in the token array:`
Document mutability of tokens argument in parser 2018-03-28 01:10:10 +03:00
			``` js
			`const marked = require('marked');`

			const md = `
			`# heading`

			`[link][1]`

			`[1]: #heading "heading"`
			`;

update docs 2020-04-02 01:05:04 -05:00			`const tokens = marked.lexer(md);`
Document mutability of tokens argument in parser 2018-03-28 01:10:10 +03:00			`console.log(tokens);`

			`const html = marked.parser(tokens);`
			`console.log(html);`
			```

			``` bash
update docs 2020-04-02 01:05:04 -05:00			`[`
			`{`
			`type: "heading",`
			`raw: " # heading\n\n",`
			`depth: 1,`
			`text: "heading",`
			`tokens: [`
			`{`
			`type: "text",`
			`raw: "heading",`
			`text: "heading"`
			`}`
			`]`
			`},`
			`{`
			`type: "paragraph",`
			`raw: " [link][1]",`
			`text: " [link][1]",`
			`tokens: [`
			`{`
			`type: "text",`
			`raw: " ",`
			`text: " "`
			`},`
			`{`
			`type: "link",`
			`raw: "[link][1]",`
			`text: "link",`
			`href: "#heading",`
			`title: "heading",`
			`tokens: [`
			`{`
			`type: "text",`
			`raw: "link",`
			`text: "link"`
			`}`
			`]`
			`}`
			`]`
			`},`
			`{`
			`type: "space",`
			`raw: "\n\n"`
			`},`
			`links: {`
			`"1": {`
			`href: "#heading",`
			`title: "heading"`
			`}`
			`}`
			`]`
Document mutability of tokens argument in parser 2018-03-28 01:10:10 +03:00			`<h1 id="heading">heading</h1>`
			`<p> <a href="#heading" title="heading">link</a></p>`
			```