GitHub - foldmaker/foldmaker: Lightweight parser combinator that lets you leverage RegExps during parsing

Foldmaker is a lightweight tool (~120 lines uncommented), and a way of thinking for building parsers. It was designed to be minimal and user-friendly. Foldmaker has its own characteristic parsing technique. This technique results in very compact parsers. You can implement a JSON parser only in ~45 lines or SCSS-like preprocessors only in ~50 lines.

"Parsing is re-tokenization"

This is Foldmaker's motto, and its embodied in Foldmaker's architecture. The tokenizer function is also used as the parser function internally. First to tokenize our string, then to tokenize our token stream. Then we use it again, on the resulting token stream, and so on. More on this is explained below.

Special versions

@foldmaker/tiny - Strict mode with error logging
@foldmaker/strict - Strict mode with error logging (not done yet)
@foldmaker/capturing-groups - Offers capturing group support (not done yet)

Examples

@foldmaker/json-parse - JSON.parse implementation. Does not check for validity.
@foldmaker/css-nested - Tiny plugin to unwrap nested CSS rules (like SCSS)
@foldmaker/shallow-ast - Error tolerant AST generator for JS-like languages (not done yet)
Nested arrays
Basic linter
Line and column mapping

Installation

Foldmaker can be installed with npm, by the following command:

npm install foldmaker

Importing Foldmaker by the default export is enough:

import  Foldmaker  from  'foldmaker'

How it works?

In Foldmaker, tokenization and parsing are analogous processes. Our tokenizer function is also used as the parser function under the hood. First our string gets tokenized by it, then our token stream gets tokenized. This is possible, because in Foldmaker, token stream is a string! This is possible thanks to the first rule of Foldmaker's specification:

Always use one-character token names

While this rule may seem to restrict the user, but it has its own advantages. Let me explain. Let's say, in the following string, we want to match the condition inside the if block. We want to identify foo === 1 part as the "condition" .

if(foo === 1) { print('ONE') }

Imagine, after tokenization, we have the following array of tokens. i for if, p for primitive, o for operator and b for body.

let tokens = [
  {type: "i", value: "if"},
  {type: "(", value: "("},
  {type: "p", value: "foo"},
  {type: "o", value: "==="},
  {type: "p", value: "1"},
  {type: ")", value: ")"},
  {type: "b", value: "{ print('ONE') }"}
]

Now let's input our array of tokens inside Foldmaker like below:

let fm = Foldmaker(tokens)

we get the following FoldmakerObject.

{
  array: ["if", "(", "foo ", "=== ", "1", ") ", "{ print('ONE') }"  ],
  string: "i(pop)b",
  __proto__: FoldmakerObject
}

This is the first maneuver of Foldmaker. It creates two streams by joining types and values. Observe that each element of the string maps to an element in the array with the same index, and vice versa. Now, let's return to our case. In our case, we need to match the string that is represented by pop (primitive, operator, primitive in series) right? Let's say that we are going to mark that as c (condition). First, let's log our matching occurrence in the console.

We simply use replace method of Foldmaker:

let fm = fm.replace(/pop/, result => console.log(result))
console.log(fm.string)

Console output:

First log: {
  raw: ["foo",  "===",  "1"],
  map: ["pop",  index:  0,  input:  "pop)b",  groups:  undefined],
  index: 2,
  count: 3
}
Second log: "i(pop)b"

As you can see on the first log, our occurrence is shown in the raw key. Since we didn't return anything inside the callback function of the replace method, the string in our Foldmaker instance (fm.string) did not change, as can be seen on the second log. Let's manipulate If we return something inside the callback function as the following:

let fm = fm.replace(/pop/, result => {
	return ['c', { value: result.raw }]
})
console.log(fm)

console output will be:

{
  array: ["if", "(", Object, ") ", "{ print('ONE') }"  ],
  string: "i(c)b",
}

Input string:

{"a": 1, "b": 2, "c": [1, 2, 3, {"deep": {}}]}

Foldmaker's parsing steps:

Token stream : {s:n,s:n,s:[n,n,n,{s:{}}]}
Iteration 1  : {k,k,s:[n,n,n,{s:o}]}
Iteration 2  : {k,k,s:[n,n,n,{k}]}
Iteration 3  : {k,k,s:[n,n,n,o]}
Iteration 4  : {k,k,s:a}
Iteration 5  : {k,k,k}
Iteration 6  : o
Iteration 7  : o

API

Exports

Export	Usage	Description
default (function)	`Foldmaker(tokensArray)`	Takes a `tokensArray` and returns a FoldmakerObject
tokenize (function)	`tokenize(string, dictionary {, callback })`	Tokenizes the `string` by using `dictionary`. Returns an array of tokens.
traverse (function)	`traverse(node, callback)`	Conducts tree traversal for `node`. Node can be of type non-array or array.
visitor (function)	`visitor(regexp, callback)`	Is a helper function
FoldmakerObject (class)	-
fm (function)	`fm(tokensArray)`	Is a shorthand for default export (circular)

Methods of `FoldmakerObject`

Method	Usage	Arguments of callback
parse	`parse(regex , callback {, debug })`	`(result {, state {, oldState}})`
replace	`replace(regex , callback {, debug })`	`(result {, state {, oldState}})`
traverse	`traverse(callback)`	`(object {, alsoTraverse})`
add	`add(string, array)`	-

Useful Links

Tools for Compiler Writers by @jashkenas

License

Licensed by the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
dist		dist
examples		examples
src		src
.babelrc		.babelrc
.gitignore		.gitignore
.prettierrc		.prettierrc
LICENSE		LICENSE
README.md		README.md
TUTORIAL.md		TUTORIAL.md
logo.png		logo.png
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"Parsing is re-tokenization"

Special versions

Examples

Installation

How it works?

API

Exports

Methods of `FoldmakerObject`

Useful Links

License

About

Releases 1

Packages

Languages

License

foldmaker/foldmaker

Folders and files

Latest commit

History

Repository files navigation

"Parsing is re-tokenization"

Special versions

Examples

Installation

How it works?

API

Exports

Methods of FoldmakerObject

Useful Links

License

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Methods of `FoldmakerObject`

Packages