parzig

A parser generator which constructs its parsers at comptime.

The way parsers and grammars are written and used is heavily inspired by tree-sitter.

Usage

Minimal Example

// MinimalExample.zig
const parzig = @import("parzig");

const Rule = parzig.RuleFrom(Rules);
pub const Rules = enum {
    root,
};

root: Rule = .{ .regex = "" },

When using the parser:

const parzig = @import("parzig");
const Parser = parzig.ParserFrom(@import("MinimalExample.zig"));
// ... in a function
    var parser = Parser.init(allocator, content);
    const tree = parser.parse() catch |err| {
        // ... handle error
    };
    // release the tree's resources once it is no longer needed
    defer tree.deinit();
    // ... use the returned tree

This grammar has a single rule, root, which tries to match the internal regex "".

Grammar Options

Add this if you want to change the default configuration:

pub fn config() parzig.Config {
    return .{};
}

Available Options

ignore_whitespace: bool (default: false)
If true, whitespace is ignored unless a regex matches it explicitly.
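
As a minimal sketch, enabling this option in a grammar file might look like the following. It assumes that options are set as fields on the returned parzig.Config value, which the default `.{}` above suggests but does not confirm.

```zig
const parzig = @import("parzig");

// Assumption: each option is a field of parzig.Config,
// so overriding one means naming it in the struct literal.
pub fn config() parzig.Config {
    return .{ .ignore_whitespace = true };
}
```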

Internal Regex

Things you can express in this implementation:

character: a
escaped character: \\+
repeat any amount: *a
repeat at least once: +a
choice: [abc]
negative choice: [^abc]
character range: {a-z}

NOTE: The double backslash is necessary because the regex is written inside a Zig string literal, where a backslash must itself be escaped. To match a literal backslash you need to write \\\\.
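
The escaping rule from the note can be checked with plain Zig, independent of parzig. The sketch below only inspects the string literals themselves; the constant names are made up for illustration.

```zig
const std = @import("std");

// Hypothetical names; these are just the literals from the note above.
pub const escaped_plus = "\\+"; // the regex sees: \+
pub const escaped_backslash = "\\\\"; // the regex sees: \\

test "zig string escapes halve the backslashes" {
    // "\\+" holds two bytes: one backslash and one plus sign.
    try std.testing.expectEqual(@as(usize, 2), escaped_plus.len);
    try std.testing.expect(escaped_plus[0] == '\\');
    try std.testing.expect(escaped_plus[1] == '+');

    // "\\\\" holds two backslashes, which the regex reads as one escaped backslash.
    try std.testing.expectEqual(@as(usize, 2), escaped_backslash.len);
    try std.testing.expect(escaped_backslash[0] == '\\');
    try std.testing.expect(escaped_backslash[1] == '\\');
}
```

Running the file with `zig test` shows that each pair of backslashes in the source becomes a single backslash in the pattern handed to the regex.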

Walking the Tree

Available functions of the parsed tree:

const Tree = struct {
    // frees all resources of the tree
    fn deinit(self: Tree) void;
    // returns the node at `node_index`
    fn node(self: Tree, node_index: usize) Node;
    // returns the kind of the node at `node_index`
    fn nodeKind(self: Tree, node_index: usize) enum { ... };
    // returns the child indices of the node at `node_index`
    fn children(self: Tree, node_index: usize) []const usize;
    // returns the characters matched by the node at `node_index`
    fn chars(self: Tree, node_index: usize) []const u8;

    // prints the full tree to `out`
    fn dumpTo(self: Tree, out: std.io.AnyWriter) !void;
    // prints the node given by `node_index` to `out` with `indent_level` of whitespace padding
    fn dumpNodeTo(self: Tree, node_index: usize, out: std.io.AnyWriter, indent_level: usize) !void;
};

const Node = struct {
    kind: enum { ... },
    start_index: usize,
    end_index: usize,
    children: []const usize,
};

NOTE: The node kind enum is constructed from the rules enum you define for your grammar. The values repeat, sequence and regex are added to it at comptime.

The functions node, nodeKind, children and chars are provided for convenience; you do not have to use them.
For example, these two are equivalent:

const node_chars = tree.chars(index);

const node = tree.nodes[index];
const node_chars = tree.source[node.start_index .. node.end_index];
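
Building on the accessors above, a depth-first walk over the tree could be sketched as follows. The function name `walk` is hypothetical, and `tree` is taken as `anytype` because this README does not say how the tree type is named. The README also does not state which index is the root, so the starting index is left to the caller.

```zig
const std = @import("std");

/// Prints the kind and matched characters of the node at `node_index`
/// and of all its descendants, indenting each level by two spaces.
/// `tree` only needs the `nodeKind`, `chars` and `children` functions listed above.
fn walk(tree: anytype, node_index: usize, depth: usize) void {
    for (0..depth) |_| std.debug.print("  ", .{});
    std.debug.print("{s}: \"{s}\"\n", .{
        @tagName(tree.nodeKind(node_index)),
        tree.chars(node_index),
    });
    // children() returns node indices, so recurse on each of them.
    for (tree.children(node_index)) |child_index| {
        walk(tree, child_index, depth + 1);
    }
}
```

A call would look like `walk(tree, start_index, 0);`, where `start_index` is whichever node you want to start from. For a quick look at the whole tree, the built-in dumpTo is likely the simpler choice; a hand-written walk is mainly useful when you want to act on specific node kinds.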
