2021-12-25 18:59:02 +00:00
//! See [`Output`]
internal: replace TreeSink with a data structure
The general theme of this is to make the parser a better independent
library.
The specific thing we do here is replace the callback-based TreeSink with
a data structure. That is, rather than calling user-provided tree
construction methods, the parser now spits out a very bare-bones tree,
effectively a log of a DFS traversal.
This makes the parser usable without any *specific* tree sink, and allows
us to, e.g., move tests into this crate.
Now, it's also true that this is a distinction without a difference, as
the old and the new interface are equivalent in expressiveness. Still,
this new thing seems somewhat simpler. But yeah, I admit I don't have a
super strong motivation here, just a hunch that this is better.
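To make the "log of a DFS traversal" idea concrete, here is a minimal sketch of how a consumer might replay such a log into a nested tree. Everything in it is an assumption for illustration: `Step` is a simplified stand-in with string kinds (no errors, no `n_input_tokens`), and `Tree`, `build_tree`, and `example_log` are hypothetical names, not part of this crate.

```rust
// Hypothetical, simplified stand-in for the crate's `Step`; kinds are plain strings.
#[derive(Debug, Clone, PartialEq)]
enum Step {
    Enter { kind: &'static str },
    Token { kind: &'static str },
    Exit,
}

// Hypothetical tree type that a consumer might build from the log.
#[derive(Debug, PartialEq)]
enum Tree {
    Node { kind: &'static str, children: Vec<Tree> },
    Leaf { kind: &'static str },
}

/// Replays a DFS log into a nested tree using an explicit stack.
/// Returns `None` if the log is unbalanced or does not have exactly one root.
fn build_tree(steps: &[Step]) -> Option<Tree> {
    // Each stack entry is a node that has been entered but not yet exited.
    let mut stack: Vec<(&'static str, Vec<Tree>)> = Vec::new();
    let mut root = None;
    for step in steps {
        match step {
            Step::Enter { kind } => stack.push((*kind, Vec::new())),
            // A token must always sit inside some open node.
            Step::Token { kind } => stack.last_mut()?.1.push(Tree::Leaf { kind: *kind }),
            Step::Exit => {
                let (kind, children) = stack.pop()?;
                let node = Tree::Node { kind, children };
                match stack.last_mut() {
                    Some(parent) => parent.1.push(node),
                    None => {
                        if root.is_some() {
                            return None; // a second root is not a single tree
                        }
                        root = Some(node);
                    }
                }
            }
        }
    }
    if stack.is_empty() { root } else { None }
}

/// A small log: FN( FN_KW NAME( IDENT ) ).
fn example_log() -> Vec<Step> {
    vec![
        Step::Enter { kind: "FN" },
        Step::Token { kind: "FN_KW" },
        Step::Enter { kind: "NAME" },
        Step::Token { kind: "IDENT" },
        Step::Exit,
        Step::Exit,
    ]
}
```

Calling `build_tree(&example_log())` yields an `FN` node holding an `FN_KW` leaf and a `NAME` node. The same replay loop could drive any other tree representation, which is the sense in which the log is not tied to a specific sink.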
2021-12-19 14:36:23 +00:00
use crate::SyntaxKind;
/// Output of the parser -- a DFS traversal of a concrete syntax tree.
///
/// Use the [`Output::iter`] method to iterate over traversal steps and consume
/// a syntax tree.
///
/// In a sense, this is just a sequence of [`SyntaxKind`]-colored parentheses
/// interspersed into the original [`crate::Input`]. The output is fundamentally
/// coordinated with the input and `n_input_tokens` refers to the number of
/// times [`crate::Input::push`] was called.
#[derive(Default)]
pub struct Output {
    /// 32-bit encoding of events. If LSB is zero, then that's an index into the
    /// error vector. Otherwise, it's one of the three other variants, with data
    /// encoded as follows (the LSB, which is set to one, is part of the leftover bits):
    ///
    ///     |16 bit kind|8 bit n_input_tokens|4 bit tag|4 bit leftover|
    ///
    event: Vec<u32>,
    error: Vec<String>,
}
2021-12-29 15:23:34 +00:00
#[derive(Debug)]
pub enum Step<'a> {
    Token { kind: SyntaxKind, n_input_tokens: u8 },
    Enter { kind: SyntaxKind },
    Exit,
    Error { msg: &'a str },
}
impl Output {
    pub fn iter(&self) -> impl Iterator<Item = Step<'_>> {
        self.event.iter().map(|&event| {
            // An LSB of zero marks an error; the remaining bits index into `self.error`.
            if event & 0b1 == 0 {
                return Step::Error { msg: self.error[(event as usize) >> 1].as_str() };
            }
            // Otherwise the tag in bits 4..8 selects the variant.
            let tag = ((event & 0x0000_00F0) >> 4) as u8;
            match tag {
                0 => {
                    let kind: SyntaxKind = (((event & 0xFFFF_0000) >> 16) as u16).into();
                    let n_input_tokens = ((event & 0x0000_FF00) >> 8) as u8;
                    Step::Token { kind, n_input_tokens }
                }
                1 => {
                    let kind: SyntaxKind = (((event & 0xFFFF_0000) >> 16) as u16).into();
                    Step::Enter { kind }
                }
                2 => Step::Exit,
                _ => unreachable!(),
            }
        })
    }
    pub(crate) fn token(&mut self, kind: SyntaxKind, n_tokens: u8) {
2022-12-30 10:02:45 +00:00
        let e = ((kind as u16 as u32) << 16) | ((n_tokens as u32) << 8) | 1;
        self.event.push(e)
    }
    pub(crate) fn enter_node(&mut self, kind: SyntaxKind) {
        let e = ((kind as u16 as u32) << 16) | (1 << 4) | 1;
        self.event.push(e)
    }
    pub(crate) fn leave_node(&mut self) {
        let e = (2 << 4) | 1;
        self.event.push(e)
    }
    pub(crate) fn error(&mut self, error: String) {
        let idx = self.error.len();
        self.error.push(error);
        let e = (idx as u32) << 1;
        self.event.push(e);
    }
}
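The bit layout used by `token`, `enter_node`, `leave_node`, and `error` can be checked in isolation. The following is a minimal sketch that mirrors the shifts and masks above, assuming a raw `u16` in place of `SyntaxKind`; the `Decoded` enum and the `pack_*` and `unpack` functions are hypothetical names introduced only for this illustration.

```rust
// Hypothetical stand-in: a raw `u16` replaces the crate's `SyntaxKind`.
#[derive(Debug, PartialEq)]
enum Decoded {
    Token { kind: u16, n_input_tokens: u8 },
    Enter { kind: u16 },
    Exit,
    Error { index: usize },
}

// Layout when the LSB is one:
// |16 bit kind|8 bit n_input_tokens|4 bit tag|4 bit leftover|
fn pack_token(kind: u16, n_input_tokens: u8) -> u32 {
    // Tag 0, so only the kind, the token count, and the LSB are set.
    ((kind as u32) << 16) | ((n_input_tokens as u32) << 8) | 1
}

fn pack_enter(kind: u16) -> u32 {
    ((kind as u32) << 16) | (1 << 4) | 1
}

fn pack_exit() -> u32 {
    (2 << 4) | 1
}

// Errors keep the LSB at zero and store the index in the remaining bits.
fn pack_error(index: usize) -> u32 {
    (index as u32) << 1
}

/// Decodes one event; returns `None` for a tag that no packer produces.
fn unpack(event: u32) -> Option<Decoded> {
    if event & 1 == 0 {
        return Some(Decoded::Error { index: (event >> 1) as usize });
    }
    let kind = (event >> 16) as u16;
    match (event >> 4) & 0xF {
        0 => Some(Decoded::Token { kind, n_input_tokens: ((event >> 8) & 0xFF) as u8 }),
        1 => Some(Decoded::Enter { kind }),
        2 => Some(Decoded::Exit),
        _ => None,
    }
}
```

For example, `pack_token(0x1234, 3)` produces `0x1234_0301`, and `unpack` recovers the same kind and token count. Unlike `Output::iter`, which treats an unknown tag as unreachable, this sketch returns `None` so that it can be exercised on arbitrary input.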