Add a `spanned` method for automatically wrapping the parser result in a `Span` #640

TheOnlyTails · 2024-06-06T17:08:21Z

This would be a very simple utility method I feel is missing right now, as a shorthand:

parser.map_extra(|it, e| (it, e.span()))
// into
parser.spanned()

It's not essential or anything, but it'll definitely be nice to have.

zesterer · 2024-06-27T13:17:59Z

I've considered this a few times, but in practice it's far from a convention to represent the two as a tuple: folks often create a `struct Spanned<T>(T, Span);` type that implements `Deref<Target = T>` for the same purpose.
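
As a rough illustration of that pattern, here is a minimal sketch of such a wrapper. It assumes a plain `Range<usize>` as the span type rather than any of chumsky's own span types, and `describe` is a hypothetical consumer added only to show the `Deref` access.

```rust
use std::ops::{Deref, Range};

// Stand-in span type for this sketch (an assumption, not chumsky's span).
type Span = Range<usize>;

/// A value paired with the source range it was parsed from.
#[derive(Debug, Clone, PartialEq)]
pub struct Spanned<T>(pub T, pub Span);

impl<T> Deref for Spanned<T> {
    type Target = T;

    // Lets callers use a `Spanned<T>` wherever a `&T` is expected.
    fn deref(&self) -> &T {
        &self.0
    }
}

/// Hypothetical consumer: reads the inner `String` through `Deref`,
/// while the span stays available for error reporting.
pub fn describe(name: &Spanned<String>) -> String {
    format!("{} at {}..{}", name.as_str(), name.1.start, name.1.end)
}
```

The appeal of `Deref` here is that AST code can mostly ignore the wrapper and only touch the span when it needs to report something.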

I am tempted to make such a Spanned type a part of chumsky's API though. Would that suit you?

TheOnlyTails · 2024-06-29T17:26:07Z

In my opinion, a library taking a position in cases like this usually makes it easier to solve problems and reduces fragmentation, so I'm all in favor of an official solution.

zesterer · 2024-06-30T15:55:06Z

What worries me about this is that such a Spanned type would effectively become a pervasive part of a user's compiler since it would be present in the AST too. Perhaps this isn't a problem in itself, but chumsky tries very hard to be domain-specific, not being opinionated about other aspects of a compiler.

TheOnlyTails · 2024-06-30T17:41:35Z

IMO that's a good thing, it encourages and makes it easier to do good error reporting, since you can access the span right then and there. Besides, it's opt-in only for those who use the method.

mlgiraud · 2024-07-11T09:25:48Z

Is there any case where Spanned is something other than a token or element, and a span? In the current version of chumsky I also don't see an easy way to create a custom spanned type in a tokenizer that is then consumed by a parser, since the Input trait is sealed, and the SpannedInput type requires tuples as tokens. So either it would make sense to have a Spanned type that is provided by the library, or to have a trait that needs to be implemented by a "Spanned" type such that chumsky can handle it, right? IMHO both would make sense, i.e. providing a trait and a default "Spanned" type that implements this trait.

zesterer · 2024-10-23T08:52:12Z

> or to have a trait that needs to be implemented by a "Spanned" type such that chumsky can handle it, right? IMHO both would make sense, i.e. providing a trait and a default "Spanned" type that implements this trait.

The problem with a trait is that you then end up needing to specify the implementor you want, adding more type annotations (and potentially confusing type errors).

One option might be to tie it to the Span trait itself. For example:

pub trait Span {
    type Spanned<T>;

    fn make_spanned<T>(self, item: T) -> Self::Spanned<T>;
}

...

pub struct Spanned<T, S = SimpleSpan>(T, S);

impl Span for SimpleSpan {
    type Spanned<T> = Spanned<T>;

    fn make_spanned<T>(self, item: T) -> Self::Spanned<T> { Spanned(item, self) }
}

Then a `.spanned()` combinator could be introduced that performs this spanning operation automatically.
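
To make the idea concrete, here is a self-contained sketch of the associated-type approach outside of chumsky. `SimpleSpan` and `Span` below are simplified stand-ins, not the crate's real definitions, and `LineSpan` and `attach_span` are hypothetical names used only to show how a second span type could pick a different wrapper.

```rust
// Simplified stand-ins for this sketch; not chumsky's actual types.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct SimpleSpan {
    pub start: usize,
    pub end: usize,
}

#[derive(Debug, PartialEq)]
pub struct Spanned<T, S = SimpleSpan>(pub T, pub S);

pub trait Span {
    /// The wrapper type this span produces for an item of type `T`.
    type Spanned<T>;

    fn make_spanned<T>(self, item: T) -> Self::Spanned<T>;
}

impl Span for SimpleSpan {
    type Spanned<T> = Spanned<T>;

    fn make_spanned<T>(self, item: T) -> Self::Spanned<T> {
        Spanned(item, self)
    }
}

// Hypothetical user-defined span that prefers a plain tuple as its wrapper.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct LineSpan {
    pub line: u32,
}

impl Span for LineSpan {
    type Spanned<T> = (T, LineSpan);

    fn make_spanned<T>(self, item: T) -> Self::Spanned<T> {
        (item, self)
    }
}

/// Hypothetical helper standing in for what a `.spanned()` combinator would do
/// once a parser has produced `item` covering `span`.
pub fn attach_span<T, S: Span>(item: T, span: S) -> S::Spanned<T> {
    span.make_spanned(item)
}
```

Because the wrapper is chosen by the span type, the caller never has to name it, which avoids the extra type annotations a free-standing trait would need.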

How do you both feel about this?

TheOnlyTails · 2024-10-23T11:51:53Z

I really like this!

MeGaGiGaGon · 2024-11-07T00:58:03Z

I third this. I'm trying to migrate from pom, where Parser could be easily extended to add a spanned combinator, and not having it sucks majorly.

zesterer · 2025-01-02T12:54:29Z

In #713 I've made a first pass attempt at this. Do you have any thoughts @TheOnlyTails, @MeGaGiGaGon, and @Hedgehogo?

Hedgehogo · 2025-01-02T14:47:10Z

You could make a separate trait that would contain the associated Spanned type and be a subtrait of Span. Alternatively, you could just return any type implementing From<(T, Span)> and hope that the type inference works out (e.g. based on the return type of the enclosing function). There is also the option of adding some kind of map_into combinator, which would simplify the map(|i| From::from(i)) construction.

zesterer · 2025-01-02T15:45:28Z

I don't think inference is likely to be powerful enough to infer the output type in enough cases, so it would get very annoying.

.map(|i| From::from(i)) can already be simplified to .map(From::from), and I don't think there's much point going simpler than that. Users already understand both map and From; adding another combinator to replace both has few benefits but a big complexity drawback. To be honest, I think even .spanned() toes the line on this.
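
For reference, a small sketch of the `From`-based route using only the standard library. `Option::map` stands in for a parser's `map`, and `Spanned` and `wrap` are hypothetical names for this example; the span is a plain `Range<usize>` by assumption.

```rust
use std::ops::Range;

#[derive(Debug, PartialEq)]
pub struct Spanned<T>(pub T, pub Range<usize>);

impl<T> From<(T, Range<usize>)> for Spanned<T> {
    fn from((item, span): (T, Range<usize>)) -> Self {
        Spanned(item, span)
    }
}

/// `Option::map` stands in for a parser combinator's `map` here.
pub fn wrap(parsed: Option<(u32, Range<usize>)>) -> Option<Spanned<u32>> {
    // The declared return type is the only thing telling the compiler which
    // `From` impl to pick; without it this line would need an annotation.
    parsed.map(From::from)
}
```

This works when a signature pins down the target type, but in the middle of a longer combinator chain there is often nothing for inference to anchor on, which is the annoyance described above.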

Hedgehogo · 2025-01-04T19:41:52Z

I don't think a plain tuple works for us, as it's too weak a simplification to be worthy of a combinator. So the choice is between three solutions: a Spanned type built into chumsky with no trait binding, an associated type added to the Span trait, or a new trait with the required associated type. Between the first and the other two, the question is: can the user use their own type?

Yes, they can, for example if the user wants to save memory by storing a reduced span and recovering the missing span information from T. However, that requires placing constraints on T, which rules out an associated type inside Span as a solution. Further reasoning depends on whether we really care about the rare user who needs memory-optimized spans.

If the answer is no, then a user-defined Spanned has practically no use and the first option should be taken. If the answer is yes, the third option needs further discussion about the design of the trait and its name. Either way, the second option is ruled out.

zesterer · 2025-01-06T07:22:58Z

The current implementation allows the user to specify their own spanning type.

Hedgehogo · 2025-01-06T16:20:08Z

> The current implementation allows the user to specify their own spanning type.

But that is useless as long as it does not allow placing constraints on T.

Hedgehogo · 2025-01-06T16:39:39Z

I will describe what an implementation that supports the situation described above should look like.

We need a separate trait that maps a span to its Spanned type:

trait ???: Span {
    type Spanned<T>;
}

But such a trait does not allow placing restrictions on T, which is necessary in order to get the missing information from it in the case of a memory-optimized span. To allow that, we can move T from the associated type to the trait:

trait ????<T>: Span {
    type Spanned;
}

This is actually a mapping of the form (T, S) -> Sp. It can be written in Rust in a different way:

trait ???<S: Span> {
    type Spanned;
}

In this form it reads as if individual types can each have their own Spanned form, which is logical, since with a condensed span the way to recover the full span depends on the type. It also makes the name easy to pick: WithSpanned. The final design of the trait, along with its standard implementation in chumsky:

trait WithSpanned<S: Span> {
    type Spanned;

    fn new_spanned(self, span: S) -> Self::Spanned;
}

impl<T> WithSpanned<SimpleSpan> for T {
    type Spanned = Spanned<T>;

    fn new_spanned(self, span: SimpleSpan) -> Self::Spanned {
        Spanned(self, span)
    }
}
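
As a sketch of the memory-optimized case this design is meant to allow: everything below is self-contained and uses simplified stand-ins for `Span`, `SimpleSpan` and `Spanned`. `ByteSpan`, `Ident` and `IdentAt` are hypothetical user-side types, invented here only to show a `T` that can recover part of its own span.

```rust
// Simplified stand-ins for this sketch; not chumsky's actual definitions.
pub trait Span {}

#[derive(Debug, Clone, Copy, PartialEq)]
pub struct SimpleSpan { pub start: usize, pub end: usize }
impl Span for SimpleSpan {}

#[derive(Debug, PartialEq)]
pub struct Spanned<T>(pub T, pub SimpleSpan);

pub trait WithSpanned<S: Span> {
    type Spanned;

    fn new_spanned(self, span: S) -> Self::Spanned;
}

// Default from the comment above: any `T` paired with a `SimpleSpan`.
impl<T> WithSpanned<SimpleSpan> for T {
    type Spanned = Spanned<T>;

    fn new_spanned(self, span: SimpleSpan) -> Self::Spanned {
        Spanned(self, span)
    }
}

// Hypothetical user-side span type.
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct ByteSpan { pub start: u32, pub end: u32 }
impl Span for ByteSpan {}

// Hypothetical AST node that already knows its own length.
#[derive(Debug, PartialEq)]
pub struct Ident(pub String);

/// Stores only the start offset; the end is recomputed from the name.
#[derive(Debug, PartialEq)]
pub struct IdentAt { pub name: Ident, pub start: u32 }

impl IdentAt {
    pub fn span(&self) -> ByteSpan {
        ByteSpan { start: self.start, end: self.start + self.name.0.len() as u32 }
    }
}

// Because the impl is written for a concrete `T`, it can rely on what `Ident` knows.
impl WithSpanned<ByteSpan> for Ident {
    type Spanned = IdentAt;

    fn new_spanned(self, span: ByteSpan) -> Self::Spanned {
        IdentAt { name: self, start: span.start }
    }
}
```

Note that the blanket impl covers every `T` for `SimpleSpan`, so in this sketch a specialised impl is only possible for a different span type such as `ByteSpan`.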

zesterer added the enhancement and api labels Oct 23, 2024

zesterer linked a pull request Jan 2, 2025 that will close this issue

Added Spanned combinator #713

Open
