aiken

Commit Graph

Author	SHA1	Message	Date
rvcas	db3b5c74bb	fix: todo and fail spans	2023-07-15 20:08:16 -04:00
rvcas	69fdee9f7e	fix: trace expr	2023-07-15 20:08:16 -04:00
rvcas	2edfd33594	fix: some attempted adjustments	2023-07-15 20:08:16 -04:00
rvcas	eafe3cdf75	test: fail with expr relates to #675	2023-07-14 13:09:55 -04:00
rvcas	e7c1b28b52	feat: add ability to reference validators in tests closes #632	2023-07-12 18:29:03 -04:00
rvcas	13ee62c05c	fix: don't break failing test defs for current users	2023-07-12 09:16:37 -04:00
rvcas	03e7d6e944	feat: update syntax for failing test to be more consistent	2023-07-12 09:16:37 -04:00
rvcas	1ab1ff9a1f	feat: rename error to fail	2023-07-12 09:16:37 -04:00
Cainã Costa	7f7a86765d	feat: bring back formatting imdepotency test	2023-07-11 13:06:25 -04:00
Cainã Costa	c27ef8ad93	chore: refactor formatting tests This changes the tests from normal assertions into snapshot tests, as well as standardizing test names.	2023-07-11 13:06:25 -04:00
rvcas	94bf75dd1c	chore: delete unreferenced snapshots	2023-07-06 21:00:13 -04:00
KtorZ	78d34f7f76	Fix parsing of negative int patterns and constants This was trickier than expected as the expression parser, and in particular the bin-op parser will interpret negative patterns as a continuation of a binary operation and eventually choke on the next right-arrow symbol. This is due to how we actually completely erase newlines once we're done with the lexer. The newline separating when clause is actually semantically important. In principle, we could only parse an expression until the next newline. Ideally, we would keep that newline in the list of token but it's difficult to figure out which newline to keep between two right arrows since a clause guard can be written over multiple lines. Though, since we know that this is only truly a problem for negative integers, we can use the same trick as for tuples and define a new 'NewLineMinus' token. That token CANNOT be part of a binop expression. That means it's impossible to write a binary operation with a minus over multiple lines, or more specifically, with the '-' symbol on a newline. This sounds like a fair limitation. What we get in exchange is less ambiguity when parsing patterns following expressions in when clause cases. Another more cumbersome option could be to preserve the first newline encountered after a 'right-arrow' symbol and before any parenthesis or curly brace is found (which would otherwise signal the beginning of a new block). That requires to traverse, at least partially, the list of tokens twice. This feels unnecessary for now and until we do face a similar issue with a binary operator.	2023-07-06 16:10:46 -04:00
KtorZ	346df47232	Refactor chain parser The main goal is to make the parser more reusable to be used for when-clauses, instead of the expression parser. A side goal has been to make it more readable by moving the construction of some untyped expression as method on UntypedExpr. Doing so, I got rid of the extra temporary 'ParseArg' type and re-used the generic 'CallArg' instead by simply using an Option<UntypedExpr> as value to get the same semantic as 'ParseArg' (which would distinguish between plain call args and holes). Now the chained parser is in a bit more reusable state.	2023-07-06 16:10:46 -04:00
KtorZ	549cf22cdd	Rename (Un)TypedExpr.Int -> (Un)TypedExpr.UInt We do not actually every parse negative values in there, as a negative value is a combination of a 'Negate' and 'UInt' expression. However, for patterns and constant, it'll be simpler to parse whole Int values as there's no ambiguity with arithmetic operations there. To avoid confusion of having some 'Int' constructors containing only non-negative values, and some being on the whole range, I've renamed the constructor to 'UInt' to make this more obvious.	2023-07-06 16:10:46 -04:00
KtorZ	5a4a2faa4d	Split pattern parser into individual modules.	2023-07-06 16:10:46 -04:00
KtorZ	0650d6152d	rename test cases for when/clause to somewhat match the file hierarchy.	2023-07-06 16:10:46 -04:00
KtorZ	ed85cb1c00	Fix todo/error parsing This was a bit more tricky than anticipated but played out nicely in the end. Now we have one holistic way of parsing todos and errors instead of it being duplicated between when/clause and sequence. The error/todo parser has been moved up to the expression part rather than being managed when parsing sequences. Not sure what motivated that to begin with. Fixes #621.	2023-07-05 20:12:57 +02:00
rvcas	e331b3449b	chore: clippy fix	2023-07-05 12:06:03 -04:00
KtorZ	a306d6e9f2	Move chain and chained parsing into their own submodule Alleviate a bit more the top-level expression parser. Note that we probably need a bit more disciplined in what we export and at what level because there doesn't seem to be much logic as for whether a parser is private, exported to the crate only or to the wide open. I'd be in favor of exporting everything by default.	2023-07-05 15:18:07 +02:00
KtorZ	4f6defcf3e	rename: 'r' → 'expression' & 'seq_r' → 'sequence' Better readability.	2023-07-05 14:42:14 +02:00
KtorZ	66296df9c3	Move parsing of literals under new 'literal' parser module group Also moved the logic for 'int' and 'string' there though it is trivial. Yet, for bytearray, it tidies things nicely by removing them from the 'utils' module.	2023-07-05 14:37:29 +02:00
KtorZ	e15cdaf248	Move 'utils::bytearray' to 'expr/bytearray'	2023-07-05 14:10:47 +02:00
KtorZ	44eb501d78	Favor pattern-match over if-else when parsing assignment kinds Equality on a union-type is potentially dangerous as the compiler won't complain if we add a new case that we don't cover. Reversing the assignment by yielding a `Token` for a given `AssignmentKind`. This way we can use a pattern-match that got us covered for future cases.	2023-07-05 14:01:13 +02:00
KtorZ	93e010b345	Replace 'public' utils with a more generic 'optional_flag' The 'public' util was arguably not really adding much except a layer of indirection. In the end, one useful parsing behavior to abstract is the idea of 'optional flag' that we use for both 'pub' and 'opaque' keywords.	2023-07-05 13:57:34 +02:00
rvcas	5e8edcb340	test(parser): finish moving tests to their correct modules	2023-07-04 17:48:48 -04:00
rvcas	47567c5e6f	test(parser): some adjustments after rebase with @ktorz fix	2023-07-04 17:19:30 -04:00
rvcas	b25db429be	test(parser): anon binop and ambiguous sequence	2023-07-04 17:19:30 -04:00
rvcas	8a6c81493c	test(parser): record create	2023-07-04 17:19:30 -04:00
rvcas	a75bcff5c8	test(parser): type alias, anon fn, record update and more	2023-07-04 17:19:30 -04:00
rvcas	bd8c13c372	test(parser): move over the validator tests and some misc tests to parser	2023-07-04 17:19:29 -04:00
rvcas	6b05d6a91e	test(parser): rename definitions to definition and more tests	2023-07-04 17:19:29 -04:00
rvcas	baf807ca2d	test(parser): list spread	2023-07-04 17:19:29 -04:00
rvcas	44d0432560	test(parser): int list	2023-07-04 17:19:29 -04:00
rvcas	f9c099a923	test: add indoc to assert_expr macro	2023-07-04 17:19:29 -04:00
rvcas	715752718d	test: assert_module	2023-07-04 17:19:29 -04:00
rvcas	8a7df7f66b	test: add empty list test	2023-07-04 17:19:29 -04:00
rvcas	f878ef7cef	feat: move some token processing to the lexer	2023-07-04 17:19:28 -04:00
rvcas	2226747dc1	feat: finish splitting up parsers	2023-07-04 17:19:28 -04:00
rvcas	eea94fc9a4	feat: move anon fn, let, and expect	2023-07-04 17:19:28 -04:00
rvcas	9c98fc8026	feat: start splitting apart expr_parser	2023-07-04 17:19:28 -04:00
rvcas	e3ed5d3b00	feat: move expr_parser and remove module.rs to definitions	2023-07-04 17:19:28 -04:00
rvcas	3339d41fdd	feat: finish moving definitions and start exprs	2023-07-04 17:19:27 -04:00
rvcas	fc580d4fa0	feat(parser): move definitions to their own modules	2023-07-04 17:19:27 -04:00
KtorZ	5a6cc855e6	Use byte count for token span in the lexer. Somehow, miette doesn't play well with spans when using chars indices. So we have to count the number of bytes in strings / chars, so that spans align accordingly.	2023-07-04 16:51:59 -04:00
KtorZ	6bd8e94e17	Preserve numeric underscore and hexadecimal notation through formatting.	2023-06-08 16:37:20 +02:00
KtorZ	79a2174f0a	Extend parser to support int as hexadecimal and numeric underscore. We only allow numeric underscore for decimal numbers as I am not sure how we can define it for non-decimal numbers?	2023-06-08 15:33:50 +02:00
rvcas	1747090931	fix: fmt crashing with comment at end of file closes #568	2023-06-07 15:59:54 -04:00
rvcas	26a607eb00	fix: bad parsing of comments at end of file closes #551	2023-05-30 11:07:39 -04:00
rvcas	7b3e1c6952	feat: adjust failing test syntax * also add a formatter test	2023-05-25 18:21:12 -04:00
rvcas	a124a16a61	feat(tests): implement a way to express that tests can fail	2023-05-25 16:54:53 -04:00
rvcas	baa2cef6c4	feat(playground): impl clone for some errors	2023-04-24 17:13:20 -04:00
rvcas	1444c9328d	fix some typos	2023-04-07 16:51:18 -04:00
KtorZ	8a2af4cd2e	Fix lexer throwing errors when parsing a too large tuple index.	2023-03-18 16:13:50 +01:00
KtorZ	1311d9bd27	Support flexible pipe operator formatting Rules are now as follows: - If a pipeline contains a newline, then the entire pipeline is formatted over multiple lines. - If it doesn't, then it's formatted as a single-line UNLESS it cannot fit; in which case, we fallback to multiline again.	2023-03-14 16:47:43 -04:00
KtorZ	a46a9fca41	Only use colors & text decorations on ANSI-capable terminals. Fixes #404.	2023-02-26 13:19:03 +01:00
KtorZ	539ed2dea4	Fix unicode char parsing in comments.	2023-02-22 17:33:13 +01:00
KtorZ	f307e214c3	Remove parse error on bytearray literals for trace, todo & error, parse as String instead. This has been bothering me and the more I thought of it the more I disliked the idea of a warning. The rationale being that in this very context, there's absolutely no ambiguity. So it is only frustrating that the parser is even able to make the exact suggestion of what should be fixed, but still fails. I can imagine it is going to be very common for people to type: ``` trace "foo" ``` ...yet terribly frustrating if they have to remember each time that this should actually be a string. Because of the `trace`, `todo` and `error` keywords, we know exactly the surrounding context and what to expect here. So we can work it nicely. However, the formatter will re-format it to: ``` trace @"foo" ``` Just for the sake of remaining consistent with the type-system. This way, we still only manipulate `String` in the AST, but we conveniently parse a double-quote utf-8 literal when coupled with one of the specific keywords. I believe that's the best of both worlds.	2023-02-19 10:10:42 +01:00
KtorZ	d72e13c7c8	Emit parse error when finding a ByteArray literal instead of String literal.	2023-02-19 10:10:42 +01:00
KtorZ	53fb821b62	Use double-quotes for utf-8 bytearrays, and @"..." for string literals The core observation is that in the context of Aiken (i.e. on-chain logic) people do not generally want to use String. Instead, they want bytearrays. So, it should be easy to produce bytearrays when needed and it should be the default. Before this commit, `"foo"` would parse as a `String`. Now, it parses as a `ByteArray`, whose bytes are the UTF-8 bytes encoding of "foo". Now, to make this change really "fool-proof", we now want to: - [ ] Emit a parse error if we parse a UTF-8 bytearray literal in place where we would expect a `String`. For example, `trace`, `error` and `todo` can only be followed by a `String`. So when we see something like: ``` trace "foo" ``` we know it's a mistake and we can suggest users to use: ``` trace @"foo" ``` instead. - [ ] Emit a warning if we ever see a bytearray literals UTF-8, which is either 56 or 64 character long and is a valid hexadecimal string. For example: ``` let policy_id = "29d222ce763455e3d7a09a665ce554f00ac89d2e99a1a83d267170c6" ``` This is _most certainly_ a mistake, as this generates a ByteArray of 56 bytes, which is effectively the hex-encoding of the provided string. In this scenario, we want to warn the user and inform them they probably meant to use: ``` let policy_id = #"29d222ce763455e3d7a09a665ce554f00ac89d2e99a1a83d267170c6" ```	2023-02-19 10:09:22 +01:00
KtorZ	4a22e5f656	Fix module comment parsing / formatting after bumping chumsky to 0.9.0	2023-02-17 14:07:24 +01:00
KtorZ	6a50bde666	Implement parser & formater for 'TraceIfFalse' Interestingly enough, chumsky seems to fail when given a 'choice' with more than 25 elements. That's why this commit groups together some of the choices as another nested 'choice'.	2023-02-16 20:29:41 -05:00
rvcas	a88a193383	fix: properly lex new token and adjust parsed spans	2023-02-16 00:05:55 -05:00
rvcas	2e7fe191db	feat(definitions): * add parsing for new validator defs * start adding typechecking * add a unit test for parsing	2023-02-16 00:05:55 -05:00
rvcas	dbd162e985	feat: handle expect in parser * map both assert/expect to Token::Expect * use the new token in the parser * new unit test to expect	2023-02-09 00:43:29 -05:00
rvcas	5a4a8df727	feat(token): add Token::Expect	2023-02-09 00:25:54 -05:00
rvcas	88ce8ba8b9	feat: remove check assignment	2023-02-01 23:03:35 -05:00
rvcas	21251d6499	fix: remove check from lexer	2023-02-01 20:44:58 -05:00
rvcas	a365649360	chore: clippy autofix	2023-02-01 18:53:11 -05:00
Lucas	b653714c0c	Merge branch 'main' into when-clause-guards	2023-01-30 11:40:29 -05:00
rvcas	703fcb451d	fix(parser,windows): capture carriage return properly	2023-01-26 10:16:29 -05:00
KtorZ	5d7585cc05	Implement parser for when clause guard With pretty parse errors on failures. The type-checker was already implemented for those, so it now only requires some work in the code generation. Fixes #297.	2023-01-21 17:43:13 +01:00
KtorZ	333a990249	Fix parsing of subtractions and negations in the absence of space.	2023-01-21 12:43:11 +01:00
KtorZ	2d99c07dd3	Support (and default to) parenthesis for block expressions This changes allow to use parenthesis `(` `)` to encapsulate expressions in addition to braces `{` `}` used to define blocks. The main use-case is for arithmetic and boolean expressions for which developers are used to using parenthesis. For example: ``` { 14 + 42 } * 1337 ``` can now be written as: ``` ( 14 + 42 ) * 1337 ``` This may sound straightforward at first but wasn't necessarily trivial in Aiken given that (a) everything is an expression, (b) whitespaces do not generally matter and (c) there's no symbol indicating the end of a 'statement' (because there's no statement). Thus, we have to properly disambiguate between: ``` let foo = bar(14 + 42) ``` and ``` let foo = bar (14 + 42) ``` Before this commit, the latter would be interpreted as a function call and would lead to a somewhat puzzling error. Now, the newline serves as a delimiting symbol. The trade-off being that for a function call, the left parenthesis has to be on the same line as the function name identifier -- which is a fair trade off. So this is still allowed: ``` let foo = bar( 14 + 42 ) ``` As there's very little ambiguity about it. This fixes #236 and would seemingly allow us to get rid of the leading `#` in front of tuples.	2023-01-14 11:49:45 -05:00
KtorZ	85a6a711e1	Add more colors to error messages. And, let 'formatdoc' insert newlines when relevant to format the text to the same column length.	2022-12-29 17:38:34 +01:00
KtorZ	3139c85fe8	Support declaring bytearray literals as base16 strings.	2022-12-29 13:08:58 +01:00
Kasey White	083b7fcb5f	feat: support negation of int * add unary op * parse, typecheck, and code gen it * express boolean not as unary op as well, previously called negate Co-authored-by: rvcas <x@rvcas.dev>	2022-12-27 20:39:03 -05:00
rvcas	37196a29ee	feat: error keyword	2022-12-23 15:52:44 -05:00
KtorZ	69f060e675	Rework all errors to provide better help text.	2022-12-23 19:27:06 +01:00
KtorZ	5cf9742e5e	Move tuple-index hint as diagnostic's help	2022-12-22 18:52:28 +01:00
KtorZ	bf7cdfba73	Implement parser & type-checker for tuple indexes. ```aiken fn foo() { let tuple = #(1, 2, 3, 4) tuple.1st + tuple.2nd + tuple.3rd + tuple.4th } ```	2022-12-22 09:14:23 +01:00
rvcas	42204d2d71	chore: make folder names match crate name	2022-12-21 18:11:07 -05:00

1 2 3

131 Commits