nix-super

mirror of https://github.com/privatevoid-net/nix-super.git synced 2024-11-28 16:46:16 +02:00

Author	SHA1	Message	Date
pennae	5d9fdab3de	use byte indexed locations for PosIdx we now keep not a table of all positions, but a table of all origins and their sizes. position indices are now direct pointers into the virtual concatenation of all parsed contents. this slightly reduces memory usage and time spent in the parser, at the cost of not being able to report positions if the total input size exceeds 4GiB. this limit is not unique to nix though, rustc and clang also limit their input to 4GiB (although at least clang refuses to process inputs that are larger, we will not). this new 4GiB limit probably will not cause any problems for quite a while, all of nixpkgs together is less than 100MiB in size and already needs over 700MiB of memory and multiple seconds just to parse. 4GiB worth of input will easily take multiple minutes and over 30GiB of memory without even evaluating anything. if problems do arise we can probably recover the old table-based system by adding some tracking to Pos::Origin (or increasing the size of PosIdx outright), but for the time being this looks like more complexity than it's worth. since we now need to read the entire input again to determine the line/column of a position we'll make unsafeGetAttrPos slightly lazy: mostly the set it returns is only used to determine the file of origin of an attribute, not its exact location. the thunks do not add measurable runtime overhead. notably this change is necessary to allow changing the parser since apparently nothing supports nix's very idiosyncratic line ending choice of "anything goes", making it very hard to calculate line/column positions in the parser (while byte offsets are very easy).	2024-03-06 23:48:42 +01:00
pennae	f24e445bc0	add doc comment justifying ExprInheritFrom Co-authored-by: Robert Hensing <roberth@users.noreply.github.com>	2024-02-26 19:07:08 +01:00
pennae	1cd87b7042	remove ExprAttrs::AttrDef::inherited it's no longer widely used and has a rather confusing meaning now that inherit-from is handled very differently.	2024-02-26 19:07:08 +01:00
pennae	cefd0302b5	evaluate inherit (from) exprs only once per directive desugaring inherit-from to syntactic duplication of the source expr also duplicates side effects of the source expr (such as trace calls) and expensive computations (such as derivationStrict).	2024-02-26 19:07:08 +01:00
pennae	6c08fba533	use the same bindings print for ExprAttrs and ExprLet this also has the effect of sorting let bindings lexicographically rather than by symbol creation order as was previously done, giving a better canonicalization in the process.	2024-02-12 13:35:00 +01:00
pennae	1f542adb3e	add ExprAttrs::AttrDef::chooseByKind in place of inherited() — not quite useful yet since we don't distinguish plain and inheritFrom attr kinds so far.	2024-02-12 13:34:59 +01:00
pennae	c66ee57edc	preserve information about whether/how an attribute was inherited	2024-02-12 13:32:33 +01:00
Rebecca Turner	c6a89c1a16	libexpr: Support structured error classes While preparing PRs like #9753, I've had to change error messages in dozens of code paths. It would be nice if instead of EvalError("expected 'boolean' but found '%1%'", showType(v)) we could write TypeError(v, "boolean") or similar. Then, changing the error message could be a mechanical refactor with the compiler pointing out places the constructor needs to be changed, rather than the error-prone process of grepping through the codebase. Structured errors would also help prevent the "same" error from having multiple slightly different messages, and could be a first step towards error codes / an error index. This PR reworks the exception infrastructure in `libexpr` to support exception types with different constructor signatures than `BaseError`. Actually refactoring the exceptions to use structured data will come in a future PR (this one is big enough already, as it has to touch every exception in `libexpr`). The core design is in `eval-error.hh`. Generally, errors like this: state.error("'%s' is not a string", getAttrPathStr()) .debugThrow<TypeError>() are transformed like this: state.error<TypeError>("'%s' is not a string", getAttrPathStr()) .debugThrow() The type annotation has moved from `ErrorBuilder::debugThrow` to `EvalState::error`.	2024-02-01 16:39:38 -08:00
Rebecca Turner	c62c21e29a	Move `PosIdx` to `pos-idx.hh` and `PosTable` to `pos-table.hh`	2024-02-01 13:12:59 -08:00
pennae	09a1128d9e	don't repeatedly look up ast internal symbols these symbols are used a lot, so it makes sense to cache them. this mostly increases clarity of the code (however clear one may wish to call the parser desugaring here), but it also provides a small performance benefit.	2024-01-15 16:52:18 +01:00
Rebecca Turner	4feb7d9f71	Combine `AbstractPos`, `PosAdapter`, and `Pos` Also move `SourcePath` into `libutil`. These changes allow `error.hh` and `error.cc` to access source path and position information, which we can use to produce better error messages (for example, we could consider omitting filenames when two or more consecutive stack frames originate from the same file).	2024-01-08 10:59:41 -08:00
Eelco Dolstra	315aade89d	Merge pull request #9681 from edolstra/eval-optimisations Optimize empty list constants	2024-01-03 10:43:01 +01:00
Eelco Dolstra	3f796514b3	Optimize empty list constants This avoids a Value allocation for empty list constants. During a `nix search nixpkgs`, about 82% of all thunked lists are empty, so this removes about 3 million Value allocations. Performance comparison on `nix search github:NixOS/nixpkgs/e1fa12d4f6c6fe19ccb59cac54b5b3f25e160870 --no-eval-cache`: maximum RSS: median = 3845432.0000 mean = 3845432.0000 stddev = 0.0000 min = 3845432.0000 max = 3845432.0000 [rejected?, p=0.00000, Δ=-70084.00000±0.00000] soft page faults: median = 965395.0000 mean = 965394.6667 stddev = 1.1181 min = 965392.0000 max = 965396.0000 [rejected?, p=0.00000, Δ=-17929.77778±38.59610] system CPU time: median = 1.8029 mean = 1.7702 stddev = 0.0621 min = 1.6749 max = 1.8417 [rejected, p=0.00064, Δ=-0.12873±0.09905] user CPU time: median = 14.1022 mean = 14.0633 stddev = 0.1869 min = 13.8118 max = 14.3190 [not rejected, p=0.03006, Δ=-0.18248±0.24928] elapsed time: median = 15.8205 mean = 15.8618 stddev = 0.2312 min = 15.5033 max = 16.1670 [not rejected, p=0.00558, Δ=-0.28963±0.29434]	2024-01-02 12:49:11 +01:00
pennae	1fe66852ff	reduce the size of Env by one pointer since `up` and `values` are both pointer-aligned the type field will also be pointer-aligned, wasting 48 bits of space on most machines. we can get away with removing the type field altogether by encoding some information into the `with` expr that created the env to begin with, reducing the GC load for the absolutely massive amount of single-entry envs we create for lambdas. this reduces memory usage of system eval by quite a bit (reducing heap size of our system eval from 8.4GB to 8.23GB) and gives similar savings in eval time. running `nix eval --raw --impure --expr 'with import <nixpkgs/nixos> {}; system'` before: Time (mean ± σ): 5.576 s ± 0.003 s [User: 5.197 s, System: 0.378 s] Range (min … max): 5.572 s … 5.581 s 10 runs after: Time (mean ± σ): 5.408 s ± 0.002 s [User: 5.019 s, System: 0.388 s] Range (min … max): 5.405 s … 5.411 s 10 runs	2023-12-30 18:55:13 +01:00
pennae	2b0e95e7aa	use singleton expr to generate black hole errors this also reduces forceValue code size and removes the need for hideInDiagnostics. coopting thunk forcing like this has the additional benefit of clarifying how these errors can happen in the first place.	2023-12-19 19:32:16 +01:00
pennae	78353deb02	encode black holes as tApp values checking for isBlackhole in the forceValue hot path is rather more expensive than necessary, and with a little bit of trickery we can move such handling into the isApp case. small performance benefit, but under some circumstances we've seen 2% improvement as well. 〉 nix eval --raw --impure --expr 'with import <nixpkgs/nixos> {}; system' before: Time (mean ± σ): 4.429 s ± 0.002 s [User: 3.929 s, System: 0.500 s] Range (min … max): 4.427 s … 4.433 s 10 runs after: Time (mean ± σ): 4.396 s ± 0.002 s [User: 3.894 s, System: 0.501 s] Range (min … max): 4.393 s … 4.399 s 10 runs	2023-12-19 19:32:16 +01:00
Rebecca Turner	0b80935c22	Pass positions when evaluating This includes position information in more places, making debugging easier. Before: ``` $ nix-instantiate --show-trace --eval tests/functional/lang/eval-fail-using-set-as-attr-name.nix error: … while evaluating an attribute name at «none»:0: (source not available) error: value is a set while a string was expected ``` After: ``` error: … while evaluating an attribute name at /pwd/lang/eval-fail-using-set-as-attr-name.nix:5:10: 4\| in 5\| attr.${key} \| ^ 6\| error: value is a set while a string was expected ```	2023-12-07 10:27:21 -08:00
Eelco Dolstra	ea38605d11	Introduce FSInputAccessor and use it Backported from the lazy-trees branch. Note that this doesn't yet use the access control features of FSInputAccessor.	2023-10-18 17:37:32 +02:00
Robert Hensing	b19bd4f348	Merge pull request #8970 from hercules-ci/eval-stuff Expr: remove redundant fields, add nrExprs	2023-09-25 19:49:22 +02:00
Robert Hensing	bd24176ac5	libexpr/nixexpr.hh: Remove redundant inline This is redundant since definitions in C++ record are implicitly inline-ed. Co-authored-by: Yingchi Long <i@lyc.dev>	2023-09-25 17:51:17 +01:00
Yingchi Long	e4b83fbfe2	libexpr: const rvalue reference -> value for nix::Expr nodes	2023-09-24 14:54:41 +08:00
Robert Hensing	bf8deb4991	Expr: remove redundant int and float fields	2023-09-12 13:45:45 +02:00
Robert Hensing	3720e811fa	libexpr: Add nrExprs to NIX_SHOW_STATS	2023-09-12 13:21:55 +02:00
Robert Hensing	cb2615cf47	Merge remote-tracking branch 'upstream/master' into source-path	2023-04-17 11:41:50 +02:00
John Ericson	0746951be1	Finish converting existing comments for internal API docs (#8146) * Finish converting existing comments for internal API docs 99% of this was just reformatting existing comments. Only two exceptions: - Expanded upon `BuildResult::status` compat note - Split up file-level `symbol-table.hh` doc comments to get per-definition docs Also fixed a few whitespace goofs, turning leading tabs to spaces and removing trailing spaces. Picking up from #8133 * Fix two things from comments * Use triple-backtick not indent for `dumpPath` * Convert GNU-style `\`..'` quotes to markdown style in API docs This will render correctly.	2023-04-07 13:55:28 +00:00
Eelco Dolstra	a9759407e5	Origin: Use SourcePath	2023-04-06 15:25:06 +02:00
John Ericson	f4ab297b31	Ensure all headers have `#pragma once` and are in API docs `///@file` makes them show up in the internal API docs. A tiny few were missing `#pragma once`.	2023-03-31 23:19:44 -04:00
Et7f3	cec23f5dda	ExprOpHasAttr,ExprSelect,stripIndentation,binds,formals: delete lost objects We are looking for *$ because it indicates that the object was constructed with new but not released. De-referencing makes a shallow copy, so deleting the whole object might create a dangling pointer; that's why we move it, so that we delete an empty container, plus the nice perf boost.	2023-02-16 19:53:55 +01:00
Et7f3	fa89d317b7	ExprString: Avoid copy of string	2023-02-12 05:49:45 +01:00
Guillaume Maudoux	e4726a0c79	Revert "Revert "Merge pull request #6204 from layus/coerce-string"" This reverts commit `9b33ef3879`.	2023-01-19 13:23:04 +01:00
Robert Hensing	9b33ef3879	Revert "Merge pull request #6204 from layus/coerce-string" This reverts commit `a75b7ba30f`, reversing changes made to `9af16c5f74`.	2023-01-18 01:34:07 +01:00
Eelco Dolstra	6b69652385	Merge remote-tracking branch 'origin/master' into coerce-string	2023-01-02 20:53:39 +01:00
Eelco Dolstra	c9b0a85b08	Restore display of source lines for stdin/string inputs	2022-12-13 16:00:44 +01:00
Eelco Dolstra	b3fdab28a2	Introduce AbstractPos This makes the position object used in exceptions abstract, with a method getSource() to get the source code of the file in which the error originated. This is needed for lazy trees because source files don't necessarily exist in the filesystem, and we don't want to make libutil depend on the InputAccessor type in libfetcher.	2022-12-13 00:50:43 +01:00
Guillaume Maudoux	eb460a9529	WIP: broken merge but need a git checkpoint	2022-09-07 00:34:03 +02:00
Eelco Dolstra	81a486c607	Shut up clang warnings	2022-06-02 21:19:54 +02:00
Eelco Dolstra	9acc770ce4	Remove pre-C++11 hackiness	2022-05-26 12:40:01 +02:00
Ben Burdette	7ccb2700c0	comments	2022-05-22 19:15:58 -06:00
Ben Burdette	0600df86b8	'debugMode'	2022-05-19 17:01:23 -06:00
Ben Burdette	7ddef73d02	de-const evalState exceptions	2022-05-19 12:44:40 -06:00
Ben Burdette	f9cdb6af8d	Merge branch 'debug-exploratory-PR' into debuggerHook-eval-arg	2022-05-19 11:07:18 -06:00
Ben Burdette	357fb84dba	use an expr->StaticEnv table in evalState	2022-05-19 10:48:10 -06:00
Ben Burdette	667074b586	first whack at passing evalState as an arg to debuggerHook.	2022-05-16 09:20:51 -06:00
Eelco Dolstra	dd8b91eebc	Style fixes In particular, use std::make_shared and enumerate(). Also renamed some fields to fit naming conventions.	2022-05-05 17:17:03 +02:00
Ben Burdette	2a5632c70d	incorporate PosIdx changes, symbol changes.	2022-04-29 10:02:17 -06:00
Guillaume Maudoux	e93b59fbc5	Merge remote-tracking branch 'origin/master' into coerce-string	2022-04-29 00:12:25 +02:00
Ben Burdette	6e19947993	Merge branch 'master' into debug-merge-master	2022-04-28 12:32:57 -06:00
Eelco Dolstra	fab731a9d4	Don't pass Symbol by reference Since Symbol is just an integer, passing it by const reference is never advantageous.	2022-04-26 13:25:17 +02:00
pennae	a385e51a08	rename SymbolIdx -> Symbol, Symbol -> SymbolStr after #6218 `Symbol` no longer confers a uniqueness invariant on the string it wraps, it is now possible to create multiple symbols that compare equal but whose string contents have different addresses. this guarantee is now only provided by `SymbolIdx`, leaving `Symbol` only as a string wrapper that knows about the intricacies of how symbols need to be formatted for output. this change renames `SymbolIdx` to `Symbol` to restore the previous semantics of `Symbol` to that name. we also keep the wrapper type and rename it to `SymbolStr` instead of returning plain strings from lookups into the symbol table because symbols are formatted for output in many places. theoretically we do not need `SymbolStr`, only a function that formats a string for output as a symbol, but having to wrap every symbol that appears in a message into eg `formatSymbol()` is error-prone and inconvenient.	2022-04-25 15:37:01 +02:00
Théophane Hufschmitt	7ca6fbc8ca	Move ChunkedVector to its own header	2022-04-22 10:01:02 +02:00
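
The scheme described in commit 5d9fdab3de (a table of origins and their sizes, with position indices pointing into the virtual concatenation of all parsed inputs) can be illustrated with a minimal sketch. Every name below (`Origin`, `OriginTable`, `lineColumn`, `resolve`) is hypothetical and chosen for illustration; this is not the code from the commit. The line-ending handling is also an assumption: the sketch treats `\n`, `\r\n`, and a lone `\r` each as one line ending, as one possible reading of "anything goes".

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <iterator>
#include <string>
#include <string_view>
#include <utility>
#include <vector>

// Hypothetical types for illustration; not the actual Nix data structures.
struct LineCol {
    uint32_t line;
    uint32_t column;
};

// Recompute a 1-based line/column by rescanning the text up to `offset`.
// Assumption: "\n", "\r\n" and a lone "\r" each count as one line ending.
inline LineCol lineColumn(std::string_view text, size_t offset) {
    assert(offset <= text.size());
    LineCol pos{1, 1};
    for (size_t i = 0; i < offset; ++i) {
        char c = text[i];
        if (c == '\r') {
            // Treat "\r\n" as a single line ending when both bytes precede offset.
            if (i + 1 < offset && text[i + 1] == '\n') ++i;
            ++pos.line;
            pos.column = 1;
        } else if (c == '\n') {
            ++pos.line;
            pos.column = 1;
        } else {
            ++pos.column;
        }
    }
    return pos;
}

struct Origin {
    std::string contents; // full text of one parsed input
    uint32_t start;       // offset of this input in the virtual concatenation
};

struct Resolved {
    size_t origin; // index of the origin the position belongs to
    LineCol pos;   // line/column inside that origin
};

class OriginTable {
    std::vector<Origin> origins;
    uint32_t total = 0; // combined size of all inputs; a 32-bit index caps this at 4GiB

public:
    // Register an input and return its origin index.
    size_t add(std::string contents) {
        assert(contents.size() <= UINT32_MAX - total);
        uint32_t size = static_cast<uint32_t>(contents.size());
        origins.push_back(Origin{std::move(contents), total});
        total += size;
        return origins.size() - 1;
    }

    // A position index is the origin's start plus a byte offset into it.
    uint32_t indexFor(size_t origin, uint32_t byteOffset) const {
        assert(origin < origins.size());
        assert(byteOffset <= origins[origin].contents.size());
        return origins[origin].start + byteOffset;
    }

    // Map a position index back to its origin, then rescan for line/column.
    Resolved resolve(uint32_t idx) const {
        // First origin whose start is greater than idx; the one before it owns idx.
        auto it = std::upper_bound(
            origins.begin(), origins.end(), idx,
            [](uint32_t value, const Origin & o) { return value < o.start; });
        assert(it != origins.begin());
        const Origin & o = *std::prev(it);
        size_t offset = idx - o.start;
        assert(offset <= o.contents.size());
        return Resolved{static_cast<size_t>(std::prev(it) - origins.begin()),
                        lineColumn(o.contents, offset)};
    }
};
```

In this sketch the parser only ever needs the cheap `indexFor` step, while the line/column scan in `resolve` is paid when a position is actually reported, which is the trade-off the commit message describes. An index that falls exactly on the boundary between two origins is attributed to the later one here; a real implementation would need to decide that case deliberately.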


197 commits