user0/rust - Forgejo: Beyond coding. We Forge.

user0/rust

Author	SHA1	Message	Date
Tim Neumann	c15cfc91c4	LLVM 16: Switch to using MemoryEffects	2022-11-04 17:58:16 +00:00
ouz-a	a1672ad5b8	Remove bounds check with enum cast	2022-10-31 14:10:37 +03:00
Nicholas Nethercote	003a3f8cd3	Use `br` instead of `switch` in more cases. `codegen_switchint_terminator` already uses `br` instead of `switch` when there is one normal target plus the `otherwise` target. But there's another common case with two normal targets and an `otherwise` target that points to an empty unreachable BB. This comes up a lot when switching on the tags of enums that use niches. The pattern looks like this: ``` bb1: ; preds = %bb6 %3 = load i8, ptr %_2, align 1, !range !9, !noundef !4 %4 = sub i8 %3, 2 %5 = icmp eq i8 %4, 0 %_6 = select i1 %5, i64 0, i64 1 switch i64 %_6, label %bb3 [ i64 0, label %bb4 i64 1, label %bb2 ] bb3: ; preds = %bb1 unreachable ``` This commit adds code to convert the `switch` to a `br`: ``` bb1: ; preds = %bb6 %3 = load i8, ptr %_2, align 1, !range !9, !noundef !4 %4 = sub i8 %3, 2 %5 = icmp eq i8 %4, 0 %_6 = select i1 %5, i64 0, i64 1 %6 = icmp eq i64 %_6, 0 br i1 %6, label %bb4, label %bb2 bb3: ; No predecessors! unreachable ``` This has a surprisingly large effect on compile times, with reductions of 5% on debug builds of some crates. The reduction is all due to LLVM taking less time. Maybe LLVM is just much better at handling `br` than `switch`. The resulting code is still suboptimal. - The `icmp`, `select`, `icmp` sequence is silly, converting an `i1` to an `i64` and back to an `i1`. But with the current code structure it's hard to avoid, and LLVM will easily clean it up, in opt builds at least. - `bb3` is usually now truly dead code (though not always, so it can't be removed universally).	2022-10-31 10:16:39 +11:00
bors	f42b6fa7ca	Auto merge of #103299 - nikic:usub-overflow, r=wesleywiser Don't use usub.with.overflow intrinsic The canonical form of a usub.with.overflow check in LLVM are separate sub + icmp instructions, rather than a usub.with.overflow intrinsic. Using usub.with.overflow will generally result in worse optimization potential. The backend will attempt to form usub.with.overflow when it comes to actual instruction selection. This is not fully reliable, but I believe this is a better tradeoff than using the intrinsic in IR. Fixes #103285.	2022-10-30 17:45:04 +00:00
bors	77e7b74ad5	Auto merge of #103071 - wesleywiser:fix_inlined_line_numbers, r=davidtwco Fix line numbers for MIR inlined code `should_collapse_debuginfo` detects if the specified span is part of a macro expansion however it does this by checking if the span is anything other than a normal (non-expanded) kind, then the span sequence is walked backwards to the root span. This doesn't work when the MIR inliner inlines code as it creates spans with expansion information set to `ExprKind::Inlined` and results in the line number being attributed to the inline callsite rather than the normal line number of the inlined code. Fixes #103068	2022-10-28 16:27:56 +00:00
Patrick Walton	da630ac79d	Introduce deduced parameter attributes, and use them for deducing `readonly` on indirect immutable freeze by-value function parameters. Right now, `rustc` only examines function signatures and the platform ABI when determining the LLVM attributes to apply to parameters. This results in missed optimizations, because there are some attributes that can be determined via analysis of the MIR making up the function body. In particular, `readonly` could be applied to most indirectly-passed by-value function arguments (specifically, those that are freeze and are observed not to be mutated), but it currently is not. This patch introduces the machinery that allows `rustc` to determine those attributes. It consists of a query, `deduced_param_attrs`, that, when evaluated, analyzes the MIR of the function to determine supplementary attributes. The results of this query for each function are written into the crate metadata so that the deduced parameter attributes can be applied to cross-crate functions. In this patch, we simply check the parameter for mutations to determine whether the `readonly` attribute should be applied to parameters that are indirect immutable freeze by-value. More attributes could conceivably be deduced in the future: `nocapture` and `noalias` come to mind. Adding `readonly` to indirect function parameters where applicable enables some potential optimizations in LLVM that are discussed in [issue 103103] and [PR 103070] around avoiding stack-to-stack memory copies that appear in functions like `core::fmt::Write::write_fmt` and `core::panicking::assert_failed`. These functions pass a large structure unchanged by value to a subfunction that also doesn't mutate it. Since the structure in this case is passed as an indirect parameter, it's a pointer from LLVM's perspective. As a result, the intermediate copy of the structure that our codegen emits could be optimized away by LLVM's MemCpyOptimizer if it knew that the pointer is `readonly nocapture noalias` in both the caller and callee. We already pass `nocapture noalias`, but we're missing `readonly`, as we can't determine whether a by-value parameter is mutated by examining the signature in Rust. I didn't have much success with having LLVM infer the `readonly` attribute, even with fat LTO; it seems that deducing it at the MIR level is necessary. No large benefits should be expected from this optimization now; LLVM needs some changes (discussed in [PR 103070]) to more aggressively use the `noalias nocapture readonly` combination in its alias analysis. I have some LLVM patches for these optimizations and have had them looked over. With all the patches applied locally, I enabled LLVM to remove all the `memcpy`s from the following code: ```rust fn main() { println!("Hello {}", 3); } ``` which is a significant codegen improvement over the status quo. I expect that if this optimization kicks in in multiple places even for such a simple program, then it will apply to Rust code all over the place. [issue 103103]: https://github.com/rust-lang/rust/issues/103103 [PR 103070]: https://github.com/rust-lang/rust/pull/103070	2022-10-21 02:33:15 -07:00
Nikita Popov	783301298f	Don't use usub.with.overflow intrinsic The canonical form of a usub.with.overflow check in LLVM are separate sub + icmp instructions, rather than a usub.with.overflow intrinsic. Using usub.with.overflow will generally result in worse optimization potential. The backend will attempt to form usub.with.overflow when it comes to actual instruction selection. This is not fully reliable, but I believe this is a better tradeoff than using the intrinsic in IR. Fixes #103285.	2022-10-20 12:47:17 +02:00
Wesley Wiser	34d90a46da	Fix line numbers for MIR inlined code `should_collapse_debuginfo` detects if the specified span is part of a macro expansion however it does this by checking if the span is anything other than a normal (non-expanded) kind, then the span sequence is walked backwards to the root span. This doesn't work when the MIR inliner inlines code as it creates spans with expansion information set to `ExprKind::Inlined` and results in the line number being attributed to the inline callsite rather than the normal line number of the inlined code.	2022-10-14 18:44:30 -04:00
Wesley Wiser	9363a1401e	Add test case for MIR inlining debuginfo line numbers	2022-10-14 14:09:30 -04:00
Rageking8	7122abaddf	more dupe word typos	2022-10-14 12:57:56 +08:00
bors	365578445c	Auto merge of #102724 - pcc:scs-fix-test, r=Mark-Simulacrum Fix the sanitizer_scs_attr_check.rs test The test is failing when targeting aarch64 Android. The intent appears to have been to look for a function attributes comment (or the absence of one) on the line preceding the function declaration. But this isn't quite possible with FileCheck and the test as written was looking for a line with `no_scs` after a line with `scs`, which doesn't appear in the output. Instead, match on the function attributes comment on the line following the demangled function name comment.	2022-10-11 04:27:13 +00:00
bors	a6b7274a46	Auto merge of #102596 - scottmcm:option-bool-calloc, r=Mark-Simulacrum Do the `calloc` optimization for `Option<bool>` Inspired by <https://old.reddit.com/r/rust/comments/xtiqj8/why_is_this_functional_version_faster_than_my_for/iqqy37b/>.	2022-10-10 18:42:40 +00:00
Ralf Jung	6f6433428f	add a few more assert_unsafe_precondition	2022-10-07 14:35:12 +02:00
Peter Collingbourne	5f3a4240c5	Fix the sanitizer_scs_attr_check.rs test The test is failing when targeting aarch64 Android. The intent appears to have been to look for a function attributes comment (or the absence of one) on the line preceding the function declaration. But this isn't quite possible with FileCheck and the test as written was looking for a line with `no_scs` after a line with `scs`, which doesn't appear in the output. Instead, match on the function attributes comment on the line following the demangled function name comment.	2022-10-05 21:55:24 -07:00
bors	607b8296e0	Auto merge of #102503 - cuviper:x86-stack-probes, r=nagisa Enable inline stack probes on X86 with LLVM 16 The known problems with x86 inline-asm stack probes have been solved on LLVM main (16), so this flips the switch. Anyone using bleeding-edge LLVM with rustc can start testing this, as I have done locally. We'll get more direct rust-ci when LLVM 16 branches and we start our upgrade, and we can always patch or disable it then if we find new problems. The previous attempt was #77885, reverted in #84708.	2022-10-03 02:09:05 +00:00
Scott McMurray	31cd0aa823	Do the `calloc` optimization for `Option<bool>` Inspired by <https://old.reddit.com/r/rust/comments/xtiqj8/why_is_this_functional_version_faster_than_my_for/iqqy37b/>.	2022-10-02 12:26:58 -07:00
bors	c2590e6e89	Auto merge of #102535 - scottmcm:optimize-split-at-partition-point, r=thomcc Tell LLVM that `partition_point` returns a valid fencepost This was already done for a successful `binary_search`, but this way `partition_point` can get similar optimizations. Demonstration that nightly can't do this optimization today, and leaves in the panicking path: <https://play.rust-lang.org/?version=nightly&mode=release&edition=2021&gist=e1074cd2faf5f68e49cffd728ded243a> r? `@thomcc`	2022-10-02 07:11:15 +00:00
bors	47b2eee173	Auto merge of #102424 - sunfishcode:sunfishcode/hidden-main, r=nagisa Declare `main` as visibility hidden on targets that default to hidden. On targets with `default_hidden_visibility` set, which is currrently just WebAssembly, declare the generated `main` function with visibility hidden. This makes it consistent with clang's WebAssembly target, where `main` is just a user function that gets the same visibility as any other user function, which is hidden on WebAssembly unless explicitly overridden. This will help simplify use cases which in the future may want to automatically wasm-export all visibility-"default" symbols. `main` isn't intended to be wasm-exported, and marking it hidden prevents it from being wasm-exported in that scenario.	2022-10-02 04:12:09 +00:00
Scott McMurray	c7af338e6f	Tell LLVM that `partition_point` returns a valid fencepost This was already done for a successful `binary_search`, but this way `partition_point` can get similar optimizations.	2022-09-30 23:39:15 -07:00
Dan Gohman	72f15572ee	Allow `hidden` in src/test/codegen/abi-main-signature-32bit-c-int.rs	2022-09-30 14:55:26 -07:00
Josh Stone	ed9e6f2ad8	Enable inline stack probes on X86 with LLVM 16	2022-09-29 19:49:23 -07:00
Josh Stone	ad8f519ed7	Enable inline stack probes on PowerPC and SystemZ	2022-09-26 13:40:24 -07:00
bors	4ecfdfac51	Auto merge of #100214 - scottmcm:strict-range, r=thomcc Optimize `array::IntoIter` `.into_iter()` on arrays was slower than it needed to be (especially compared to slice iterator) since it uses `Range<usize>`, which needs to handle degenerate ranges like `10..4`. This PR adds an internal `IndexRange` type that's like `Range<usize>` but with a safety invariant that means it doesn't need to worry about those cases -- it only handles `start <= end` -- and thus can give LLVM more information to optimize better. I added one simple demonstration of the improvement as a codegen test. (`vec::IntoIter` uses pointers instead of indexes, so doesn't have this problem, but that only works because its elements are boxed. `array::IntoIter` can't use pointers because that would keep it from being movable.)	2022-09-21 00:41:33 +00:00
Scott McMurray	6dbd9a29c2	Optimize `array::IntoIter` `.into_iter()` on arrays was slower than it needed to be (especially compared to slice iterator) since it uses `Range<usize>`, which needs to handle degenerate ranges like `10..4`. This PR adds an internal `IndexRange` type that's like `Range<usize>` but with a safety invariant that means it doesn't need to worry about those cases -- it only handles `start <= end` -- and thus can give LLVM more information to optimize better. I added one simple demonstration of the improvement as a codegen test.	2022-09-19 23:24:34 -07:00
Scott McMurray	335690200e	Add a codegen test for slice::from_ptr_range	2022-09-17 18:54:00 -07:00
bors	95a992a686	Auto merge of #97800 - pnkfelix:issue-97463-fix-aarch64-call-abi-does-not-zeroext, r=wesleywiser Aarch64 call abi does not zeroext (and one cannot assume it does so) Fix #97463	2022-09-16 20:08:05 +00:00
Camille GILLOT	c296a489d4	Bless codegen.	2022-09-13 19:18:24 +02:00
Camille GILLOT	7712825cc0	Bless codegen test.	2022-09-13 19:18:24 +02:00
Nicholas Bishop	54d9ba8239	Use RelocModel::Pic for UEFI targets In https://github.com/rust-lang/rust/pull/100537, the relocation model for UEFI targets was changed from PIC (the default value) to static. There was some dicussion of this change here: https://github.com/rust-lang/rust/pull/100537#discussion_r952363012 It turns out that this can cause compilation to fail as described in https://github.com/rust-lang/rust/issues/101377, so switch back to PIC. Fixes https://github.com/rust-lang/rust/issues/101377	2022-09-09 15:26:19 -04:00
Nikita Popov	cbf3b2432e	Add test for #98294 Add a test to make that the failure condition for this pattern is optimized away. Fixes #98294.	2022-09-05 15:24:18 +02:00
bors	b32223fec1	Auto merge of #100707 - dzvon:fix-typo, r=davidtwco Fix a bunch of typo This PR will fix some typos detected by [typos]. I only picked the ones I was sure were spelling errors to fix, mostly in the comments. [typos]: https://github.com/crate-ci/typos	2022-09-01 05:39:58 +00:00
bors	aa857eb953	Auto merge of #100537 - petrochenkov:piccheck, r=oli-obk rustc_target: Add some more target spec sanity checking	2022-09-01 03:13:46 +00:00
Dezhi Wu	b1430fb7ca	Fix a bunch of typo This PR will fix some typos detected by [typos]. I only picked the ones I was sure were spelling errors to fix, mostly in the comments. [typos]: https://github.com/crate-ci/typos	2022-08-31 18:24:55 +08:00
Nikita Popov	5663bb3f1c	Add test for issue #85872 This has been fixed by the LLVM 15 upgrade, add a codegen test. Fixes #85872.	2022-08-30 15:03:22 +02:00
Alex Saveau	eaa00250ba	Add another MaybeUninit array test with const Signed-off-by: Alex Saveau <saveau.alexandre@gmail.com>	2022-08-29 23:17:24 -04:00
Dylan DPC	4cac0bf662	Rollup merge of #98304 - SUPERCILEX:maybeuninit, r=nikic Add MaybeUninit memset test Closes #96274	2022-08-29 16:49:37 +05:30
bors	1e978a3627	Auto merge of #96946 - WaffleLapkin:ptr_mask, r=scottmcm Add pointer masking convenience functions This PR adds the following public API: ```rust impl<T: ?Sized> const T { fn mask(self, mask: usize) -> const T; } impl<T: ?Sized> mut T { fn mask(self, mask: usize) -> const T; } // mod intrinsics fn mask<T>(ptr: const T, mask: usize) -> const T ``` This is equivalent to `ptr.map_addr(\|a\| a & mask)` but also uses a cool llvm intrinsic. Proposed in https://github.com/rust-lang/rust/pull/95643#issuecomment-1121562352 cc `@Gankra` `@scottmcm` `@RalfJung` r? rust-lang/libs-api	2022-08-28 01:34:47 +00:00
Vadim Petrochenkov	f7eb7ef2ca	Update tests for UEFI and AVR	2022-08-27 16:50:41 +03:00
Yuki Okushi	134cc2d6be	Rollup merge of #99784 - est31:deny_cfg_attr_crate_type_name, r=Mark-Simulacrum Make forward compatibility lint deprecated_cfg_attr_crate_type_name deny by default Turns the forward compatibility lint added by #83744 to deprecate `cfg_attr` usage with `#![crate_type]` and `#![crate_name]` attributes into deny by default. Copying the example from #83744: ```Rust #![crate_type = "lib"] // remains working #![cfg_attr(foo, crate_type = "bin")] // will stop working ``` Over 8 months have passed since #83744 was merged so I'd say this gives ample time for people to have been warned, so we can make the warning stronger. No usage was found via grep.app except for one, which was in an unmaintained code base that didn't seem to be used in the open source eco system. The crater run conducted in #83744 also didn't show up anything. cc #91632 - tracking issue for the lint	2022-08-27 13:14:16 +09:00
Alex Saveau	8c62cc2064	Add MaybeUninit memset test Signed-off-by: Alex Saveau <saveau.alexandre@gmail.com>	2022-08-25 01:15:37 -04:00
Matthias Krüger	938ebf9a83	Rollup merge of #100760 - krasimirgg:llvm-16-pic-level, r=nikic update test for LLVM change LLVM commit `c2a3888793` updates the PIC level version selection. Updated an affected rust test to work under both the old and new behaviors. Detected by our experimental rust + llvm @ HEAD bot: https://buildkite.com/llvm-project/rust-llvm-integrate-prototype/builds/12829#0182b368-a405-47a2-b3da-9c79cb907bfe/701-709	2022-08-21 16:54:03 +02:00
Maybe Waffle	ca75312408	fix `ptr_mask` codegen test wrt llvm opaque pointers	2022-08-21 07:04:11 +04:00
Maybe Waffle	92b05db761	Do not use void pointer for `ptr_mask` intrinsic I couldn't find where exactly it's documented, but apperantly pointers to void type are invalid in llvm - void is only allowed as a return type of functions.	2022-08-21 05:27:14 +04:00
Maybe Waffle	55ba58cadb	make `ptr_mask` codegen test more specific	2022-08-21 05:27:14 +04:00
Maybe Waffle	553f790556	Add codegen test for `intinsics::ptr_mask`	2022-08-21 05:27:14 +04:00
bors	878aef79dc	Auto merge of #100810 - matthiaskrgr:rollup-xep778s, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #97963 (net listen backlog set to negative on Linux.) - #99935 (Reenable disabled early syntax gates as future-incompatibility lints) - #100129 (add miri-test-libstd support to libstd) - #100500 (Ban references to `Self` in trait object substs for projection predicates too.) - #100636 (Revert "Revert "Allow dynamic linking for iOS/tvOS targets."") - #100718 ([rustdoc] Fix item info display) - #100769 (Suggest adding a reference to a trait assoc item) - #100777 (elaborate how revisions work with FileCheck stuff in src/test/codegen) - #100796 (Refactor: remove unnecessary string searchings) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup	2022-08-20 20:08:26 +00:00
Matthias Krüger	45568bdaf7	Rollup merge of #100693 - scottmcm:new-llvm15-nops, r=Mark-Simulacrum Add LLVM15-specific codegen test for `try`/`?`s that now optimize away These still generated a bunch of code back in Rust 1.63 (<https://rust.godbolt.org/z/z31P8h6rz>), but with LLVM 15 merged they no longer do 🎉	2022-08-20 19:32:12 +02:00
Felix S. Klock II	f47b61d19e	elaborate how revisions work with FileCheck stuff in src/test/codegen	2022-08-19 16:40:26 -04:00
Felix S. Klock II	59cc718e76	Update codegen tests to accommodate the potential presence/absence of the extension operation depending on target architecture.	2022-08-19 16:15:15 -04:00
Krasimir Georgiev	07e41fb54c	update test for LLVM change LLVM commit `c2a3888793` updates the PIC level version selection. This updates the rust tests to work under both the old and new behaviors. Detected by our experimental rust + llvm @ HEAD bot: https://buildkite.com/llvm-project/rust-llvm-integrate-prototype/builds/12829#0182b368-a405-47a2-b3da-9c79cb907bfe/701-709	2022-08-19 15:52:43 +00:00

1 2 3 4 5 ...

1005 commits