user0/rust

Author	SHA1	Message	Date
bors	263edd43c5	Auto merge of #99033 - 5225225:interpreter-validity-checks, r=oli-obk Use constant eval to do strict mem::uninit/zeroed validity checks I'm not sure about the code organisation here, I just dumped the check in rustc_const_eval at the root. Not hard to move it elsewhere, in any case. Also, this means cranelift codegen intrinsics lose the strict checks, since they don't seem to depend on rustc_const_eval, and I didn't see a point in keeping around two copies. I also left comments in the is_zero_valid methods about "uhhh help how do i do this", those apply to both methods equally. Also rustc_codegen_ssa now depends on rustc_const_eval... is this okay? Pinging `@RalfJung` since you were the one who mentioned this to me, so I'm assuming you're interested. Haven't had a chance to run full tests on this since it's really warm, and it's 1AM, I'll check out any failures/comments in the morning :)	2022-07-17 19:28:01 +00:00
Oli Scherer	84a444a1f4	Introduce opaque type to hidden type projection	2022-07-15 15:49:22 +00:00
5225225	27412d1e3e	Use constant eval to do strict validity checks	2022-07-14 22:55:17 +01:00
Joshua Nelson	3c9765cff1	Rename `debugging_opts` to `unstable_opts` This is no longer used only for debugging options (e.g. `-Zoutput-width`, `-Zallow-features`). Rename it to be more clear.	2022-07-13 17:47:06 -05:00
ouz-a	cb0017f2f8	add new rval, pull deref early	2022-07-12 14:26:41 +03:00
Ralf Jung	4e7aaf1f44	tweak names and output and bless	2022-07-09 07:43:56 -04:00
Ralf Jung	ac265cdc19	review feedback	2022-07-09 07:27:29 -04:00
Ralf Jung	a422b42159	don't allow ZST in ScalarInt There are several indications that we should not represent ZST as a ScalarInt: - We had two ways to have ZST valtrees, either an empty `Branch` or a `Leaf` with a ZST in it. `ValTree::zst()` used the former, but the latter could possibly arise as well. - Likewise, the interpreter had `Immediate::Uninit` and `Immediate::Scalar(Scalar::ZST)`. - LLVM codegen already had to special-case ZST ScalarInt. So instead add new ZST variants to those types that did not have other variants which could be used for this purpose.	2022-07-09 07:27:29 -04:00
bors	1dcff2d507	Auto merge of #98638 - bjorn3:less_string_interning, r=tmiasko Use less string interning This removes string interning in a couple of places where doing so won't result in perf improvements. I also switched one place to use pre-interned symbols.	2022-07-08 10:03:27 +00:00
bors	3e51277fe6	Auto merge of #99014 - Dylan-DPC:rollup-n84y0jk, r=Dylan-DPC Rollup of 8 pull requests Successful merges: - #96856 (Fix ProjectionElem validation) - #97711 (Improve soundness of rustc_arena) - #98507 (Finishing touches for `#[expect]` (RFC 2383)) - #98692 (rustdoc: Cleanup more FIXMEs) - #98901 (incr: cache dwarf objects in work products) - #98930 (Make MIR basic blocks field public) - #98973 (Remove (unused) inherent impl anchors) - #98981 ( Edit `rustc_mir_dataflow::framework` documentation ) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup	2022-07-07 15:08:27 +00:00
bors	20dd693013	Auto merge of #98675 - eddyb:cg-array-literal, r=nikic rustc_codegen_ssa: use `project_index`, not `project_field`, for array literals. See https://github.com/rust-lang/rust/pull/98615#issuecomment-1170082774 for some context. In short, we were using `project_field` even for array `mir::Rvalue::Aggregate`s, which results in benchmarks like `deep-vector.rs` (and presumably also some real-world usecases?) being impacted by how we handle non-array aggregate fields. (This is a separate PR so that we can measure the perf effects in isolation) r? `@nikic`	2022-07-07 12:23:26 +00:00
Tomasz Miąsko	17adfeb2b4	Move `dominators` from Body to BasicBlocks	2022-07-07 08:11:49 +02:00
Guillaume Gomez	4755173cf6	Rollup merge of #96935 - thomcc:atomicptr-strict-prov, r=dtolnay Allow arithmetic and certain bitwise ops on AtomicPtr This is mainly to support migrating from `AtomicUsize`, for the strict provenance experiment. This is a pretty dubious set of APIs, but it should be sufficient to allow code that's using `AtomicUsize` to manipulate a tagged pointer atomically. It's under a new feature gate, `#![feature(strict_provenance_atomic_ptr)]`, but I'm not sure if it needs its own tracking issue. I'm happy to make one, but it's not clear that it's needed. I'm unsure if it needs changes in the various non-LLVM backends. Because we just cast things to integers anyway (and were already doing so), I doubt it. API change proposal: https://github.com/rust-lang/libs-team/issues/60 Fixes #95492	2022-07-06 20:43:23 +02:00
Alan Egerton	4f0a64736b	Update TypeVisitor paths	2022-07-06 06:41:53 +01:00
bors	53792b9c5c	Auto merge of #96862 - oli-obk:enum_cast_mir, r=RalfJung Change enum->int casts to not go through MIR casts. follow-up to https://github.com/rust-lang/rust/pull/96814 this simplifies all backends and even gives LLVM more information about the return value of `Rvalue::Discriminant`, enabling optimizations in more cases.	2022-07-05 09:36:29 +00:00
Oli Scherer	82c73af4a6	Prefer trace level instrumentation for the new noisy instrument attributes	2022-07-05 09:27:06 +00:00
Oli Scherer	c3aec3056e	Add a helper method with an explicit name instead of hand rolling a match 3x	2022-07-05 09:26:45 +00:00
bors	0075bb4fad	Auto merge of #91743 - cjgillot:enable_mir_inlining_inline_all, r=oli-obk Enable MIR inlining Continuation of https://github.com/rust-lang/rust/pull/82280 by `@wesleywiser`. #82280 has shown nice compile time wins could be obtained by enabling MIR inlining. Most of the issues in https://github.com/rust-lang/rust/issues/81567 are now fixed, except the interaction with polymorphization which is worked around specifically. I believe we can proceed with enabling MIR inlining in the near future (preferably just after beta branching, in case we discover new issues). Steps before merging: - [x] figure out the interaction with polymorphization; - [x] figure out how miri should deal with extern types; - [x] silence the extra arithmetic overflow warnings; - [x] remove the codegen fulfilment ICE; - [x] remove the type normalization ICEs while compiling nalgebra; - [ ] tweak the inlining threshold.	2022-07-02 11:24:17 +00:00
lcnr	cf9c0a5935	cleanup mir visitor for `rustc::pass_by_value`	2022-07-01 16:21:21 +02:00
Thom Chiovoloni	2f872afdb5	Allow arithmetic and certain bitwise ops on AtomicPtr This is mainly to support migrating from AtomicUsize, for the strict provenance experiment. Fixes #95492	2022-07-01 06:21:18 -07:00
Camille GILLOT	0161ecd13f	Recover when failing to normalize closure signature.	2022-06-30 21:45:29 +02:00
bors	7425fb293f	Auto merge of #98377 - davidv1992:add-lifetimes-to-argument-temporaries, r=oli-obk Added llvm lifetime annotations to function call argument temporaries. The goal of this change is to ensure that llvm will do stack slot optimization on these temporaries. This ensures that in code like: ```rust const A: [u8; 1024] = [0; 1024]; fn copy_const() { f(A); f(A); } ``` we only use 1024 bytes of stack space, instead of 2048 bytes. I am new to developing for the rust compiler, and as such not entirely sure, but I believe this should be sufficient to close #98156. Also, this does not contain a test case to ensure this keeps working, primarily because I am not sure how to go about testing this. I would love some suggestions as to how that could be approached.	2022-06-30 09:20:52 +00:00
Oli Scherer	7839cb963f	Change enum->int casts to not go through MIR casts. Instead we generate a discriminant rvalue and cast the result of that.	2022-06-30 07:47:07 +00:00
Eduard-Mihai Burtescu	900309ec8a	rustc_codegen_ssa: use `project_index`, not `project_field`, for array literals.	2022-06-29 14:58:14 +00:00
Oli Scherer	0e674b3ec5	Some tracing cleanups	2022-06-29 09:56:30 +00:00
Dylan DPC	45740acd34	Rollup merge of #97423 - m-ou-se:memory-ordering-intrinsics, r=tmiasko Simplify memory ordering intrinsics This changes the names of the atomic intrinsics to always fully include their memory ordering arguments. ```diff - atomic_cxchg + atomic_cxchg_seqcst_seqcst - atomic_cxchg_acqrel + atomic_cxchg_acqrel_release - atomic_cxchg_acqrel_failrelaxed + atomic_cxchg_acqrel_relaxed // And so on. ``` - `seqcst` is no longer implied - The failure ordering on cxchg is no longer implied in some cases, but now always explicitly part of the name. - `release` is no longer shortened to just `rel`. That was especially confusing, since `relaxed` also starts with `rel`. - `acquire` is no longer shortened to just `acq`, such that the names now all match the `std::sync::atomic::Ordering` variants exactly. - This now allows for more combinations on the compare exchange operations, such as `atomic_cxchg_acquire_release`, which is necessary for #68464. - This PR only exposes the new possibilities through unstable intrinsics, but not yet through the stable API. That's for [a separate PR](https://github.com/rust-lang/rust/pull/98383) that requires an FCP. 
Suffixes for operations with a single memory order: \| Order \| Before \| After \| \|---------\|--------------\|------------\| \| Relaxed \| `_relaxed` \| `_relaxed` \| \| Acquire \| `_acq` \| `_acquire` \| \| Release \| `_rel` \| `_release` \| \| AcqRel \| `_acqrel` \| `_acqrel` \| \| SeqCst \| (none) \| `_seqcst` \| Suffixes for compare-and-exchange operations with two memory orderings: \| Success \| Failure \| Before \| After \| \|---------\|---------\|--------------------------\|--------------------\| \| Relaxed \| Relaxed \| `_relaxed` \| `_relaxed_relaxed` \| \| Relaxed \| Acquire \| ❌ \| `_relaxed_acquire` \| \| Relaxed \| SeqCst \| ❌ \| `_relaxed_seqcst` \| \| Acquire \| Relaxed \| `_acq_failrelaxed` \| `_acquire_relaxed` \| \| Acquire \| Acquire \| `_acq` \| `_acquire_acquire` \| \| Acquire \| SeqCst \| ❌ \| `_acquire_seqcst` \| \| Release \| Relaxed \| `_rel` \| `_release_relaxed` \| \| Release \| Acquire \| ❌ \| `_release_acquire` \| \| Release \| SeqCst \| ❌ \| `_release_seqcst` \| \| AcqRel \| Relaxed \| `_acqrel_failrelaxed` \| `_acqrel_relaxed` \| \| AcqRel \| Acquire \| `_acqrel` \| `_acqrel_acquire` \| \| AcqRel \| SeqCst \| ❌ \| `_acqrel_seqcst` \| \| SeqCst \| Relaxed \| `_failrelaxed` \| `_seqcst_relaxed` \| \| SeqCst \| Acquire \| `_failacq` \| `_seqcst_acquire` \| \| SeqCst \| SeqCst \| (none) \| `_seqcst_seqcst` \|	2022-06-29 10:28:18 +05:30
bjorn3	f6484fa9b5	Avoid unnecessary string interning for const_str	2022-06-28 18:38:36 +00:00
Mara Bos	4982a59986	Rename/restructure memory ordering intrinsics.	2022-06-28 08:58:27 +02:00
David Venhoek	8f529aba86	Improved naming for copied constant arguments vector.	2022-06-25 16:36:11 +02:00
David Venhoek	a174d65709	Added llvm lifetime annotations to function call argument temporaries. The goal of this change is to ensure that llvm will do stack slot optimization on these temporaries. This ensures that in code like: ```rust const A: [u8; 1024] = [0; 1024]; fn copy_const() { f(A); f(A); } ``` we only use 1024 bytes of stack space, instead of 2048 bytes.	2022-06-22 11:47:22 +02:00
DrMeepster	1d1ff36214	fix codegen assertion	2022-06-15 18:39:23 -07:00
DrMeepster	cb417881a9	remove box derefs from codegen	2022-06-15 18:38:26 -07:00
bors	2d1e075079	Auto merge of #96285 - flip1995:pk-vfe, r=nagisa Introduce `-Zvirtual-function-elimination` codegen flag Fixes #68262 This PR adds a codegen flag `-Zvirtual-function-elimination` to enable the VFE optimization in LLVM. To make this work, additional information has to be added to vtables ([`!vcall_visibility` metadata](https://llvm.org/docs/TypeMetadata.html#vcall-visibility-metadata) and a `typeid` of the trait). Furthermore, instead of just `load`ing functions, the [`llvm.type.checked.load` intrinsic](https://llvm.org/docs/LangRef.html#llvm-type-checked-load-intrinsic) has to be used to map functions to vtables. For technical details of the changes, see the commit messages. I also tested this flag on https://github.com/tock/tock on different boards to verify that this fixes the issue https://github.com/tock/tock/issues/2594. This flag is able to improve the size of the resulting binary by about 8k-9k bytes by removing the unused debug print functions. [Rendered documentation update](https://github.com/flip1995/rust/blob/pk-vfe/src/doc/rustc/src/codegen-options/index.md#virtual-function-elimination)	2022-06-14 21:37:11 +00:00
b-naber	705d818bd5	implement valtrees as the type-system representation for constant values	2022-06-14 16:07:11 +02:00
flip1995	e1c1d0f8c2	Add llvm.type.checked.load intrinsic Add the intrinsic declare {i8, i1} @llvm.type.checked.load(i8 %ptr, i32 %offset, metadata %type) This is used in the VFE optimization when lowering loading functions from vtables to LLVM IR. The `metadata` is used to map the function to all vtables this function could belong to. This ensures that functions from vtables that might be used somewhere won't get removed.	2022-06-14 14:50:52 +02:00
Nicholas Nethercote	93e4b6ef06	Rename the `ConstS::val` field as `kind`. And likewise for the `Const::val` method. Because its type is called `ConstKind`. Also `val` is a confusing name because `ConstKind` is an enum with seven variants, one of which is called `Value`. Also, this gives consistency with `TyS` and `PredicateS` which have `kind` fields. The commit also renames a few `Const` variables from `val` to `c`, to avoid confusion with the `ConstKind::Value` variant.	2022-06-14 13:06:44 +10:00
Ralf Jung	d5a590f537	comment Co-authored-by: Oli Scherer <github35764891676564198441@oli-obk.de>	2022-06-02 11:12:12 -04:00
Ralf Jung	fafccdced3	add cast kind of from_exposed_addr (int-to-ptr casts)	2022-06-02 10:46:13 -04:00
Ralf Jung	4dc5d457d8	rename PointerAddress → PointerExposeAddress	2022-06-01 14:08:17 -04:00
Tomasz Miąsko	dff602fc18	Add a pointer to address cast kind Pointer to address casts are often special-cased. Introduce a dedicated cast kind to make them easily distinguishable.	2022-05-31 00:00:00 +00:00
Matthias Krüger	5fc8a8e227	clippy::complexity fixes clone_on_copy useless_format bind_instead_of_map filter_map_identity useless_conversion map_flatten unnecessary_unwrap	2022-05-26 13:14:24 +02:00
bors	99c4758747	Auto merge of #97369 - tmiasko:codgen-ssa-atomic-ordering, r=michaelwoerister rustc_codegen_ssa: cleanup `AtomicOrdering` * Remove unused `NotAtomic` ordering. * Rename `Monotonic` to `Relaxed` - a Rust specific name. * Derive copy and clone.	2022-05-26 02:00:17 +00:00
Tomasz Miąsko	f4c92cc4d1	rustc_codegen_ssa: cleanup `AtomicOrdering` * Remove unused `NotAtomic` ordering. * Rename `Monotonic` to `Relaxed` - a Rust specific name.	2022-05-25 10:34:35 +02:00
5225225	dd9f31d000	Add flag for stricter checks on uninit/zeroed	2022-05-24 14:26:52 +01:00
Jakob Degen	09b0936db2	Refactor call terminator to always hold a destination place	2022-05-23 17:49:04 -04:00
SparrowLii	38bf1158bd	Change `Successors` to `impl Iterator<Item = BasicBlock>`	2022-05-17 08:41:01 +08:00
Scott McMurray	89a18cb600	Add `unsigned_offset_from` on pointers Like we have `add`/`sub` which are the `usize` version of `offset`, this adds the `usize` equivalent of `offset_from`. Like how `.add(d)` replaced a whole bunch of `.offset(d as isize)`, you can see from the changes here that it's fairly common that code actually knows the order between the pointers and wants a `usize`, not an `isize`. As a bonus, this can do `sub nuw`+`udiv exact`, rather than `sub`+`sdiv exact`, which can be optimized slightly better because it doesn't have to worry about negatives. That's why the slice iterators weren't using `offset_from`, though I haven't updated that code in this PR because slices are so perf-critical that I'll do it as its own change. This is an intrinsic, like `offset_from`, so that it can eventually be allowed in CTFE. It also allows checking the extra safety condition -- see the test confirming that CTFE catches it if you pass the pointers in the wrong order.	2022-05-11 17:16:25 -07:00
Tomasz Miąsko	fa41852c7a	Use reverse postorder in `non_ssa_locals` The reverse postorder, unlike preorder, is now cached inside the MIR body. Code generation uses reverse postorder anyway, so it might be a small perf improvement to use it here as well.	2022-05-01 14:58:29 +02:00
bors	9a98c63b30	Auto merge of #96500 - SparrowLii:rpo, r=tmiasko Reduce duplication of RPO calculation of mir Computing the RPO of mir is not a low-cost thing, but it is duplicated in many places. In particular the `iterate_to_fixpoint` method which is called multiple times when computing the data flow. This PR reduces the number of times the RPO is recalculated as much as possible, which should save some compile time.	2022-04-30 05:06:47 +00:00
SparrowLii	7149bbcdc5	Eliminate duplication of RPO calculation for mir add `postorder_cache` to mir Body add `ReversePostorderCache` struct correct struct name and comments	2022-04-30 03:42:57 +08:00
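
The two RPO entries that close this listing (#96500 and 7149bbcdc5) describe computing the reverse postorder of a MIR body once and reusing it. A minimal sketch of that idea follows, not rustc's actual code: `Cfg`, `rpo_cache`, and `compute_rpo` are hypothetical names chosen for illustration, block 0 is assumed to be the entry block, and invalidating the cache when the graph is mutated is left out.

```rust
use std::cell::OnceCell;

/// Hypothetical control-flow graph: `successors[b]` lists the successors of block `b`.
struct Cfg {
    successors: Vec<Vec<usize>>,
    /// Lazily filled cache, so repeated queries do not redo the traversal.
    rpo_cache: OnceCell<Vec<usize>>,
}

impl Cfg {
    fn new(successors: Vec<Vec<usize>>) -> Self {
        Cfg { successors, rpo_cache: OnceCell::new() }
    }

    /// Returns the cached reverse postorder, computing it on first use only.
    fn reverse_postorder(&self) -> &[usize] {
        self.rpo_cache.get_or_init(|| compute_rpo(&self.successors, 0))
    }
}

/// Iterative depth-first search from `entry`; blocks are recorded when all
/// their successors are finished (postorder), then the list is reversed.
fn compute_rpo(successors: &[Vec<usize>], entry: usize) -> Vec<usize> {
    let mut visited = vec![false; successors.len()];
    let mut postorder = Vec::with_capacity(successors.len());
    // Each stack entry is (block, index of the next successor to look at).
    let mut stack = vec![(entry, 0usize)];
    visited[entry] = true;

    while let Some(top) = stack.last_mut() {
        let (block, next) = *top;
        if next < successors[block].len() {
            top.1 += 1;
            let succ = successors[block][next];
            if !visited[succ] {
                visited[succ] = true;
                stack.push((succ, 0));
            }
        } else {
            // All successors handled: the block is finished.
            postorder.push(block);
            stack.pop();
        }
    }

    postorder.reverse();
    postorder
}
```

Calling `reverse_postorder()` repeatedly on the same `Cfg` runs the traversal only on the first call; later calls return the stored slice. Blocks unreachable from the entry do not appear in the result.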

264 commits