user0/rust

Author	SHA1	Message	Date
Matthias Krüger	b8e230a824	Rollup merge of #134030 - folkertdev:min-fn-align, r=workingjubilee add `-Zmin-function-alignment` tracking issue: https://github.com/rust-lang/rust/issues/82232 This PR adds the `-Zmin-function-alignment=<align>` flag, that specifies a minimum alignment for all* functions. ### Motivation This feature is requested by RfL [here](https://github.com/rust-lang/rust/issues/128830): > i.e. the equivalents of `-fmin-function-alignment` ([GCC](https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html#index-fmin-function-alignment_003dn), Clang does not support it) / `-falign-functions` ([GCC](https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html#index-falign-functions), [Clang](https://clang.llvm.org/docs/ClangCommandLineReference.html#cmdoption-clang1-falign-functions)). > > For the Linux kernel, the behavior wanted is that of GCC's `-fmin-function-alignment` and Clang's `-falign-functions`, i.e. align all functions, including cold functions. > > There is [`feature(fn_align)`](https://github.com/rust-lang/rust/issues/82232), but we need to do it globally. ### Behavior The `fn_align` feature does not have an RFC. It was decided at the time that it would not be necessary, but maybe we feel differently about that now? In any case, here are the semantics of this flag: - `-Zmin-function-alignment=<align>` specifies the minimum alignment of all* functions - the `#[repr(align(<align>))]` attribute can be used to override the function alignment on a per-function basis: when `-Zmin-function-alignment` is specified, the attribute's value is only used when it is higher than the value passed to `-Zmin-function-alignment`. - the target may decide to use a higher value (e.g. on x86_64 the minimum that LLVM generates is 16) - The highest supported alignment in rust is `2^29`: I checked a bunch of targets, and they all emit the `.p2align 29` directive for targets that align functions at all (some GPU stuff does not have function alignment). 
*: Only with `build-std` would the minimum alignment also be applied to `std` functions. --- cc `@ojeda` r? `@workingjubilee` you were active on the tracking issue	2025-01-11 18:13:45 +01:00
Folkert de Vries	47573bf61e	add `-Zmin-function-alignment`	2025-01-10 22:53:54 +01:00
Oli Scherer	65b01cb182	Use llvm.memset.p0i8.* to initialize all same-bytes arrays	2025-01-10 15:22:06 +00:00
Oli Scherer	65ea9f3eb4	Pull element init into a reusable closure	2025-01-10 08:27:41 +00:00
Oli Scherer	7ad45f1d2f	Change repeat element check into a match	2025-01-10 08:27:41 +00:00
Ralf Jung	7291b1eaf7	rename typed_swap → typed_swap_nonoverlapping	2024-12-25 10:53:03 +01:00
Scott McMurray	5ba54c9e31	Delete `Rvalue::Len` Everything's moved to `PtrMetadata` instead.	2024-12-22 06:12:39 -08:00
Ralf Jung	e023590de4	make no-variant types a dedicated Variants variant	2024-12-18 11:01:54 +01:00
Ralf Jung	21de42bf8d	Variants::Single: do not use invalid VariantIdx for uninhabited enums	2024-12-18 11:00:21 +01:00
Nicholas Nethercote	2620eb42d7	Re-export more `rustc_span::symbol` things from `rustc_span`. `rustc_span::symbol` defines some things that are re-exported from `rustc_span`, such as `Symbol` and `sym`. But it doesn't re-export some closely related things such as `Ident` and `kw`. So you can do `use rustc_span::{Symbol, sym}` but you have to do `use rustc_span::symbol::{Ident, kw}`, which is inconsistent for no good reason. This commit re-exports `Ident`, `kw`, and `MacroRulesNormalizedIdent`, and changes many `rustc_span::symbol::` qualifiers in `compiler/` to `rustc_span::`. This is a 200+ net line of code reduction, mostly because many files with two `use rustc_span` items can be reduced to one.	2024-12-18 13:38:53 +11:00
Jonathan Dönszelmann	efb98b6552	rename rustc_attr to rustc_attr_parsing and create rustc_attr_data_structures	2024-12-16 19:08:19 +01:00
DianQK	3fc506b4d4	Simplify the GEP instruction for index	2024-12-15 19:01:45 +08:00
Folkert de Vries	9aabef1c28	emit `.weak_definition` instead of `.weak` on macos	2024-12-10 21:41:05 +01:00
Folkert	bd8f8e0631	codegen `#[naked]` functions using `global_asm!`	2024-12-10 21:41:03 +01:00
Ben Kimock	711c8cc690	Remove polymorphization	2024-12-06 16:42:09 -05:00
The 8472	26d7b5da99	use stores of the correct size to set discriminants	2024-11-30 18:33:08 +01:00
Ding Xiang Fei	297b618944	reduce false positives of tail-expr-drop-order from consumed values take 2 open up coroutines tweak the wordings the lint works up until 2021 We were missing one case, for ADTs, which was causing `Result` to yield incorrect results. only include field spans with significant types deduplicate and eliminate field spans switch to emit spans to impl Drops Co-authored-by: Niko Matsakis <nikomat@amazon.com> collect drops instead of taking liveness diff apply some suggestions and add explanatory notes small fix on the cache let the query recurse through coroutine new suggestion format with extracted variable name fine-tune the drop span and messages bugfix on runtime borrows tweak message wording filter out ecosystem types earlier apply suggestions clippy check lint level at session level further restrict applicability of the lint translate bid into nop for stable mir detect cycle in type structure	2024-11-20 20:53:11 +08:00
bors	70e814bd9e	Auto merge of #133212 - lcnr:questionable-uwu, r=compiler-errors continue `ParamEnv` to `TypingEnv` transition cc #132279 r? `@compiler-errors`	2024-11-20 06:22:01 +00:00
lcnr	7a90e84f4d	`InterpCx` store `TypingEnv` instead of a `ParamEnv`	2024-11-19 21:36:23 +01:00
Kyle Huey	f5b023bd9c	When the required discriminator value exceeds LLVM's limits, drop the debug info for the function instead of panicking. The maximum discriminator value LLVM can currently encode is 2^12. If macro use results in more than 2^12 calls to the same function attributed to the same callsite, and those calls are MIR-inlined, we will require more than the maximum discriminator value to completely represent the debug information. Once we reach that point drop the debug info instead.	2024-11-19 05:19:09 -08:00
lcnr	9cba14b95b	use `TypingEnv` when no `infcx` is available the behavior of the type system not only depends on the current assumptions, but also the current phase of the compiler. This is mostly necessary as we need to decide whether and how to reveal opaque types. We track this via the `TypingMode`.	2024-11-18 10:38:56 +01:00
Jiri Bobek	777003ae9f	Likely unlikely fix	2024-11-17 21:49:10 +01:00
Jubilee Young	b895bf4fdc	compiler: Directly use rustc_abi in codegen	2024-11-03 12:30:32 -08:00
Jubilee Young	7086dd83cc	compiler: `rustc_abi::Abi` => `BackendRepr` The initial naming of "Abi" was an awful mistake, conveying wrong ideas about how psABIs worked and even more about what the enum meant. It was only meant to represent the way the value would be described to a codegen backend as it was lowered to that intermediate representation. It was never meant to mean anything about the actual psABI handling! The conflation is because LLVM typically will associate a certain form with a certain ABI, but even that does not hold when the special cases that actually exist arise, plus the IR annotations that modify the ABI. Reframe `rustc_abi::Abi` as the `BackendRepr` of the type, and rename `BackendRepr::Aggregate` as `BackendRepr::Memory`. Unfortunately, due to the persistent misunderstandings, this too is now incorrect: - Scattered ABI-relevant code is entangled with BackendRepr - We do not always pre-compute a correct BackendRepr that reflects how we "actually" want this value to be handled, so we leave the backend interface to also inject various special-cases here - In some cases `BackendRepr::Memory` is a "real" aggregate, but in others it is in fact using memory, and in some cases it is a scalar! Our rustc-to-backend lowering code handles this sort of thing right now. That will eventually be addressed by lifting duplicated lowering code to either rustc_codegen_ssa or rustc_target as appropriate.	2024-10-29 14:56:00 -07:00
Jubilee Young	88a9edc091	compiler: Add `is_uninhabited` and use LayoutS accessors This reduces the need of the compiler to peek on the fields of LayoutS.	2024-10-28 09:58:30 -07:00
Ralf Jung	c3e928d8dd	stabilize Strict Provenance and Exposed Provenance This comes with a big docs rewrite.	2024-10-21 15:05:35 +01:00
Jonathan Dönszelmann	0a9c87b1f5	rename RcBox in other places too	2024-10-11 10:04:22 +02:00
Jubilee Young	839cf1c1a4	compiler: Factor rustc_target::abi out of cg_ssa	2024-10-08 18:24:56 -07:00
Folkert de Vries	5fc60d1e52	various fixes for `naked_asm!` implementation - fix for divergence - fix error message - fix another cranelift test - fix some cranelift things - don't set the NORETURN option for naked asm - fix use of naked_asm! in doc comment - fix use of naked_asm! in run-make test - use `span_bug` in unreachable branch	2024-10-06 19:00:09 +02:00
Urgau	018ba0528f	Use wide pointers consistently across the compiler	2024-10-04 14:06:48 +02:00
bors	76ed7a1fa4	Auto merge of #130329 - khuey:reorder-constant-spills, r=davidtwco Reorder stack spills so that constants come later. Currently constants are "pulled forward" and have their stack spills emitted first. This confuses LLVM as to where to place breakpoints at function entry, and results in argument values being wrong in the debugger. It's straightforward to avoid emitting the stack spills for constants until arguments/etc have been introduced in debug_introduce_locals, so do that. Example LLVM IR (irrelevant IR elided): Before: ``` define internal void @_ZN11rust_1289457binding17h2c78f956ba4bd2c3E(i64 %a, i64 %b, double %c) unnamed_addr #0 !dbg !178 { start: %c.dbg.spill = alloca [8 x i8], align 8 %b.dbg.spill = alloca [8 x i8], align 8 %a.dbg.spill = alloca [8 x i8], align 8 %x.dbg.spill = alloca [4 x i8], align 4 store i32 0, ptr %x.dbg.spill, align 4, !dbg !192 ; LLVM places breakpoint here. #dbg_declare(ptr %x.dbg.spill, !190, !DIExpression(), !192) store i64 %a, ptr %a.dbg.spill, align 8 #dbg_declare(ptr %a.dbg.spill, !187, !DIExpression(), !193) store i64 %b, ptr %b.dbg.spill, align 8 #dbg_declare(ptr %b.dbg.spill, !188, !DIExpression(), !194) store double %c, ptr %c.dbg.spill, align 8 #dbg_declare(ptr %c.dbg.spill, !189, !DIExpression(), !195) ret void, !dbg !196 } ``` After: ``` define internal void @_ZN11rust_1289457binding17h2c78f956ba4bd2c3E(i64 %a, i64 %b, double %c) unnamed_addr #0 !dbg !178 { start: %x.dbg.spill = alloca [4 x i8], align 4 %c.dbg.spill = alloca [8 x i8], align 8 %b.dbg.spill = alloca [8 x i8], align 8 %a.dbg.spill = alloca [8 x i8], align 8 store i64 %a, ptr %a.dbg.spill, align 8 #dbg_declare(ptr %a.dbg.spill, !187, !DIExpression(), !192) store i64 %b, ptr %b.dbg.spill, align 8 #dbg_declare(ptr %b.dbg.spill, !188, !DIExpression(), !193) store double %c, ptr %c.dbg.spill, align 8 #dbg_declare(ptr %c.dbg.spill, !189, !DIExpression(), !194) store i32 0, ptr %x.dbg.spill, align 4, !dbg !195 ; LLVM places breakpoint here. 
#dbg_declare(ptr %x.dbg.spill, !190, !DIExpression(), !195) ret void, !dbg !196 } ``` Note in particular the position of the "LLVM places breakpoint here" comment relative to the stack spills for the function arguments. LLVM assumes that the first instruction with a debug location is the end of the prologue. As LLVM does not currently offer front ends any direct control over the placement of the prologue end, reordering the IR is the only mechanism available to fix argument values at function entry in the presence of MIR optimizations like SingleUseConsts. Fixes #128945 r? `@michaelwoerister`	2024-09-26 02:37:52 +00:00
Matthias Krüger	0e439090cb	Rollup merge of #130734 - Luv-Ray:fix_vfe, r=lcnr Fix: ices on virtual-function-elimination about principal trait Extract `load_vtable` function to ensure the `virtual_function_elimination` option is always checked. It's okay not to use `llvm.type.checked.load` to load the vtable if there is no principal trait. Fixes #123955 Fixes #124092	2024-09-25 10:09:23 +02:00
Lukas Markeffsky	bd31e3ed70	be even more precise about "cast" vs "coercion"	2024-09-24 23:12:02 +02:00
Lukas Markeffsky	46ecb23198	unify dyn* coercions with other pointer coercions	2024-09-24 22:17:55 +02:00
Luv-Ray	16093faea8	fix ices on vfe about principal trait	2024-09-23 15:25:52 +08:00
Michael Goulet	c682aa162b	Reformat using the new identifier sorting from rustfmt	2024-09-22 19:11:29 -04:00
bors	2836482241	Auto merge of #129283 - saethlin:unreachable-allocas, r=scottmcm Don't alloca for unused locals We already have a concept of mono-unreachable basic blocks; this is primarily useful for ensuring that we do not compile code under an `if false`. But since we never gave locals the same analysis, a large local only used under an `if false` will still have stack space allocated for it. There are 3 places we traverse MIR during monomorphization: Inside the collector, `non_ssa_locals`, and the walk to generate code. Unfortunately, https://github.com/rust-lang/rust/pull/129283#issuecomment-2297925578 indicates that we cannot afford the expense of tracking reachable locals during the collector's traversal, so we do need at least two mono-reachable traversals. And of course caching is of no help here because the benchmarks that regress are incr-unchanged; they don't do any codegen. This fixes the second problem in https://github.com/rust-lang/rust/issues/129282, and brings us another step toward `const if` at home.	2024-09-21 13:48:14 +00:00
Ben Kimock	523f8f8398	Compute reachable locals as part of non_ssa_locals	2024-09-21 01:07:00 -04:00
Ben Kimock	0ea5dc506f	Don't alloca for unused locals	2024-09-21 01:06:59 -04:00
Michael Goulet	914193c8f4	Do not unnecessarily eval consts in codegen	2024-09-20 20:38:11 -04:00
Matthias Krüger	21313d7947	Rollup merge of #130457 - nnethercote:cleanup-codegen-traits, r=bjorn3 Cleanup codegen traits The traits governing codegen are quite complicated and hard to follow. This PR cleans them up a bit. r? `@bjorn3`	2024-09-18 17:49:43 +02:00
Kyle Huey	652b502d9c	Reorder stack spills so that constants come later. Currently constants are "pulled forward" and have their stack spills emitted first. This confuses LLVM as to where to place breakpoints at function entry, and results in argument values being wrong in the debugger. It's straightforward to avoid emitting the stack spills for constants until arguments/etc have been introduced in debug_introduce_locals, so do that. Example LLVM IR (irrelevant IR elided): Before: define internal void @_ZN11rust_1289457binding17h2c78f956ba4bd2c3E(i64 %a, i64 %b, double %c) unnamed_addr #0 !dbg !178 { start: %c.dbg.spill = alloca [8 x i8], align 8 %b.dbg.spill = alloca [8 x i8], align 8 %a.dbg.spill = alloca [8 x i8], align 8 %x.dbg.spill = alloca [4 x i8], align 4 store i32 0, ptr %x.dbg.spill, align 4, !dbg !192 ; LLVM places breakpoint here. #dbg_declare(ptr %x.dbg.spill, !190, !DIExpression(), !192) store i64 %a, ptr %a.dbg.spill, align 8 #dbg_declare(ptr %a.dbg.spill, !187, !DIExpression(), !193) store i64 %b, ptr %b.dbg.spill, align 8 #dbg_declare(ptr %b.dbg.spill, !188, !DIExpression(), !194) store double %c, ptr %c.dbg.spill, align 8 #dbg_declare(ptr %c.dbg.spill, !189, !DIExpression(), !195) ret void, !dbg !196 } After: define internal void @_ZN11rust_1289457binding17h2c78f956ba4bd2c3E(i64 %a, i64 %b, double %c) unnamed_addr #0 !dbg !178 { start: %x.dbg.spill = alloca [4 x i8], align 4 %c.dbg.spill = alloca [8 x i8], align 8 %b.dbg.spill = alloca [8 x i8], align 8 %a.dbg.spill = alloca [8 x i8], align 8 store i64 %a, ptr %a.dbg.spill, align 8 #dbg_declare(ptr %a.dbg.spill, !187, !DIExpression(), !192) store i64 %b, ptr %b.dbg.spill, align 8 #dbg_declare(ptr %b.dbg.spill, !188, !DIExpression(), !193) store double %c, ptr %c.dbg.spill, align 8 #dbg_declare(ptr %c.dbg.spill, !189, !DIExpression(), !194) store i32 0, ptr %x.dbg.spill, align 4, !dbg !195 ; LLVM places breakpoint here. 
#dbg_declare(ptr %x.dbg.spill, !190, !DIExpression(), !195) ret void, !dbg !196 } Note in particular the position of the "LLVM places breakpoint here" comment relative to the stack spills for the function arguments. LLVM assumes that the first instruction with a debug location is the end of the prologue. As LLVM does not currently offer front ends any direct control over the placement of the prologue end, reordering the IR is the only mechanism available to fix argument values at function entry in the presence of MIR optimizations like SingleUseConsts. Fixes #128945	2024-09-17 16:45:26 -07:00
Nicholas Nethercote	c629538dad	Merge some impl blocks.	2024-09-17 16:24:35 +10:00
Nicholas Nethercote	3ec2f121cc	Rename some lifetimes. `'mir` is not a good lifetime name in `LocalAnalyzer`, because it's used on two unrelated fields. `'a` is more standard for a situation like this (e.g. #130022).	2024-09-17 16:24:35 +10:00
Nicholas Nethercote	cd3da000c0	Clean up formatting. Reflow overly long comments, plus some minor whitespace improvements.	2024-09-17 16:24:35 +10:00
Nicholas Nethercote	bdacdfe95f	Minimize visibilities. This makes it much clearer which things are used outside the crate.	2024-09-17 16:24:33 +10:00
Nicholas Nethercote	a8d22eb39e	Rename supertraits of `CodegenMethods`. Supertraits of `BuilderMethods` are all called `XyzBuilderMethods`. Supertraits of `CodegenMethods` are all called `XyzMethods`. This commit changes the latter to `XyzCodegenMethods`, for consistency.	2024-09-17 10:24:43 +10:00
Nicholas Nethercote	540fcc617a	Move some supertraits outward. Specifically, put them where they are genuinely required, i.e. the outermost place they can be.	2024-09-17 10:24:43 +10:00
Ralf Jung	60ee1b7ac6	simd_shuffle: require index argument to be a vector	2024-09-14 14:43:24 +02:00
Kyle Huey	7ed9f945a2	Don't leave debug locations for constants sitting on the builder indefinitely. Because constants are currently emitted before the prologue, leaving the debug location on the IRBuilder spills onto other instructions in the prologue and messes up both line numbers as well as the point LLVM chooses to be the prologue end. Example LLVM IR (irrelevant IR elided): Before: define internal { i64, i64 } @_ZN3tmp3Foo18var_return_opt_try17he02116165b0fc08cE(ptr align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8, !dbg !357 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) After: define internal { i64, i64 } @_ZN3tmp3Foo18var_return_opt_try17h00b17d08874ddd90E(ptr align 8 %self) !dbg !347 { start: %self.dbg.spill = alloca [8 x i8], align 8 %_0 = alloca [16 x i8], align 8 %residual.dbg.spill = alloca [0 x i8], align 1 #dbg_declare(ptr %residual.dbg.spill, !353, !DIExpression(), !357) store ptr %self, ptr %self.dbg.spill, align 8 #dbg_declare(ptr %self.dbg.spill, !350, !DIExpression(), !358) Note in particular how !357 from %residual.dbg.spill's dbg_declare no longer falls through onto the store to %self.dbg.spill. This fixes argument values at entry when the constant is a ZST (e.g. <Option as Try>::Residual). This fixes #130003 (but note that it does not fix issues with argument values and non-ZST constants, which emit their own stores that have debug info on them, like #128945).	2024-09-06 23:12:18 +00:00
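
The alignment rule spelled out in the `-Zmin-function-alignment` rollup entry (b8e230a824) can be sketched as a small model. This is only an illustration of the semantics listed in that commit message, not rustc's implementation: the function `effective_fn_align`, its parameters, and the `target_floor` input are hypothetical names introduced here, and the power-of-two check is an assumption based on how alignments normally work rather than something the message states.

```rust
/// Highest alignment the commit message says Rust supports (2^29).
const MAX_FN_ALIGN: u64 = 1 << 29;

/// Hypothetical model of the rule described in the commit message.
///
/// * `flag`: value passed to `-Zmin-function-alignment`, if any.
/// * `attr`: value of a per-function `#[repr(align(..))]`, if any.
/// * `target_floor`: alignment the target would pick anyway
///   (an assumed input; the message cites 16 on x86_64 as an example).
fn effective_fn_align(
    flag: Option<u64>,
    attr: Option<u64>,
    target_floor: u64,
) -> Result<u64, String> {
    // Assumption: requested alignments are powers of two, at most 2^29.
    for a in flag.iter().chain(attr.iter()) {
        if !a.is_power_of_two() || *a > MAX_FN_ALIGN {
            return Err(format!("invalid alignment: {a}"));
        }
    }
    // The attribute only matters when it exceeds the flag, so the
    // requested alignment is simply the larger of the two.
    let requested = flag.into_iter().chain(attr).max().unwrap_or(1);
    // The target may still raise the result above what was requested.
    Ok(requested.max(target_floor))
}
```

Under this model, a flag value of 16 combined with an attribute of 8 yields 16, while an attribute of 64 would win over the same flag.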

795 commits