user0/rust

Author	SHA1	Message	Date
Stuart Cook	1c892e829c	Rollup merge of #147436 - okaneco:eq_ignore_ascii_autovec, r=scottmcm slice/ascii: Optimize `eq_ignore_ascii_case` with auto-vectorization - Refactor the current functionality into a helper function - Use `as_chunks` to encourage auto-vectorization in the optimized chunk processing function - Add a codegen test checking for vectorization and no panicking - Add benches for `eq_ignore_ascii_case` --- The optimized function is initially only enabled for x86_64 which has `sse2` as part of its baseline, but none of the code is platform specific. Other platforms with SIMD instructions may also benefit from this implementation. Performance improvements only manifest for slices of 16 bytes or longer, so the optimized path is gated behind a length check for greater than or equal to 16. Benchmarks - Cases below 16 bytes are unaffected, cases above all show sizeable improvements. ``` before: str::eq_ignore_ascii_case::bench_large_str_eq 4942.30ns/iter +/- 48.20 str::eq_ignore_ascii_case::bench_medium_str_eq 632.01ns/iter +/- 16.87 str::eq_ignore_ascii_case::bench_str_17_bytes_eq 16.28ns/iter +/- 0.45 str::eq_ignore_ascii_case::bench_str_31_bytes_eq 35.23ns/iter +/- 2.28 str::eq_ignore_ascii_case::bench_str_of_8_bytes_eq 7.56ns/iter +/- 0.22 str::eq_ignore_ascii_case::bench_str_under_8_bytes_eq 2.64ns/iter +/- 0.06 after: str::eq_ignore_ascii_case::bench_large_str_eq 611.63ns/iter +/- 28.29 str::eq_ignore_ascii_case::bench_medium_str_eq 77.10ns/iter +/- 19.76 str::eq_ignore_ascii_case::bench_str_17_bytes_eq 3.49ns/iter +/- 0.39 str::eq_ignore_ascii_case::bench_str_31_bytes_eq 3.50ns/iter +/- 0.27 str::eq_ignore_ascii_case::bench_str_of_8_bytes_eq 7.27ns/iter +/- 0.09 str::eq_ignore_ascii_case::bench_str_under_8_bytes_eq 2.60ns/iter +/- 0.05 ```	2026-01-27 17:36:35 +11:00
Stuart Cook	2f8f4acbd6	Rollup merge of #151700 - lblasc:os-allow-missing-docs, r=tgross35 os allow missing_docs Resolves rustc build failure. Discovered in https://github.com/NixOS/nixpkgs/pull/470993 ``` rustc> Documenting core v0.0.0 (/nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src/library/core) rustc> error: missing documentation for a module rustc> --> library/core/src/os/mod.rs:13:1 rustc> \| rustc> 13 \| pub mod darwin {} rustc> \| ^^^^^^^^^^^^^^ rustc> \| rustc> = note: `-D missing-docs` implied by `-D warnings` rustc> = help: to override `-D warnings` add `#[allow(missing_docs)]` rustc> rustc> Checking compiler_builtins v0.1.160 (/nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src/library/compiler-builtins/compiler-builtins) rustc> error: could not document `core` rustc> warning: build failed, waiting for other jobs to finish... rustc> Command `/nix/store/h499wcc6pl9whxa2kznjm76wy4f3lcm0-cargo-bootstrap-1.92.0/bin/cargo doc --target wasm32-unknown-unknown -Zbinary-dep-depinfo -j 10 -Zroot-dir=/nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src --frozen --release -p alloc -p compiler_builtins -p core -p panic_abort -p panic_unwind -p proc_macro -p rustc-std-workspace-core -p std -p std_detect -p sysroot -p test -p unwind --features 'backtrace panic-unwind' --manifest-path /nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src/library/sysroot/Cargo.toml --no-deps --target-dir /nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src/build/aarch64-apple-darwin/stage1-std/wasm32-unknown-unknown/doc -Zskip-rustdoc-fingerprint -Zrustdoc-map [workdir=/nix/var/nix/builds/nix-78118-1377149852/rustc-1.92.0-src]` failed with exit code 101 rustc> Created at: src/bootstrap/src/core/build_steps/doc.rs:781:21 rustc> Executed at: src/bootstrap/src/core/build_steps/doc.rs:814:22 rustc> rustc> Command has failed. Rerun with -v to see more details. ```	2026-01-27 12:50:54 +11:00
Stuart Cook	bf01ad8916	Rollup merge of #151669 - quaternic:rename-gather-scatter-bits, r=scottmcm rename uN::{gather,scatter}_bits to uN::{extract,deposit}_bits Feature gate: `#![feature(uint_gather_scatter_bits)]` Tracking issue: https://github.com/rust-lang/rust/issues/149069 Rename the methods as requested in https://github.com/rust-lang/rust/issues/149069#issuecomment-3633691777 - `gather_bits` -> `extract_bits` - `scatter_bits` -> `deposit_bits`	2026-01-27 12:50:53 +11:00
Stuart Cook	af523529be	Rollup merge of #151529 - tgross35:lint-apfloat, r=nnethercote lint: Use rustc_apfloat for `overflowing_literals`, add f16 and f128 Switch to parsing float literals for overflow checks using `rustc_apfloat` rather than host floats. This avoids small variations in platform support and makes it possible to start checking `f16` and `f128` as well. Using APFloat matches what we try to do elsewhere to avoid platform inconsistencies.	2026-01-27 12:50:52 +11:00
Stuart Cook	956ebbde20	Rollup merge of #151383 - cyrgani:no-internal-deprecation, r=scottmcm remove `#[deprecated]` from unstable & internal `SipHasher13` and `24` types These types are unstable and `doc(hidden)` (under the internal feature `hashmap_internals`). Deprecating them only adds noise (`#[allow(deprecated)]`) to all places where they are used, so this PR removes the deprecation attributes from them. It also includes a few other small cleanups in separate commits, including one I overlooked in rust-lang/rust#151228.	2026-01-27 12:50:52 +11:00
Stuart Cook	92e8bf864b	Rollup merge of #151680 - ChrisDenton:bindgen, r=tgross35 Update backtrace and windows-bindgen Supersedes the backtrace bump in rust-lang/rust#151659 This is mostly just renaming `windows_targets` to `windows_link` but it needs to be done in tandem with the backtrace submodule update. The reason for doing this is that backtrace is both copy/pasted into std (via being a submodule) and published as an independent crate.	2026-01-27 12:50:51 +11:00
Trevor Gross	9b15010686	lint: Use rustc_apfloat for `overflowing_literals`, add f16 and f128 Switch to parsing float literals for overflow checks using `rustc_apfloat` rather than host floats. This avoids small variations in platform support and makes it possible to start checking `f16` and `f128` as well. Using APFloat matches what we try to do elsewhere to avoid platform inconsistencies.	2026-01-26 18:25:42 -06:00
Jonathan Brouwer	9ad4ae88cf	Rollup merge of #151661 - estebank:issue-68095, r=mati865 Suggest changing `iter`/`into_iter` when the other was meant When encountering a call to `iter` that should have been `into_iter` and vice-versa, provide a structured suggestion: ``` error[E0271]: type mismatch resolving `<IntoIter<{integer}, 3> as IntoIterator>::Item == &{integer}` --> $DIR/into_iter-when-iter-was-intended.rs:5:37 \| LL \| let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); \| ----- ^^^^^^^^^^^^^^^^^^^^^ expected `&{integer}`, found integer \| \| \| required by a bound introduced by this call \| note: the method call chain might not have had the expected associated types --> $DIR/into_iter-when-iter-was-intended.rs:5:47 \| LL \| let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); \| --------- ^^^^^^^^^^^ `IntoIterator::Item` is `{integer}` here \| \| \| this expression has type `[{integer}; 3]` note: required by a bound in `std::iter::Iterator::chain` --> $SRC_DIR/core/src/iter/traits/iterator.rs:LL:COL help: consider not consuming the `[{integer}, 3]` to construct the `Iterator` \| LL - let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); LL + let _a = [0, 1, 2].iter().chain([3, 4, 5].iter()); \| ``` Finish addressing the original case in rust-lang/rust#68095. Only the case of chaining a `Vec` or `[]` is left unhandled.	2026-01-26 18:19:17 +01:00
Luka Blašković	d541277ce1	os allow missing_docs	2026-01-26 17:08:00 +00:00
Chris Denton	aaeb550f6f	Update windows bindings in std	2026-01-26 10:59:16 +00:00
Chris Denton	8a32fcee2f	Update backtrace	2026-01-26 10:57:35 +00:00
bors	db6bc0f6a4	Auto merge of #151674 - Zalathar:rollup-pNhrXnP, r=Zalathar Rollup of 2 pull requests Successful merges: - rust-lang/rust#151612 (Update documentation for `cold_path`, `likely`, and `unlikely`) - rust-lang/rust#151670 (compiletest: Parse aux `proc-macro` directive into struct)	2026-01-26 09:25:45 +00:00
Stuart Cook	443c5b0742	Rollup merge of #151612 - tgross35:cold-path-doc, r=scottmcm Update documentation for `cold_path`, `likely`, and `unlikely` * Add a note recommending benchmarks to `cold_path`, as other hints have * Note that `cold_path` can be used to implement `likely` and `unlikely` * Update the tracking issue for the `likely_unlikely` feature Tracking issue: https://github.com/rust-lang/rust/issues/136873 Tracking issue: https://github.com/rust-lang/rust/issues/151619	2026-01-26 19:52:41 +11:00
Juho Kahala	29596f87be	rename uN::{gather,scatter}_bits to uN::{extract,deposit}_bits	2026-01-26 06:28:42 +02:00
Stuart Cook	ef849a2c7d	Rollup merge of #150705 - justanotheranonymoususer:patch-1, r=joboet Add missing mut to pin.rs docs Per my understanding, needed for mut access next line.	2026-01-26 14:36:21 +11:00
Stuart Cook	a6e8a31b86	Rollup merge of #151611 - bonega:improve-is-slice-is-ascii-performance, r=folkertdev Improve is_ascii performance on x86_64 with explicit SSE2 intrinsics # Summary Improves `slice::is_ascii` performance for SSE2 target roughly 1.5-2x on larger inputs. AVX-512 keeps similar performance characteristics. This is building on the work already merged in rust-lang/rust#151259. In particular this PR improves the default SSE2 performance, I don't consider this a temporary fix anymore. Thanks to @folkertdev for pointing me to consider `as_chunk` again. # The implementation: - Uses 64-byte chunks with 4x 16-byte SSE2 loads OR'd together - Extracts the MSB mask with a single `pmovmskb` instruction - Falls back to usize-at-a-time SWAR for inputs < 64 bytes # Performance impact (vs before rust-lang/rust#151259): - AVX-512: 34-48x faster - SSE2: 1.5-2x faster <details> <summary>Benchmark Results (click to expand)</summary> Benchmarked on AMD Ryzen 9 9950X (AVX-512 capable). Values show relative performance (1.00 = fastest). Tops out at 139GB/s for large inputs. 
### early_non_ascii \| Input Size \| new_avx512 \| new_sse2 \| old_avx512 \| old_sse2 \| \|------------\|------------\|----------\|------------\|----------\| \| 64 \| 1.01 \| 1.00 \| 13.45 \| 1.13 \| \| 1024 \| 1.01 \| 1.00 \| 13.53 \| 1.14 \| \| 65536 \| 1.01 \| 1.00 \| 13.99 \| 1.12 \| \| 1048576 \| 1.02 \| 1.00 \| 13.29 \| 1.12 \| ### late_non_ascii \| Input Size \| new_avx512 \| new_sse2 \| old_avx512 \| old_sse2 \| \|------------\|------------\|----------\|------------\|----------\| \| 64 \| 1.00 \| 1.01 \| 13.37 \| 1.13 \| \| 1024 \| 1.10 \| 1.00 \| 42.42 \| 1.95 \| \| 65536 \| 1.00 \| 1.06 \| 42.22 \| 1.73 \| \| 1048576 \| 1.00 \| 1.03 \| 34.73 \| 1.46 \| ### pure_ascii \| Input Size \| new_avx512 \| new_sse2 \| old_avx512 \| old_sse2 \| \|------------\|------------\|----------\|------------\|----------\| \| 4 \| 1.03 \| 1.00 \| 1.75 \| 1.32 \| \| 8 \| 1.00 \| 1.14 \| 3.89 \| 2.06 \| \| 16 \| 1.00 \| 1.04 \| 1.13 \| 1.62 \| \| 32 \| 1.07 \| 1.19 \| 5.11 \| 1.00 \| \| 64 \| 1.00 \| 1.13 \| 13.32 \| 1.57 \| \| 128 \| 1.00 \| 1.01 \| 19.97 \| 1.55 \| \| 256 \| 1.00 \| 1.02 \| 27.77 \| 1.61 \| \| 1024 \| 1.00 \| 1.02 \| 41.34 \| 1.84 \| \| 4096 \| 1.02 \| 1.00 \| 45.61 \| 1.98 \| \| 16384 \| 1.01 \| 1.00 \| 48.67 \| 2.04 \| \| 65536 \| 1.00 \| 1.03 \| 43.86 \| 1.77 \| \| 262144 \| 1.00 \| 1.06 \| 41.44 \| 1.79 \| \| 1048576 \| 1.02 \| 1.00 \| 35.36 \| 1.44 \| </details> ## Reproduction / Test Projects Standalone validation tools: https://github.com/bonega/is-ascii-fix-validation - `bench/` - Criterion benchmarks for SSE2 vs AVX-512 comparison - `fuzz/` - Compares old/new implementations with libfuzzer Relates to: https://github.com/llvm/llvm-project/issues/176906	2026-01-26 14:36:21 +11:00
Trevor Gross	6c0ae93222	hint: Update the tracking issue for `likely_unlikely` These were split from the `cold_path` tracking issue.	2026-01-25 20:34:12 -06:00
Trevor Gross	a694b502d4	hint: Document that `cold_path` can be used to implement `(un)likely`	2026-01-25 20:34:12 -06:00
Esteban Küber	2b32446c7c	Suggest changing `iter`/`into_iter` when the other was meant When encountering a call to `iter` that should have been `into_iter` and vice-versa, provide a structured suggestion: ``` error[E0271]: type mismatch resolving `<IntoIter<{integer}, 3> as IntoIterator>::Item == &{integer}` --> $DIR/into_iter-when-iter-was-intended.rs:5:37 \| LL \| let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); \| ----- ^^^^^^^^^^^^^^^^^^^^^ expected `&{integer}`, found integer \| \| \| required by a bound introduced by this call \| note: the method call chain might not have had the expected associated types --> $DIR/into_iter-when-iter-was-intended.rs:5:47 \| LL \| let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); \| --------- ^^^^^^^^^^^ `IntoIterator::Item` is `{integer}` here \| \| \| this expression has type `[{integer}; 3]` note: required by a bound in `std::iter::Iterator::chain` --> $SRC_DIR/core/src/iter/traits/iterator.rs:LL:COL help: consider not consuming the `[{integer}, 3]` to construct the `Iterator` \| LL - let _a = [0, 1, 2].iter().chain([3, 4, 5].into_iter()); LL + let _a = [0, 1, 2].iter().chain([3, 4, 5].iter()); \| ```	2026-01-25 23:12:05 +00:00
bors	873d4682c7	Auto merge of #151337 - the8472:bail-before-memcpy2, r=Mark-Simulacrum optimize `vec.extend(slice.to_vec())`, take 2 Redoing https://github.com/rust-lang/rust/pull/130998 It was reverted in https://github.com/rust-lang/rust/pull/151150 due to flakiness. I have traced this to layout randomization perturbing the test (the failure reproduces locally with layout randomization), which is now excluded.	2026-01-25 19:45:35 +00:00
Andreas Liljeqvist	dbc870afec	Mark is_ascii_sse2 as #[inline]	2026-01-25 20:05:08 +01:00
Matthias Krüger	d83f2ebea5	Rollup merge of #151620 - vinDelphini:fix-typo-library-core, r=joboet Fix 'the the' typo in library/core/src/array/iter.rs This PR fixes a small grammatical error in a safety comment within `library/core/src/array/iter.rs` where the word "the" was duplicated. No functional changes.	2026-01-25 07:43:01 +01:00
Matthias Krüger	996992eced	Rollup merge of #151505 - bjorn3:proc_macro_refactors, r=petrochenkov,Kobzol Various refactors to the proc_macro bridge This reduces the amount of types, traits and other abstractions that are involved with the bridge, which should make it easier to understand and modify. This should also help a bit with getting rid of the type marking hack, which is complicating the code a fair bit. Fixes: rust-lang/rust#139810	2026-01-25 07:43:00 +01:00
Matthias Krüger	38504731be	Rollup merge of #150842 - PaulDance:patches/fix-win7-sleep, r=Mark-Simulacrum Fix(lib/win/thread): Ensure `Sleep`'s usage passes over the requested duration under Win7 Fixes rust-lang/rust#149935. See the added comment for more details. This makes the concerned test now reproducibly pass, for us at least. Also, testing this separately was successful: see the issue. @rustbot label C-bug I-flaky-test O-windows-7 T-libs A-time A-thread	2026-01-25 07:42:59 +01:00
Matthias Krüger	cc666ba8f4	Rollup merge of #149869 - joboet:torn-dbg, r=Mark-Simulacrum std: avoid tearing `dbg!` prints Fixes https://github.com/rust-lang/rust/issues/136703. This is an alternative to rust-lang/rust#149859. Instead of formatting everything into a string, this PR makes multi-expression `dbg!` expand into multiple nested matches, with the final match containing a single `eprint!`. By using macro recursion and relying on hygiene, this allows naming every bound value in that `eprint!`. CC @orlp r? libs	2026-01-25 07:42:58 +01:00
Matthias Krüger	2da5959600	Rollup merge of #148764 - GrigorenkoPV:aligment_api, r=scottmcm ptr_aligment_type: add more APIs As per https://github.com/rust-lang/rust/issues/102070#issuecomment-1650043557 Tracking issue: rust-lang/rust#102070 Mostly duplicating methods that previously worked with `usize`-represented alignments. Naming follows a convention of `align: usize`, `alignment: Alignment`.	2026-01-25 07:42:57 +01:00
vinDelphini	a06fdc1806	Fix 'the the' typo in library/core/src/array/iter.rs	2026-01-25 03:37:57 +05:30
Trevor Gross	74e70765ad	hint: Add a recommendation to benchmark with `cold_path` Other hints have a note recommending benchmarks to ensure they actually do what is intended. This is also applicable for `cold_path`, so add a note here.	2026-01-24 15:23:45 -06:00
Andreas Liljeqvist	a72f68e801	Fix is_ascii performance on x86_64 with explicit SSE2 intrinsics Use explicit SSE2 intrinsics to avoid LLVM's broken AVX-512 auto-vectorization which generates ~31 kshiftrd instructions. Performance - AVX-512: 34-48x faster - SSE2: 1.5-2x faster Improves on an earlier PR.	2026-01-24 22:03:58 +01:00
Matthias Krüger	f6cc562026	Rollup merge of #151403 - joboet:clock_nanosleep_time64, r=Mark-Simulacrum std: use 64-bit `clock_nanosleep` on GNU/Linux if available glibc 2.31 added support for both 64-bit `clock_gettime` and 64-bit `clock_nanosleep`. Thus, if [`__clock_nanosleep_time64`](https://sourceware.org/git/?p=glibc.git;a=blob;f=include/time.h;h=22b29ca583549488a0e5395cb820f55ec6e38e5f;hb=e14a91e59d35bf2fa649a9726ccce838b8c6e4b7#l322) and the underlying syscall are available, use them for implementing `sleep_until` to avoid having to fall back to `nanosleep` for long-duration sleeps.	2026-01-24 21:04:17 +01:00
Matthias Krüger	275ffd55b5	Rollup merge of #151538 - joboet:sleep_more, r=Mark-Simulacrum std: `sleep_until` on Motor and VEX This PR: * Forwards the public `sleep_until` to the private `sleep_until` on Motor OS * Adds a `sleep_until` implementation on VEX that yields until the deadline has passed CC @lasiotus CC @lewisfm @tropicaaal @Gavin-Niederman @max-niederman	2026-01-24 21:04:16 +01:00
Matthias Krüger	3a69035338	Rollup merge of #151346 - folkertdev:simd-splat, r=workingjubilee add `simd_splat` intrinsic Add `simd_splat` which lowers to the LLVM canonical splat sequence. ```llvm insertelement <N x elem> poison, elem %x, i32 0 shufflevector <N x elem> v0, <N x elem> poison, <N x i32> zeroinitializer ``` Right now we try to fake it using one of ```rust fn splat(x: u32) -> u32x8 { u32x8::from_array([x; 8]) } ``` or (in `stdarch`) ```rust fn splat(value: $elem_type) -> $name { #[derive(Copy, Clone)] #[repr(simd)] struct JustOne([$elem_type; 1]); let one = JustOne([value]); // SAFETY: 0 is always in-bounds because we're shuffling // a simd type with exactly one element. unsafe { simd_shuffle!(one, one, [0; $len]) } } ``` Both of these can confuse the LLVM optimizer, producing sub-par code. Some examples: - https://github.com/rust-lang/rust/issues/60637 - https://github.com/rust-lang/rust/issues/137407 - https://github.com/rust-lang/rust/issues/122623 - https://github.com/rust-lang/rust/issues/97804 --- As far as I can tell there is no way to provide a fallback implementation for this intrinsic, because there is no `const` way of evaluating the number of elements (there might be issues beyond that, too). So, I added implementations for all 4 backends. Both GCC and const-eval appear to have some issues with simd vectors containing pointers. I have a workaround for GCC, but haven't yet been able to make const-eval work. See the comments below. Currently this just adds the intrinsic, it does not actually use it anywhere yet.	2026-01-24 21:04:15 +01:00
Matthias Krüger	fd5f48f559	Rollup merge of #150905 - PaulDance:patches/unsupport-win7-hostname, r=Mark-Simulacrum Fix(lib/win/net): Remove hostname support under Win7 Fixes rust-lang/rust#150896. `GetHostNameW` is not available under Windows 7, leading to dynamic linking failures upon program executions. For now, as it is still unstable, this therefore appropriately cfg-gates the feature in order to mark the Win7 as unsupported with regards to this particular feature. Porting the functionality for Windows 7 would require changing the underlying system call and so more work for the immediate need. @rustbot label C-bug O-windows-7 T-libs A-io	2026-01-24 21:04:14 +01:00
Jonathan 'theJPster' Pallant	7cc102a4ee	Revised yield hints Turns out v7 targets always have v6t2 set, so that line was redundant. Also add a link to the Arm Armv7 A.R.M.	2026-01-24 17:29:25 +00:00
Jonathan 'theJPster' Pallant	96897f016e	Add ARMv6 bare-metal targets Three targets, covering A32 and T32 instructions, and soft-float and hard-float ABIs. Hard-float not available in Thumb mode. Atomics in Thumb mode require __sync* functions from compiler-builtins.	2026-01-24 17:29:25 +00:00
bjorn3	e8c48c6895	Fix review comments	2026-01-24 14:44:03 +00:00
bjorn3	d9ec1aef8d	Get rid of MarkedTypes	2026-01-24 14:44:03 +00:00
bjorn3	8a119c3145	Merge FreeFunctions trait into Server trait And rename FreeFunctions struct to Methods.	2026-01-24 14:44:03 +00:00
bjorn3	9481890143	Various simplifications after moving all bridge methods to a single type	2026-01-24 14:44:03 +00:00
bjorn3	2f44019470	Move all bridge methods into a single type	2026-01-24 14:44:03 +00:00
bjorn3	4dc28c59ab	Expand with_api_handle_types	2026-01-24 14:44:03 +00:00
bjorn3	dabae7eea4	Handle FreeFunctions outside with_api_handle_types It is a singleton which doesn't actually need to be passed through over the bridge.	2026-01-24 14:44:03 +00:00
bjorn3	ef819e49a8	Remove a couple of unnecessary impls	2026-01-24 14:44:03 +00:00
Jonathan Brouwer	85430dfc90	Rollup merge of #151555 - nicholasbishop:bishop-fix-just-uefi-test, r=Ayush1325,tgross35 Fix compilation of std/src/sys/pal/uefi/tests.rs Dropped the `align` test since the `POOL_ALIGNMENT` and `align_size` items it uses do not exist. The other changes are straightforward fixes for places where the test code drifted from the current API, since the tests are not yet built in CI for the UEFI target. CC @Ayush1325	2026-01-24 08:18:08 +01:00
Jonathan Brouwer	b6b11f473c	Rollup merge of #151551 - ehuss:test-build-script, r=jhpratt Don't use default build-script fingerprinting in `test` This changes the `test` build script so that it does not use the default fingerprinting mechanism in cargo which causes a full scan of the package every time it runs. This build script does not depend on any of the files in the package. This is the recommended approach for writing build scripts.	2026-01-24 08:18:07 +01:00
Jonathan Brouwer	474c9fe9a4	Rollup merge of #151489 - bend-n:constify-all-boolean-methods-under-feature-gate-const-bool, r=jhpratt constify boolean methods ```rs // core::bool impl bool { pub const fn then_some<T: [const] Destruct>(self, t: T) -> Option<T>; pub const fn then<T, F: [const] FnOnce() -> T + [const] Destruct>(self, f: F) -> Option<T>; pub const fn ok_or<E: [const] Destruct>(self, err: E) -> Result<(), E>; pub const fn ok_or_else<E, F: [const] FnOnce() -> E + [const] Destruct>; } ``` will make tracking issue if pr liked	2026-01-24 08:18:07 +01:00
Jonathan Brouwer	13f0399a57	Rollup merge of #151259 - bonega:fix-is-ascii-avx512, r=folkertdev Fix is_ascii performance regression on AVX-512 CPUs when compiling with -C target-cpu=native ## Summary This PR fixes a severe performance regression in `slice::is_ascii` on AVX-512 CPUs when compiling with `-C target-cpu=native`. On affected systems, the current implementation achieves only ~3 GB/s for large inputs, compared to ~60–70 GB/s previously (≈20–24× regression). This PR restores the original performance characteristics. This change is intended as a temporary workaround for upstream LLVM poor codegen. Once the underlying LLVM issue is fixed and Rust is able to consume that fix, this workaround should be reverted. ## Problem When `is_ascii` is compiled with AVX-512 enabled, LLVM's auto-vectorization generates ~31 `kshiftrd` instructions to extract mask bits one-by-one, instead of using the efficient `pmovmskb` instruction. This causes a ~22x performance regression. Because `is_ascii` is marked `#[inline]`, it gets inlined and recompiled with the user's target settings, affecting anyone using `-C target-cpu=native` on AVX-512 CPUs. ## Root cause (upstream) The underlying issue appears to be an LLVM vectorizer/backend bug affecting certain AVX-512 patterns. An upstream issue has been filed by @folkertdev to track the root cause: llvm/llvm-project#176906 Until this is resolved in LLVM and picked up by rustc, this PR avoids triggering the problematic codegen pattern. ## Solution Replace the counting loop with explicit SSE2 intrinsics (`_mm_movemask_epi8`) that force `pmovmskb` codegen regardless of CPU features. 
## Godbolt Links (Rust 1.92) \| Pattern \| Target \| Link \| Result \| \|---------\|--------\|------\|--------\| \| Counting loop (old) \| Default SSE2 \| https://godbolt.org/z/sE86xz4fY \| `pmovmskb` \| \| Counting loop (old) \| AVX-512 (znver4) \| https://godbolt.org/z/b3jvMhGd3 \| 31x `kshiftrd` (broken) \| \| SSE2 intrinsics (fix) \| Default SSE2 \| https://godbolt.org/z/hMeGfeaPv \| `pmovmskb` \| \| SSE2 intrinsics (fix) \| AVX-512 (znver4) \| https://godbolt.org/z/Tdvdqjohn \| `vpmovmskb` (fixed) \| ## Benchmark Results CPU: AMD Ryzen 5 7500F (Zen 4 with AVX-512) ### Default Target (SSE2) — Mixed \| Size \| Before \| After \| Change \| \|------\|--------\|-------\|--------\| \| 4 B \| 1.8 GB/s \| 2.0 GB/s \| +11% \| \| 8 B \| 3.2 GB/s \| 5.8 GB/s \| +81% \| \| 16 B \| 5.3 GB/s \| 8.5 GB/s \| +60% \| \| 32 B \| 17.7 GB/s \| 15.8 GB/s \| -11% \| \| 64 B \| 28.6 GB/s \| 25.1 GB/s \| -12% \| \| 256 B \| 51.5 GB/s \| 48.6 GB/s \| ~same \| \| 1 KB \| 64.9 GB/s \| 60.7 GB/s \| ~same \| \| 4 KB+ \| ~68-70 GB/s \| ~68-72 GB/s \| ~same \| ### Native Target (AVX-512) — Up to 24x Faster \| Size \| Before \| After \| Speedup \| \|------\|--------\|-------\|---------\| \| 4 B \| 1.2 GB/s \| 2.0 GB/s \| 1.7x \| \| 8 B \| 1.6 GB/s \| 5.0 GB/s \| 3.3x \| \| 16 B \| ~7 GB/s \| ~7 GB/s \| ~same \| \| 32 B \| 2.9 GB/s \| 14.2 GB/s \| 4.9x \| \| 64 B \| 2.9 GB/s \| 23.2 GB/s \| 8x \| \| 256 B \| 2.9 GB/s \| 47.2 GB/s \| 16x \| \| 1 KB \| 2.8 GB/s \| 60.0 GB/s \| 21x \| \| 4 KB+ \| 2.9 GB/s \| ~68-70 GB/s \| 23-24x \| ### Summary - SSE2 (default): Small inputs (4-16 B) 11-81% faster; 32-64 B ~11% slower; large inputs unchanged - AVX-512 (native): 21-24x faster for inputs ≥1 KB, peak ~70 GB/s (was ~3 GB/s) Note: this is the pure ascii path, but the story is similar for the others. See linked bench project. 
## Test Plan - [x] Assembly test (`slice-is-ascii-avx512.rs`) verifies no `kshiftrd` with AVX-512 - [x] Existing codegen test updated to `loongarch64`-only (auto-vectorization still used there) - [x] Fuzz testing confirms old/new implementations produce identical results (~53M iterations) - [x] Benchmarks confirm performance improvement - [x] Tidy checks pass ## Reproduction / Test Projects Standalone validation tools: https://github.com/bonega/is-ascii-fix-validation - `bench/` - Criterion benchmarks for SSE2 vs AVX-512 comparison - `fuzz/` - Compares old/new implementations with libfuzzer ## Related Issues - issue opened by @folkertdev llvm/llvm-project#176906 - Regression introduced in https://github.com/rust-lang/rust/pull/130733	2026-01-24 08:18:05 +01:00
joboet	dee0e39471	std: implement `sleep_until` on VEX	2026-01-23 23:15:12 +01:00
Nicholas Bishop	8995ff5ba8	Fix compilation of std/src/sys/pal/uefi/tests.rs Dropped the `align` test since the `POOL_ALIGNMENT` and `align_size` items it uses do not exist. The other changes are straightforward fixes for places where the test code drifted from the current API, since the tests are not yet built in CI for the UEFI target.	2026-01-23 17:07:06 -05:00
Eric Huss	e38b55d0b7	Don't use default build-script fingerprinting in `test` This changes the `test` build script so that it does not use the default fingerprinting mechanism in cargo which causes a full scan of the package every time it runs. This build script does not depend on any of the files in the package. This is the recommended approach for writing build scripts.	2026-01-23 11:10:53 -08:00
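
The `eq_ignore_ascii_case` entry above (#147436) describes processing fixed-size chunks so that the compiler can auto-vectorize the comparison. A minimal sketch of that idea follows. It is not the code from the PR: the function name is hypothetical, and it uses `chunks_exact(16)` as a stand-in for `as_chunks` so that it also builds on older toolchains. Whether the loop actually vectorizes depends on the compiler version and target.

```rust
/// Hypothetical helper (not the standard library implementation):
/// compares two byte slices ignoring ASCII case, working through
/// fixed 16-byte blocks first and the leftover tail afterwards.
fn eq_ignore_ascii_case_chunked(a: &[u8], b: &[u8]) -> bool {
    if a.len() != b.len() {
        return false;
    }

    const BLOCK: usize = 16;
    let mut a_chunks = a.chunks_exact(BLOCK);
    let mut b_chunks = b.chunks_exact(BLOCK);

    for (ca, cb) in a_chunks.by_ref().zip(b_chunks.by_ref()) {
        // No early exit inside a block: every byte is compared and the
        // results are AND-ed together, which keeps the loop body branch-free.
        let block_equal = ca
            .iter()
            .zip(cb)
            .fold(true, |acc, (x, y)| acc & (x.to_ascii_lowercase() == y.to_ascii_lowercase()));
        if !block_equal {
            return false;
        }
    }

    // Fewer than 16 bytes remain; compare them one at a time.
    a_chunks
        .remainder()
        .iter()
        .zip(b_chunks.remainder())
        .all(|(x, y)| x.eq_ignore_ascii_case(y))
}
```

Slices shorter than 16 bytes skip the block loop entirely and only run the byte-wise tail, which mirrors the length gate mentioned in the commit message.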

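The rename in #151669 (`gather_bits` to `extract_bits`, `scatter_bits` to `deposit_bits`) follows the naming of the x86 PEXT/PDEP instructions. The sketch below shows the usual meaning of those two operations in plain software for `u32`. It assumes PEXT/PDEP-style semantics, which the commit log itself does not spell out, and the function names are hypothetical rather than the unstable standard library methods.

```rust
/// Hypothetical software "extract" (PEXT-style, assumed semantics):
/// collects the bits of `x` at the positions set in `mask` and packs
/// them into the low bits of the result, lowest mask bit first.
fn extract_bits_u32(x: u32, mut mask: u32) -> u32 {
    let mut out = 0u32;
    let mut pos: u32 = 0;
    while mask != 0 {
        let lowest = mask & mask.wrapping_neg(); // isolate the lowest set bit
        if x & lowest != 0 {
            out |= 1u32 << pos;
        }
        pos += 1;
        mask &= mask - 1; // clear the lowest set bit
    }
    out
}

/// Hypothetical software "deposit" (PDEP-style, assumed semantics):
/// takes the low bits of `x` and places them at the positions set in
/// `mask`, lowest mask bit first. All other result bits are zero.
fn deposit_bits_u32(x: u32, mut mask: u32) -> u32 {
    let mut out = 0u32;
    let mut pos: u32 = 0;
    while mask != 0 {
        let lowest = mask & mask.wrapping_neg();
        if (x >> pos) & 1 != 0 {
            out |= lowest;
        }
        pos += 1;
        mask &= mask - 1;
    }
    out
}
```

Under these assumptions the two operations are partial inverses: depositing the result of an extract with the same mask gives back `x & mask`.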
27018 commits