bjoernager/rust - mandelbrot.dk

Author	SHA1	Message	Date
Josh Stone	12167d7064	Update the minimum external LLVM to 19	2025-04-05 11:44:38 -07:00
Stuart Cook	c6bf3a01ef	Rollup merge of #137880 - EnzymeAD:autodiff-batching, r=oli-obk Autodiff batching Enzyme supports batching, which is best known from the ML side when training neural networks. There we would normally have a training loop, where in each iteration we pass in some data (e.g. an image) and a target vector. Based on how close our prediction is, we compute our loss, and then use backpropagation to compute the gradients and update our weights. That's quite inefficient, so what you normally do is pass in a batch of 8/16/.. images and targets, and compute the gradients for all of them at once, allowing better optimizations. Enzyme supports batching in two ways. The first one (which I implemented here) just accepts a batch size, and then each Dual/Duplicated argument has not one, but N shadow arguments. So instead of ```rs for i in 0..100 { df(x[i], y[i], 1234); } ``` you can now do ```rs for i in (0..100).step_by(4) { df(x[i+0], x[i+1], x[i+2], x[i+3], y[i+0], y[i+1], y[i+2], y[i+3], 1234); } ``` which will give the same results, but allows better compiler optimizations. See the testcase for details. There is a second variant, where we can mark certain arguments and instead of having to pass in N shadow arguments, Enzyme assumes that the argument is N times longer. I.e. instead of accepting 4 slices with 12 floats each, we would accept one slice with 48 floats. I'll implement this over the next few days. I will also add more tests for both modes. For anyone preferring a more interactive explanation, here's a video of Tim's llvm dev talk, where he presents his work. https://www.youtube.com/watch?v=edvaLAL5RqU I'll also add some other docs to the dev guide and user docs in another PR. r? ghost Tracking: - https://github.com/rust-lang/rust/issues/124509 - https://github.com/rust-lang/rust/issues/135283	2025-04-05 13:18:13 +11:00
Manuel Drehwald	b7c63a973f	add autodiff batching backend	2025-04-04 14:24:23 -04:00
Augie Fackler	b14a0ce7f6	PassWrapper: adapt for llvm/llvm-project@94122d58fc We also have to remove the LLVM argument in cast-target-abi.rs for LLVM 21. I'm not really sure what the best approach here is since that test already uses revisions. We could also fork the test into a copy for LLVM 19-20 and another for LLVM 21, but what I did for now was drop the lint-abort-on-error flag to LLVM figuring that some coverage was better than none, but I'm happy to change this if that was a bad direction. The above also applies for ffi-out-of-bounds-loads.rs. r? dianqk @rustbot label llvm-main	2025-03-31 15:47:26 -04:00
Zalathar	d07ef5b0e1	coverage: Add LLVM plumbing for expansion regions This is currently unused, but paves the way for future work on expansion regions without having to worry about the FFI parts.	2025-03-20 12:40:36 +11:00
Hans Wennborg	eb2e421e36	Adapt to LLVM dropping CfiFunctionIndex::begin()/end() After https://github.com/llvm/llvm-project/pull/130382, RustWrapper needs to call CfiFunctionIndex::symbols() instead.	2025-03-12 20:03:29 +01:00
Jacob Pratt	b16047780d	Rollup merge of #138137 - ZequanWu:fix-triple, r=cuviper setTargetTriple now accepts Triple rather than string https://github.com/llvm/llvm-project/pull/129868 updated `setTargetTriple`	2025-03-07 21:57:52 -05:00
Zequan Wu	8814679a54	rename Triple to Target	2025-03-06 22:52:20 -08:00
Zequan Wu	4e4bed8684	setTargetTriple now accepts Triple rather than string	2025-03-06 16:37:57 -08:00
Josh Stone	feae27945b	[llvm/PassWrapper] use `size_t` when building arg strings	2025-03-06 10:09:19 -08:00
bors	0c72c0d11a	Auto merge of #133250 - DianQK:embed-bitcode-pgo, r=nikic The embedded bitcode should always be prepared for LTO/ThinLTO Fixes #115344. Fixes #117220. There are currently two methods for generating bitcode that is used for LTO. One method involves using `-C linker-plugin-lto` to emit object files as bitcode, which is the typical setting used by cargo. The other method is through `-C embed-bitcode=yes`. When used with `-C embed-bitcode=yes -C lto=no`, we run a complete non-LTO LLVM pipeline to obtain bitcode, then the bitcode is used for LTO. We run the Call Graph Profile Pass twice on the same module. This PR is doing something similar to LLVM's `buildFatLTODefaultPipeline`, obtaining the bitcode for embedding after running `buildThinLTOPreLinkDefaultPipeline`. r? nikic	2025-03-01 08:22:18 +00:00
bors	e0be1a0262	Auto merge of #137271 - nikic:gep-nuw-2, r=scottmcm Emit getelementptr inbounds nuw for pointer::add() Lower pointer::add (via intrinsic::offset with unsigned offset) to getelementptr inbounds nuw on LLVM versions that support it. This lets LLVM make use of the pre-condition that the offset addition does not wrap in an unsigned sense. Together with inbounds, this also implies that the offset is non-negative. Fixes https://github.com/rust-lang/rust/issues/137217.	2025-02-24 03:06:16 +00:00
DianQK	1a99ca8da9	The embedded bitcode should always be prepared for LTO/ThinLTO	2025-02-23 21:23:36 +08:00
Manuel Drehwald	161a4bf6ff	update enzyme submodule and users	2025-02-21 21:49:46 -05:00
Nikita Popov	5e9d8a7d55	Switch to the LLVMBuildGEPWithNoWrapFlags API This API allows us to set the nuw flag as well.	2025-02-19 11:32:32 +01:00
Nikita Popov	97f6e4d34b	Quote embedded codeview command line arguments The formatting of the command line arguments has been moved to the frontend in: `e190d074a0` However, the Rust logic introduced in `ad0ecebf43` did not replicate the previous argument quoting behavior.	2025-02-14 11:02:28 +01:00
Matthias Krüger	78f5bddd57	Rollup merge of #136419 - EnzymeAD:autodiff-tests, r=onur-ozkan,jieyouxu adding autodiff tests I'd like to get started with upstreaming some tests, even though I'm still waiting for an answer on how to best integrate the enzyme pass. Can we therefore temporarily support the -Z llvm-plugins here without too much effort? And in that case, how would that work? I saw you can do remapping, e.g. `rust-src-base`, but I don't think that will give me the path to libEnzyme.so. Do you have another suggestion? Other than that, this test simply checks that the derivative of `x*x` is `2.0 * x`, which in this case is computed as `%0 = fadd fast double %x.0.val, %x.0.val` (I'll add a few more tests and move it to an autodiff folder if we can use the -Z flag) r? ``@jieyouxu`` Locally at least `-Zllvm-plugins=${PWD}/build/x86_64-unknown-linux-gnu/enzyme/build/Enzyme/libEnzyme-19.so` seems to work if I copy the command I get from x.py test and run it manually. However, running x.py test itself fails. Tracking: - https://github.com/rust-lang/rust/issues/124509 Zulip discussion: https://rust-lang.zulipchat.com/#narrow/channel/326414-t-infra.2Fbootstrap/topic/Enzyme.20build.20changes	2025-02-10 16:38:23 +01:00
Manuel Drehwald	1221cff551	move second opt run to lto phase and cleanup code	2025-02-10 01:35:22 -05:00
Manuel Drehwald	21d096184e	fix non-enzyme builds	2025-02-07 22:27:46 -05:00
Manuel Drehwald	8f87e8095e	get tests to work without -Z/-C flags	2025-02-07 01:58:48 -05:00
Daniel Paoliello	2a6b27444a	Remove dead code from rustc_codegen_llvm and the LLVM wrapper	2025-02-06 16:53:52 -08:00
Zalathar	8ddd9c38f6	Use `LLVMDIBuilderCreateDebugLocation` The LLVM-C binding takes an explicit context, whereas our binding obtained the context from the scope argument.	2025-02-01 14:14:40 +11:00
Zalathar	949b4673ce	Use `LLVMDIBuilderCreateLexicalBlockFile`	2025-02-01 14:14:40 +11:00
Zalathar	70d41bc711	Use `LLVMDIBuilderCreateLexicalBlock`	2025-02-01 14:14:40 +11:00
Zalathar	878ab125a1	Use `LLVMDIBuilderCreateNameSpace`	2025-02-01 14:14:39 +11:00
Zalathar	cd2af2dd9a	Use `LLVMDIBuilderFinalize`	2025-02-01 13:38:12 +11:00
bors	854f22563c	Auto merge of #136350 - matthiaskrgr:rollup-6eqfyvh, r=matthiaskrgr Rollup of 9 pull requests Successful merges: - #134531 ([rustdoc] Add `--extract-doctests` command-line flag) - #135860 (Compiler: Finalize dyn compatibility renaming) - #135992 (Improve documentation when adding a new target) - #136194 (Support clobber_abi in BPF inline assembly) - #136325 (Delay a bug when indexing unsized slices) - #136326 (Replace our `LLVMRustDIBuilderRef` with LLVM-C's `LLVMDIBuilderRef`) - #136330 (Remove unnecessary hooks) - #136336 (Overhaul `rustc_middle::util`) - #136341 (Remove myself from vacation) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-31 20:16:46 +00:00
Matthias Krüger	a5a005febe	Rollup merge of #136326 - Zalathar:llvm-di-builder-ref, r=nikic Replace our `LLVMRustDIBuilderRef` with LLVM-C's `LLVMDIBuilderRef` Inspired by trying to split #134009 into smaller steps that are easier to review individually. This makes it possible to start incrementally replacing our debuginfo bindings with the ones in the LLVM-C API, all of which operate on `LLVMDIBuilderRef`. There should be no change to compiler behaviour.	2025-01-31 12:28:18 +01:00
bors	7f36543a48	Auto merge of #136332 - jhpratt:rollup-aa69d0e, r=jhpratt Rollup of 9 pull requests Successful merges: - #132156 (When encountering unexpected closure return type, point at return type/expression) - #133429 (Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle) - #136281 (`rustc_hir_analysis` cleanups) - #136297 (Fix a typo in profile-guided-optimization.md) - #136300 (atomic: extend compare_and_swap migration docs) - #136310 (normalize `*.long-type.txt` paths for compare-mode tests) - #136312 (Disable `overflow_delimited_expr` in edition 2024) - #136313 (Filter out RPITITs when suggesting unconstrained assoc type on too many generics) - #136323 (Fix a typo in conventions.md) r? `@ghost` `@rustbot` modify labels: rollup	2025-01-31 09:42:28 +00:00
Zalathar	bf15d599ff	Remove the temporary typedef for `LLVMRustDIFlags` If we're already churning all of the debuginfo bindings, we might as well fix this at the same time.	2025-01-31 13:03:03 +11:00
Zalathar	63a039a7f5	Replace our `LLVMRustDIBuilderRef` with LLVM-C's `LLVMDIBuilderRef` This makes it possible to start incrementally replacing our debuginfo bindings with the ones in the LLVM-C API, all of which operate on `LLVMDIBuilderRef`.	2025-01-31 13:02:42 +11:00
Hans Wennborg	6e457b88eb	use Attribute::getWithCaptureInfo	2025-01-30 12:30:14 +01:00
Hans Wennborg	23fb08bb53	LLVM changed the nocapture attribute to captures(none) This updates RustWrapper.cpp and tests after https://github.com/llvm/llvm-project/pull/123181	2025-01-30 11:22:46 +01:00
Manuel Drehwald	1f30517d40	upstream rustc_codegen_ssa/rustc_middle changes for enzyme/autodiff	2025-01-29 21:31:13 -05:00
Zalathar	32f1c1d85e	Make our `DIFlags` match `LLVMDIFlags` in the LLVM-C API	2025-01-21 14:41:44 +11:00
Matthew Maurer	fc32dd49cb	llvm: Ignore error value that is always false See llvm/llvm-project#121851 For LLVM 20+, this function (`renameModuleForThinLTO`) has no return value. For prior versions of LLVM, this never failed, but had a signature which allowed an error value people were handling.	2025-01-07 01:02:22 +00:00
Zalathar	1b62645418	Use constants for DWARF opcodes, instead of FFI calls	2025-01-05 22:16:25 +11:00
bors	3dc3c524f7	Auto merge of #133990 - Walnut356:static_const, r=workingjubilee [Debuginfo] Force enum `DISCR_` to `static const u64` to allow for inspection via LLDB see [here](`486614878`) for more info. This change mainly helps `-msvc` debugged with LLDB. Currently, LLDB cannot inspect `static` struct fields, so the intended visualization for enums is only borderline functional, and niche enums with ranges of discriminant cannot be determined at all. LLDB can inspect `static const` values (though for whatever reason, non-enum/non-u64 consts don't work). This change adds the `LLVMRustDIBuilderCreateQualifiedType` to the rust FFI layer to wrap the discr type with a `const` modifier, as well as forcing all generated integer enum `DISCR_*` values to be u64's. Those values will only ever be used by debugger visualizers anyway, so it shouldn't be a huge deal, but I left a fixme comment for it just in case. The `tag` also still properly reflects the discriminant type, so no information is lost.	2025-01-04 23:56:29 +00:00
Manuel Drehwald	d753cbf779	upstream rustc_codegen_llvm changes for enzyme/autodiff	2025-01-01 21:42:45 +01:00
Walnut	a1191e30b6	force enum `DISCR_*` to `const u64` to allow for inspection via LLDB's `SBTypeStaticField::GetConstantValue()`	2024-12-30 19:01:48 -06:00
Ralf Jung	fff026c8e5	rustc_llvm: expose FloatABIType target machine parameter	2024-12-30 18:10:59 +01:00
Walnut	bc4266ca96	add LLVMRustDIBuilderCreateQualifiedType to ffi	2024-12-23 19:12:32 -06:00
bjorn3	c02c311d84	Remove some dead code around import library generation This was missed when replacing the usage of LLVM for generating import libraries.	2024-12-20 15:20:15 +00:00
bors	903d2976fd	Auto merge of #129181 - beetrees:asm-spans, r=pnkfelix,compiler-errors Pass end position of span through inline ASM cookie Before this PR, only the start position of the span was passed through the inline ASM cookie to diagnostics. LLVM 19 has full support for 64-bit inline ASM cookies; this PR uses that to pass the end position of the span in the upper 32 bits, meaning inline ASM diagnostics now point at the entire line the error occurred on, not just the first character of it.	2024-12-12 02:34:06 +00:00
Nikita Popov	d3ad000943	Respect verify-llvm-ir option in the backend We are currently unconditionally verifying the LLVM IR in the backend (twice), ignoring the value of the verify-llvm-ir option.	2024-11-26 15:26:03 +01:00
beetrees	68227a3777	Pass end position of span through inline ASM cookie	2024-11-26 13:00:08 +00:00
许杰友 Jieyou Xu (Joe)	c6d36256a6	Rollup merge of #127483 - BertalanD:no_sanitize-global-var, r=rcvalle Allow disabling ASan instrumentation for globals AddressSanitizer adds instrumentation to global variables unless the [`no_sanitize_address`](https://llvm.org/docs/LangRef.html#global-attributes) attribute is set on them. This commit extends the existing `#[no_sanitize(address)]` attribute to set this; previously it only had the desired effect on functions. (cc https://github.com/rust-lang/rust/issues/39699)	2024-11-23 20:19:51 +08:00
Jakob Koschel	61013f040e	PassWrapper: disable UseOdrIndicator for Asan Win32 As described here UseOdrIndicator should be disabled on Windows since link.exe does not support duplicate weak definitions (https://reviews.llvm.org/D137227). Co-Authored-By: Bastian Kersting <bkersting@google.com>	2024-11-14 04:20:18 +08:00
Kyle Huey	1dc106121b	Add discriminators to DILocations when multiple functions are inlined into a single point. LLVM does not expect to ever see multiple dbg_declares for the same variable at the same location with different values. proc-macros make it possible for arbitrary code, including multiple calls that get inlined, to happen at any given location in the source code. Add discriminators when that happens so these locations are different to LLVM. This may interfere with the AddDiscriminators pass in LLVM, which is added by the unstable flag -Zdebug-info-for-profiling. Fixes #131944	2024-11-09 08:01:31 -08:00
bors	4b198d6871	Auto merge of #132584 - Zalathar:includes, r=cuviper Trim and tidy includes in `rustc_llvm` These includes tend to accumulate over time, and are usually only removed when something breaks in a new LLVM version, so it's nice to clean them up manually once in a while. General strategy used for this PR: - Remove all includes from `LLVMWrapper.h` that aren't needed by the header itself, transplanting them to individual source files as necessary. - For each source file, temporarily remove each include if doing so doesn't cause a compile error. - If a “required” include looks like it shouldn't be needed, try replacing it with its sub-includes, then trim that list. - After doing all of the above, go back and re-add any removed include if the file does actually use things defined in that header, even if the header happens to also be included by something else.	2024-11-09 09:46:08 +00:00
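
The inline ASM cookie change listed above (#129181) describes carrying the end position of a span in the upper 32 bits of a 64-bit cookie. A minimal sketch of that kind of packing follows. The function names `pack_cookie` and `unpack_cookie` are hypothetical, and placing the start position in the lower 32 bits is an assumption; this is not the compiler's actual code.

```rust
/// Packs two 32-bit positions into one 64-bit cookie.
/// Assumption: start goes in the lower half, end in the upper half.
fn pack_cookie(start: u32, end: u32) -> u64 {
    (u64::from(end) << 32) | u64::from(start)
}

/// Splits a cookie back into (start, end).
fn unpack_cookie(cookie: u64) -> (u32, u32) {
    // Truncating cast keeps the lower 32 bits; the shift exposes the upper 32.
    (cookie as u32, (cookie >> 32) as u32)
}

fn main() {
    let cookie = pack_cookie(120, 157);
    let (start, end) = unpack_cookie(cookie);
    // The round trip must return the original positions.
    assert_eq!((start, end), (120, 157));
    println!("cookie = {cookie:#018x}, start = {start}, end = {end}");
}
```

A consumer that only reads the lower 32 bits would still recover the start position under this layout.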


473 commits
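
The autodiff batching entry (#137880) describes replacing several derivative calls with one call that takes a batch of inputs. The sketch below illustrates only the calling pattern, using a hand-written derivative of `f(x) = x * x` in place of an Enzyme-generated one. The names `df`, `df_batch4`, `grads_scalar`, and `grads_batched` are hypothetical, and the unstable autodiff feature is not used here.

```rust
/// Hand-written stand-in for a generated derivative: d/dx of x * x.
fn df(x: f64) -> f64 {
    2.0 * x
}

/// Hypothetical batched variant with batch size 4:
/// one call takes four inputs and returns four derivatives.
fn df_batch4(xs: [f64; 4]) -> [f64; 4] {
    [df(xs[0]), df(xs[1]), df(xs[2]), df(xs[3])]
}

/// One derivative call per element.
fn grads_scalar(xs: &[f64]) -> Vec<f64> {
    xs.iter().map(|&x| df(x)).collect()
}

/// One derivative call per group of four, with a scalar tail for leftovers.
fn grads_batched(xs: &[f64]) -> Vec<f64> {
    let mut out = Vec::with_capacity(xs.len());
    let mut chunks = xs.chunks_exact(4);
    for c in chunks.by_ref() {
        out.extend_from_slice(&df_batch4([c[0], c[1], c[2], c[3]]));
    }
    // Elements that do not fill a whole batch fall back to the scalar call.
    for &x in chunks.remainder() {
        out.push(df(x));
    }
    out
}

fn main() {
    let xs: Vec<f64> = (0..10u32).map(f64::from).collect();
    // Both strategies must agree element for element.
    assert_eq!(grads_scalar(&xs), grads_batched(&xs));
    println!("{:?}", grads_batched(&xs));
}
```

The batched loop produces the same values as the scalar loop. Any speedup described in the commit message would come from the compiler seeing four independent computations in one call, which this hand-written sketch does not measure.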