Commits · 8ce135123ec574406c72333798380f486b734fc8 · educg-net-26154-2315672 / llvm_project-2529

27 Feb, 2023 40 commits

[ELF][PPC64] Merge PPC64R12SetupStub and PPC64PCRelPLTStub. NFC · 8ce13512

Fangrui Song authored 2 years ago

PPC64PCRelPLTStub (from D83669) duplicates lot of code from
PPC64R12SetupStub. Just merge them.

Note: PPC64R12SetupStub does not correctly handle long branch to a
non-preemptible non-TOC code.

8ce13512

[lld][WebAssembly] Fix handling of mixed strong and weak references · d65ed8cd

Sam Clegg authored 2 years ago

When adding a undefined symbols to the symbol table, if the existing
reference is weak replace the symbol flags with (potentially) non-weak
binding.

Fixes: https://github.com/llvm/llvm-project/issues/60829

Differential Revision: https://reviews.llvm.org/D144747

d65ed8cd

[test] Remove unnecessary -enable-new-pm=0 · 45391e13
Arthur Eubanks authored 2 years ago

45391e13

[LLVMContextImpl] Separate out opaque pointers · 5a201a73

Arthur Eubanks authored 2 years ago

To make the map lookups simpler for opaque pointers and to simplify future typed pointer code removal. No significant compile time wins though.

While we're here, remove the address space 0 optimization for typed pointers.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D144910

5a201a73

[BOLT] Fix intermittent crash with instrumentation · fb28196a

Maksim Panchenko authored 2 years ago

When createInstrumentedIndirectCall() was invoked for tail calls, we
attached annotation instruction twice to the new call instruction.
First in createDirectCall(), and then again while copying over the
metadata operands.

As a result, the annotations were not properly stripped for such calls
before the call to freeAnnotations() in LowerAnnotations pass. That lead
to use-after-free while restoring the offsets with setOffset() call.

Reviewed By: yota9

Differential Revision: https://reviews.llvm.org/D144806

fb28196a

[NFC][PGO] Prefix duplicate profile MemOp entry diagnostic with 'warning:' · 4364e242

Matthew Voss authored 2 years ago

Adding this prefix will indicate clearly that the compiler doesn't exit
when it hits this diagnostic. Searches for other non-fatal diagnostics
will also be able to find this diagnostic easily.

4364e242

[libc++] Fix "size_t" constants that should be "bool" or "int", and add tests · 049a3fe1

Arthur O'Dwyer authored 2 years ago

`is_placeholder`, despite having an "is_" name, actually returns an int:
1 for `_1`, 2 for `_2`, 3 for `_3`, and so on. But it should still be int,
not size_t.

049a3fe1

[X86] Split off x86-64-v* tuning flags. NFC · c08867e3

Simon Pilgrim authored 2 years ago

Noticed when reviewing D143786, we are currently inheriting the x86-64-v* tuning flags from specific CPUs when really we need these to be a mixture of common traits and tuning to avoid specific severe regressions.

Differential Revision: https://reviews.llvm.org/D144832

c08867e3

[libc] use vars in string to num fuzz targets · 62e7bdd2

Michael Jones authored 2 years ago

The string to integer and string to float standalone fuzz targets just
ran the functions and didn't do anything with the output. This was
intentional, since they are intended to be used with sanitizers to
detect buffer overflow bugs. Not using the variables was causing compile
warnings, so this patch adds trivial checks to use the variables.

Reviewed By: sivachandra, lntue

Differential Revision: https://reviews.llvm.org/D144208

62e7bdd2

Revert "[scudo] Only prepare PageMap entry for partial region" · 387452ec
Chia-hung Duan authored 2 years ago
```
This reverts commit 0a0b6fa4.
```
387452ec

[Bitcode] Remove typed pointer abbreviation · 01dacc41

Arthur Eubanks authored 2 years ago

Since typed pointers are deprecated.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D144901

01dacc41

[AArch64] Fix a warning · fa66e4bd

Kazu Hirata authored 2 years ago

This patch fixes:

  llvm/lib/Target/AArch64/AArch64MIPeepholeOpt.cpp:582:17: error:
  unused variable 'INSvilaneMI' [-Werror,-Wunused-variable]

fa66e4bd

[SPIR-V] Support TargetExtType for SPIR-V builtin types · 5ac69674

Michal Paszkowski authored 2 years ago

This patch adds support for TargetExtType/target(...) representing
SPIR-V builtin types. After D135202, target(...) is the preferred way
for representing SPIR-V builtin types in LLVM IR and the only working
in the opaque pointer mode.

In order to maintain compatibility with LLVM IR generated by older
versions of Clang and LLVM/SPIR-V Translator, pointers-to-opaque-structs
denoting SPIR-V/OpenCL builtin types will be translated to equivalent
SPIR-V target extension types. This translation is only available in the
typed pointer mode (-opaque-pointers=0).

The relevant LIT tests with SPIR-V builtins were converted to use the
new target(...) notation.

Differential Revision: https://reviews.llvm.org/D144494

5ac69674

[SLP] Fixes crash in BoUpSLP::isGatherShuffledEntry() · a700fb3d
Vasileios Porpodas authored 2 years ago
```
Crash caused by: 708eb1b9

Differential Revision: https://reviews.llvm.org/D144895
```
a700fb3d

[AArch64] Avoid using intermediate integer registers for copying between... · 72105d10

Nilanjana Basu authored 2 years ago

[AArch64] Avoid using intermediate integer registers for copying between source and destination floating point registers

In post-isel code, there are cases where there were redundant copies from a source FPR to an intermediate GPR in order to copy to a destination FPR. In this patch, we identify these patterns in post-isel peephole optimization and replace them with a direct FPR-to-FPR copy.
One example for this will be the insertion of the scalar result of 'uaddlv' neon intrinsic function into a destination vector. During instruction selection phase, 'uaddlv' result is copied to a GPR, & a vector insert instruction is matched separately to copy the previous result to a destination SIMD&FP register.

Reviewed By: dmgreen

Differential Revision: https://reviews.llvm.org/D142594

72105d10

[Clang] [AVR] Fix USHRT_MAX for 16-bit int. · 0fecac18

Daniel Thornburgh authored 3 years ago

For AVR, the definition of USHRT_MAX overflows.

Reviewed By: aaron.ballman, #clang-language-wg

Differential Revision: https://reviews.llvm.org/D144218

0fecac18

[clang-format-diff] Correctly parse start-of-file diffs · 50563944

Tamir Duberstein authored 2 years ago

Handle the case where the diff is a pure removal of lines. Before this
change start_line would end up as 0 which is rejected by clang-format.

Submitting on behalf of @tamird.

Differential Revision: https://reviews.llvm.org/D144291

50563944

[Pass][CHR] Move ControlHeightReduction to module optimization pipeline · 66673166

Rong Xu authored 2 years ago

This is a modified version of commit b3744233 by
Arthur (https://reviews.llvm.org/D143424).

Here we invoke to the pass independent of PGOOPT. We now check if the
profile is available through the program summary. This ensures CHR is
called in distributed ThinLTO BE compilation (where PGOOPT might not
be created).

Differential Revision: https://reviews.llvm.org/D144769

66673166

[SCEV] Hoist common cleanup code to function. (NFC) · 2f3c748c

Florian Hahn authored 2 years ago

This allows for easier updating of common code in follow-on patches.

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D144847

2f3c748c

[AArch64][GlobalISel] Reorder stack up-adjustment and register copies · 31d6a572

Amara Emerson authored 2 years ago

This change reorders the stack up-adjustment and return value copying phases of
machine-ir generation on Aarch64. Doing so prevents a bug observed for fastcc
calls with >8 arguments, where the up-adjustment required from making that call
is placed in the wrong place relative to spill and reloading code.

See: https://github.com/llvm/llvm-project/issues/60972 for full issue
reproduction and context.

Patch contributed by Bruce Collie

Differential Revision: https://reviews.llvm.org/D144791

31d6a572

[AArch64] Don't remove free sext_inreg(vector_extract(x)) if it leads to multiple extracts · 06daa515

David Green authored 2 years ago

If we have sext_inreg(vector_extract(x)) but the top bits are not used, DAG
will try to remove the sext_inreg, using vector_extract(x) directly. This can
lead to multiple uses of both sext_inreg(vector_extract(x)) and
vector_extract(x), leading to the generation of both umov and smov extracts.
This adds a target hook to prevent that under AArch64 where the sext_inreg can
be considered free if there are multiple uses of the sext and no uses of the
vector_extract. This helps fix a small regression from D144550.

Differential Revision: https://reviews.llvm.org/D144850

06daa515

[MLIR] Add primitive builders for scf.if · e7b52c46
Frederik Gossen authored 2 years ago
```
Differential Revision: https://reviews.llvm.org/D144886
```
e7b52c46

[scudo] Only prepare PageMap entry for partial region · 0a0b6fa4

Chia-hung Duan authored 2 years ago

This reduces the size of PageMap and we are more likely to use the
static local buffer. Note that now this is only supported for single
region case, i.e. on SizeClassAllocator64. For SizeClassAllocator32,
it needs a different way to save the PageMap.

Differential Revision: https://reviews.llvm.org/D142659

0a0b6fa4

[libc++][NFC] Format __split_buffer and move constructors that are marked... · 2aeda9aa

Nikolas Klauser authored 2 years ago

[libc++][NFC] Format __split_buffer and move constructors that are marked inline into the class body

Reviewed By: ldionne, #libc

Spies: libcxx-commits

Differential Revision: https://reviews.llvm.org/D142433

2aeda9aa

[libc++] Simplify the modules_include.sh.cpp script a bit · 411c799a

Nikolas Klauser authored 2 years ago

Reviewed By: #libc, ldionne

Spies: vvereschaka, libcxx-commits

Differential Revision: https://reviews.llvm.org/D144825

411c799a

[libc++] Improves clang-format settings. · de6827b5

Mark de Wever authored 2 years ago

Add a new test based .clang-format file which inherits from the generic
one. This moves some test specific formatting rules to the test
directory.

The main benefit is that headers are sorted, which makes it more likely
to catch these errors before creating a review instead of spotting the
error in the CI clang-tidy step.

Reviewed By: ldionne, philnik, #libc

Differential Revision: https://reviews.llvm.org/D144755

de6827b5

[libc++] Fixes operator& hijacking atomic types. · f41f3925

Mark de Wever authored 2 years ago

This uses std::addressof everywherein atomic. This is not strictly
needed for the integral and floating point specializations. They should
not be used by user defined types. But it's easier to fix everything.

Note these changes are made using a WIP clang-tidy plugin.

Reviewed By: #libc, ldionne

Differential Revision: https://reviews.llvm.org/D144786

f41f3925

[LLVMContextImpl] Separate out integer constant ones · 86bdcdf0

Arthur Eubanks authored 2 years ago

Very small compile time improvement:
https://llvm-compile-time-tracker.com/compare.php?from=6a7a8907e8334eaf551742148079c628f78e6ed7&to=454d1181fbdb9121f0c7a3ecf526520db32ab420&stat=instructions:u

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D144746

86bdcdf0

[LLVMContextImpl] Separate out integer constant zeroes · c3166753

Arthur Eubanks authored 2 years ago

Very small compile time improvement:
https://llvm-compile-time-tracker.com/compare.php?from=a628ca4925f7249b4fbd3e932c9627b12e2770dd&to=6a7a8907e8334eaf551742148079c628f78e6ed7&stat=instructions:u

Reviewed By: nikic

Differential Revision: https://reviews.llvm.org/D144745

c3166753

[SLP]Fix PR61018: Assertion `Mask[I] == UndefMaskElem && "Multiple uses · 007177bd

Alexey Bataev authored 2 years ago

of scalars."' failed.

Need to check for the reused indices when checking if 2 insertelement
instruction are from the same buildvector. If the inidices are reused,
better not to match buildvectors and consider them as differenet,
otherwise need to track the order of insertelement operations.

007177bd

[AMDGPU] Update the CHECK autogenerated as it's expired · d514726d
zhongyunde authored 2 years ago
```
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D144771
```
d514726d

[Sema] Use isSVESizelessBuiltinType instead of isSizelessBuiltinType to prevent crashing on RISC-V. · 2e731117

Craig Topper authored 2 years ago

These 2 spots are protecting calls to SVE specific functions. If RISC-V
sizeless types end up in there we trigger assertions.

Use the more specific isSVESizelessBuiltinType() to avoid letting
RISC-V vectors through.

Reviewed By: asb, c-rhodes

Differential Revision: https://reviews.llvm.org/D144772

2e731117

[Flang][OpenMP][OpenACC] Error for loop with no control · 7d7633bd

Kiran Chandramohan authored 2 years ago

Issue error if a DO construct associated with a loop does not have
loop control. Currently, it is issued only for the loop immediately
following the loop construct. This patch extends it to cases like
collapse where there is more than one loop associated. It also fixes
a crash since the existing code always expects loop control.

This is covered in OpenMP 4.5 standard, Section 2.7.1.
"The do-loop cannot be a DO WHILE or a DO loop without loop control."

OpenACC 3.3 covers this indirectly in Section 2.9.1.
The trip count for all loops associated with the collapse clause must
be computable and invariant in all the loops".

Reviewed By: clementval

Differential Revision: https://reviews.llvm.org/D144290

7d7633bd

[OpenMP] Ignore implicit casts on assertion for `use_device_ptr` · 853d4059

Joseph Huber authored 2 years ago

There was an assertion triggering when invoking a captured member whose
initializer was in a blase class. This patch fixes it by allowing the
assertion on implicit casts to the base class rather than only the base
class itself.

Fixes https://github.com/llvm/llvm-project/issues/61027

Reviewed By: tianshilei1992

Differential Revision: https://reviews.llvm.org/D144873

853d4059

[Flang][OpenMP] NFC: Change a few message/comments to fit 80chars · 54acf9a3

Kiran Chandramohan authored 2 years ago

Changes are all in the OpenMP semantic checks file.

Reviewed By: SBallantyne

Differential Revision: https://reviews.llvm.org/D144874

54acf9a3

[mlir][Linalg] Reimplement hoisting on tensors as a subset-based transformation · 4521b113

Nicolas Vasilache authored 2 years ago

This revision significantly rewrites hoisting on tensors.
Previously, `vector.transfer_read/write` and `tensor.extract/insert_slice` would
be clumped together when looking for candidate pairs.
This would significantly increase the complexity of the logic and would not apply
independently to `tensor.extract/insert_slice`.

The new implementation decouples the cases and starts to cast the problem
as a generic matching subset extract/insert, which will be future proof when
other such operation pairs are introduced.

Lastly, the implementation makes the distinction clear between `vector.transfer_read/write` for
which we allow bypasses of the disjoint subsets from `tensor.extract/insert_slice` for which we
do not yet allow it.

This can be extended in the future and unified once we have subset disjunction implemented more generally.

The algorithm can be rewritten to be less of a fixed point with interspersed canonicalizations.
As a consequence, the test explicitly adds a canonicalization to clean up the IR and verify we end up in the same state.

That extra canonicalization exhibited that one of the uses in one of the tests was dead, so we fix the appropriate test.

Differential Revision: https://reviews.llvm.org/D144656

4521b113

[mlir] Fix a -Wunused-variable warning, NFC · 779d54fd
Haojian Wu authored 2 years ago

779d54fd

[ConstExpr] Avoid creation of select constant expressions · 5d6dfba1

Nikita Popov authored 2 years ago

These expressions will now only be created if explicitly requested
in IR/bitcode (and by LowerTypeTests, which has a tricky to remove
use).

This is in preparation for removing these expressions entirely,
but also fixes #60983 in the meantime.

5d6dfba1

[MLIR] Add pass to deduplicate functions · b12bcf3f

Frederik Gossen authored 2 years ago

Deduplicate functions that are equivalent in all aspects but their symbol name.
The pass chooses one representative per equivalence class, erases the remainder, and updates function calls accordingly.

Differential Revision: https://reviews.llvm.org/D144738

b12bcf3f

[mlir] Port bazel for 115711c1 · 8877d8f5
Haojian Wu authored 2 years ago

8877d8f5