Commits · c417b7a695704d5bc3be23f34d1bfa505f5172de · educg-net-26154-2315672 / llvm_project-2529

27 Feb, 2023 40 commits

[OHOS] Add support for OpenHarmony · c417b7a6

Pavel Kosov authored 2 years ago

Add support for OpenHarmony OS

General OpenHarmony OS discussion on discourse thread "[RFC] Add support for OpenHarmony OS"
https://discourse.llvm.org/t/rfc-add-support-for-openharmony-os/66656

Reviewed By: DavidSpickett

Differential Revision: https://reviews.llvm.org/D138202

c417b7a6

[SME2][AArch64] Add multi-indexed multiply-add long long intrinsics · a9df6270

Kerry McLaughlin authored 2 years ago

Adds intrinsics for the following SME2 instructions (1, 2 & 4 vector):
 - smlall
 - umlall
 - smlsll
 - umlsll
 - sumlall
 - usmlall

NOTE: These intrinsics are still in development and are subject to future changes.

Reviewed By: david-arm

Differential Revision: https://reviews.llvm.org/D143278

a9df6270

[GlobalOpt] Ignore only loaded / only stored global parts in global SRA heuristic · 49aa3777

Nikita Popov authored 2 years ago

When limiting the number of parts we split a global into, ignore
any parts that are either only loaded or only stored, because we
expect these to be optimized away after SRA.
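
The counting rule above can be sketched as follows. This is a toy model, not the GlobalOpt code: `Part`, `MAX_PARTS`, `counts_toward_limit` and `should_split` are hypothetical names, and the limit of 16 is an illustrative assumption.

```python
from dataclasses import dataclass


@dataclass
class Part:
    """Hypothetical model of one piece of a global after splitting."""
    offset: int
    loads: int   # number of load users of this part
    stores: int  # number of store users of this part


MAX_PARTS = 16  # illustrative limit (assumption, not taken from the commit)


def counts_toward_limit(part):
    # Parts that are only loaded or only stored are expected to be
    # optimized away after SRA, so they are not counted.
    return part.loads > 0 and part.stores > 0


def should_split(parts, max_parts=MAX_PARTS):
    # Split only if the number of parts that will survive is small enough.
    return sum(counts_toward_limit(p) for p in parts) <= max_parts
```

With this rule, a global with many only-loaded parts and a few parts that are both loaded and stored still qualifies for splitting.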

Differential Revision: https://reviews.llvm.org/D129857

49aa3777

[libc++][ranges] Implement LWG-3860 range_common_reference_t is missing · a8ead919

Igor Zhukov authored 2 years ago

a8ead919

[mlir] Use the same name as the generated parameter name (NFC). · 01b9d355

Adrian Kuegel authored 2 years ago

When commenting for which parameter a value is passed, the same name
should be used as is used for the real parameter. In this case, the
parameter name is generated from the TransformOps.td file.

01b9d355

[flang][OpenMP] Handle lastprivate on sections construct · f49b6afc

Nimish Mishra authored 2 years ago

This patch adds support for lastprivate on sections construct.
One omp.sections operation can have several omp.section operations. As such, the privatization happens in the lexically last omp.section operation.

Reviewed By: kiranchandramohan, peixin

Differential Revision: https://reviews.llvm.org/D133686

f49b6afc

[Flang] Add Minloc to simplify intrinsics pass · 614cd721

Sacha Ballantyne authored 2 years ago

This patch adds minloc to the simplify intrinsics pass, supporting calls with KIND or MASK arguments while calls which have BACK, DIM or have a CHARACTER input array are rejected. This patch is targeting exchange2, and in benchmarks provides a ~11% improvement in performance.

Also included are some minor style changes / cleanup in simplifyIntrinsics.cpp.
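
For reference, the semantics of the supported case (a rank-1 array with an optional MASK, no BACK or DIM) can be sketched as below. This is only an illustration of what MINLOC computes, not the code the pass generates. The function name `minloc` and its scalar return value are simplifications; Fortran returns a one-element array here.

```python
def minloc(array, mask=None):
    """Return the 1-based position of the first minimum among the
    elements selected by mask, or 0 if no element is selected."""
    best = 0
    for i, value in enumerate(array, start=1):
        if mask is not None and not mask[i - 1]:
            continue  # element excluded by MASK
        if best == 0 or value < array[best - 1]:
            best = i  # strict comparison keeps the first minimum
    return best
```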

Reviewed By: vzakhari

Differential Revision: https://reviews.llvm.org/D144103

614cd721

[LoopPredication] Account for critical edges when inserting assumes. PR26496 · a18ce47a

Max Kazantsev authored 2 years ago

Loop predication can insert assumes to preserve knowledge about some facts that
may otherwise be lost, because loop predication is a lossy transform. When a guard
is represented as a branch on a widenable condition, the assume should be inserted in
the guarded block. However, if the guarded block has predecessors other than the guard
block, then the condition might not dominate it. Currently we generate invalid code here.

One possible fix here is to split the critical edge and insert the assume there, but in
this case we would have to modify the CFG, which Loop Predication is not currently doing,
and we want to keep it that way.

The fix is to handle this case by inserting a Phi which takes `Cond` as input from the
guard block and `true` from any other blocks. This is valid in terms of IR and does
not introduce any new knowledge if we came from another block.
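
The Phi construction described above can be modelled roughly as follows. This is a toy sketch rather than the LoopPredication code: blocks and values are plain strings, and `build_assume_phi` and `phi_value` are hypothetical helpers.

```python
def build_assume_phi(guard_block, predecessors, cond):
    """Incoming (block, value) pairs of the phi that feeds the assume:
    `cond` from the guard block, the constant True from every other block."""
    return [(pred, cond if pred == guard_block else True)
            for pred in predecessors]


def phi_value(incoming, came_from, env):
    """Evaluate the phi for the predecessor we actually arrived from.
    `env` maps value names to their runtime values."""
    for block, value in incoming:
        if block == came_from:
            return env[value] if isinstance(value, str) else value
    raise KeyError(came_from)
```

Arriving from any block other than the guard block yields `True`, so the assume states nothing new on that path.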

Differential Revision: https://reviews.llvm.org/D144859
Reviewed By: nikic, skatkov

a18ce47a

Reapply [InstCombine] Remove early constant fold · ee2f9d6d

Nikita Popov authored 2 years ago

The reported compile-time regression has been addressed in
47f9109d.

Additionally, this contains a change to immediately fold zext
with constant operand, even if it's used in a trunc. I'm not sure
if this is relevant for anything, but I noticed it as a behavioral
discrepancy when investigating this issue.

-----

InstCombine currently performs a constant folding attempt as part
of the main InstCombine loop, before visiting the instruction.
However, each visit method will also attempt to simplify the
instruction, which will in turn constant fold it. (Additionally,
we also constant fold instructions before the main InstCombine loop
and use a constant folding IR builder, so this is doubly redundant.)

There is one place where InstCombine visit methods currently don't
call into simplification, and that's casts. To be conservative,
I've added an explicit constant folding call there (though it has
no impact on tests).

This makes for a mild compile-time improvement and in particular
mitigates the compile-time regression from enabling load
simplification in be88b581.

Differential Revision: https://reviews.llvm.org/D144369

ee2f9d6d

[SelectionDAG] Transitively copy NodeExtraInfo on RAUW · 7f635b90

Marco Elver authored 2 years ago

During legalization of the SelectionDAG, some nodes are replaced with
arch-specific nodes. These may be complex nodes, where the root node no
longer corresponds to the node that should carry the extra info.

Fix the issue by copying extra info to the new node and all its new
transitive operands during RAUW. See code comments for more details.
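
A rough sketch of the idea, under the simplifying assumption that a "new" node is one not already reachable from the replaced node. `Node`, `reachable` and `copy_extra_info` are hypothetical names; this is not the SelectionDAG API.

```python
class Node:
    """Toy DAG node; extra_info stands in for NodeExtraInfo."""

    def __init__(self, name, operands=()):
        self.name = name
        self.operands = list(operands)
        self.extra_info = None


def reachable(root):
    """All nodes reachable from root through operand edges."""
    seen, stack = set(), [root]
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(node.operands)
    return seen


def copy_extra_info(old, new):
    """Copy old.extra_info to `new` and to its transitive operands that
    were introduced by the replacement (not reachable from `old`)."""
    old_nodes = reachable(old)
    seen, stack = set(), [new]
    while stack:
        node = stack.pop()
        if node in seen or node in old_nodes:
            continue  # already handled, or existed before the replacement
        seen.add(node)
        node.extra_info = old.extra_info
        stack.extend(node.operands)
```

Operands shared with the replaced node, such as an address operand, keep whatever info they already had.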

This fixes the remaining pcsections-atomics.ll tests on X86.

Reviewed By: dvyukov

Differential Revision: https://reviews.llvm.org/D144677

7f635b90

[X86][FixupBWInsts] Fix propagation of !pcsections metadata · d73da868

Marco Elver authored 2 years ago

Use MIMetadata() to propagate both DebugLoc and !pcsections metadata.

This fixes several of the non-native sized !pcsections tests in
pcsections-atomics.ll.

Reviewed By: dvyukov

Differential Revision: https://reviews.llvm.org/D144676

d73da868

[X86] Improve atomics test for !pcsections · a5653b82

Marco Elver authored 2 years ago

Extend pcsections-atomics.ll to exhaustively test all atomic ops up to
64 bits. This currently shows that some atomic operations do not end up
in PC sections. This will be addressed in a subsequent change.

Differential Revision: https://reviews.llvm.org/D144710

a5653b82

[X86] Move atomics test for !pcsections into separate file · ba63ddd5

Marco Elver authored 2 years ago

The pcsections.ll test primarily tests that the AsmPrinter produces the
right output in sections. This output is not easily covered by
update_llc_test_checks.py, and as such is hand written. This makes
maintenance rather burdensome. Instead, let's keep pcsections.ll as
simple as possible.

Move the more complex tests that primarily test that some atomic
operations end up in the PC section to pcsections-atomics.ll.

NFC.

Reviewed By: dvyukov, vitalybuka

Differential Revision: https://reviews.llvm.org/D144675

ba63ddd5

[InstCombine] Guard against many users when swapping icmp operands · 47f9109d

Nikita Popov authored 2 years ago

This addresses the compile-time regression reported on D144369.
If we don't fold constant operands early, then we might end up
walking very large use lists of constants here. Explicitly exclude
constants, and also limit the number of inspected users to avoid
degenerate cases like this.
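
The two guards can be illustrated with a small sketch. The helper `users_to_inspect` and the cap of 8 are assumptions made for illustration, and the sketch stops at the cap, which may differ from what the actual change does once the limit is reached.

```python
from itertools import islice


def users_to_inspect(is_constant, users, max_users=8):
    """Return the users a transform may look at.

    Constants are excluded outright because their use lists can be very
    large; for other values at most max_users users are taken, without
    walking the rest of the use list."""
    if is_constant:
        return []
    return list(islice(users, max_users))
```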

This entire transform shouldn't be part of InstCombine in the
first place though.

47f9109d

[clang-format] Fix assertion that doesn't hold under fuzzing. · 398cddf6

Manuel Klimek authored 2 years ago

398cddf6

[SVE] Add intrinsics for uniform dsp operations that explicitly undefine the... · ec67d703

chendewen authored 2 years ago

[SVE] Add intrinsics for uniform dsp operations that explicitly undefine the result for inactive lanes.

This patch adds new intrinsics for uniform dsp operations and changes the lowering for the following builtins to emit calls to the new aarch64.sve.###.u intrinsics.
  svsqsub_x
  svsqsub_n_x
  svuqsub_x
  svuqsub_n_x
  svsqsubr_x
  svsqsubr_n_x
  svuqsubr_x
  svuqsubr_n_x

Reviewed By: Paul Walker
Differential Revision: https://reviews.llvm.org/D144704

ec67d703

[clang-format] Add macro replacement to fuzzing. · f600a5ae

Manuel Klimek authored 2 years ago

f600a5ae

[bazel] Port Bazel for e7950fce · 0264ca43

Haojian Wu authored 2 years ago

0264ca43

Allow building with CMAKE_SYSTEM_NAME=Generic · 1422f1bf

Michael Platings authored 2 years ago

This is important for building runtimes for bare metal targets.

Differential Revision: https://reviews.llvm.org/D144757

1422f1bf

[Test] Add failing test for PR61022 · 3bfb2357

Max Kazantsev authored 2 years ago

Details: https://github.com/llvm/llvm-project/issues/61022

3bfb2357

Revert "[GVN] Support address translation through select instructions" · 1aece0e5

Sergey Kachkov authored 2 years ago

This reverts commit b5bf6f63.

1aece0e5

[SCEV] Make scalable size representation more explicit · 0805d9d5

Nikita Popov authored 2 years ago

Represent scalable type sizes using C * vscale, where vscale is
the vscale constant expression. This exposes a bit more information
to SCEV, because the vscale multiplier is explicitly modeled in SCEV
(rather than part of the sizeof expression).
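
As a small illustration of the `C * vscale` form, the sketch below keeps the multiplier explicit instead of hiding it inside an opaque size. `ScalableSize` is a hypothetical class, not a SCEV type, and the printed form is only illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalableSize:
    coeff: int      # C: the known minimum size, e.g. in bytes
    scalable: bool  # True when the real size is coeff * vscale

    def at(self, vscale):
        """Concrete size once vscale is known."""
        return self.coeff * vscale if self.scalable else self.coeff

    def __str__(self):
        return f"({self.coeff} * vscale)" if self.scalable else str(self.coeff)
```

Because `coeff` is visible, an analysis can reason about it directly, for example that two sizes with the same coefficient are equal for any `vscale`.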

This is mainly intended as an alternative to D143642.

Differential Revision: https://reviews.llvm.org/D144624

0805d9d5

[clang-format] clang-format.el: fix warnings · 95c3c2b8

Augustin Fabre authored 2 years ago

Differential Revision: https://reviews.llvm.org/D143560

95c3c2b8

[gn build] Port e7950fce · f4a78830

LLVM GN Syncbot authored 2 years ago

f4a78830

[llvm-debuginfo-analyzer] (09/09) - CodeView Reader · e7950fce

Carlos Alberto Enciso authored 2 years ago

llvm-debuginfo-analyzer is a command line tool that processes debug
info contained in a binary file and produces a debug information
format agnostic “Logical View”, which is a high-level semantic
representation of the debug info, independent of the low-level
format.

The code has been divided into the following patches:

1) Interval tree
2) Driver and documentation
3) Logical elements
4) Locations and ranges
5) Select elements
6) Warning and internal options
7) Compare elements
8) ELF Reader
9) CodeView Reader

Full details:
https://discourse.llvm.org/t/llvm-dev-rfc-llvm-dva-debug-information-visual-analyzer/62570

This patch:

This is a high level summary of the changes in this patch.

CodeView Reader
- Support for CodeView/PDB.
  LVCodeViewReader, LVTypeVisitor, LVSymbolVisitor, LVLogicalVisitor

Reviewed By: psamolysov, probinson, djtodoro, zequanwu

Differential Revision: https://reviews.llvm.org/D125784

e7950fce

[include-cleaner] Fix an unintended early return when checking the · b6f48341

Haojian Wu authored 2 years ago

incompatible flags in the CLI tool.

b6f48341

[GVN] Support address translation through select instructions · b5bf6f63

Sergey Kachkov authored 2 years ago

Process cases when phi incoming in predecessor block has select
instruction, and this select address is unavailable, but there
are addresses translated from both sides of select instruction.

Differential Revision: https://reviews.llvm.org/D142705

b5bf6f63

[mlir][llvm] Stop exporting empty debug MD strings · d9391a37

Christian Ulmann authored 2 years ago

This commit ensures that no empty debug metadata strings are exported as
these are not legal names. Additionally, this commit ensures that
non-existing strings are not accidentally imported as empty strings.

Reviewed By: gysit

Differential Revision: https://reviews.llvm.org/D144263

d9391a37

[AMDGPU] Run update scripts on existing tests. NFC · 44f1cb04

Diana Picus authored 2 years ago

Update a few tests where the checks aren't exactly kosher.

Differential Revision: https://reviews.llvm.org/D144639

44f1cb04

[mlir][llvm] Builders don't access null attr (NFC) · ddd1d1c5

Christian Ulmann authored 2 years ago

Reviewed By: gysit

Differential Revision: https://reviews.llvm.org/D144267

ddd1d1c5

[mlir][SCF] Fix incorrect API usage in RewritePatterns · 61f37758

Matthias Springer authored 2 years ago

Incorrect API usage was detected by D144552.

Differential Revision: https://reviews.llvm.org/D144636

61f37758

[flang][hlfir] Lower associate construct to HLFIR · e5921ef0

Jean Perier authored 2 years ago

- always use genExprAddr when lowering to HLFIR: it does not create a
  temporary for array sections without vector subscripts, so there is
  no need to have custom logic.

- update mangling to deal with AssocDetailsEntity. Their name is
  required in HLFIR so that it can be added to the hlfir.declare
  that is created for the selector once it is lowered. This should
  allow getting debug info for the selector when debug info is generated
  from hlfir.declare.

The rest of associate construct lowering is unchanged and shared with
the current lowering.

This patch also enables select type lowering to work properly, but some
other todos (mainly about parent component references) prevent porting
the tests for now, so this will be done later.

Differential Revision: https://reviews.llvm.org/D144740

e5921ef0

[flang][hlfir] Lower allocatable assignment to HLFIR · 713b3ad4

Jean Perier authored 2 years ago

Nothing much to do except set the right attributes on hlfir.assign.

Differential Revision: https://reviews.llvm.org/D144727

713b3ad4

[flang][hlfir] add allocatable assignment semantic to hlfir.assign · 275c272c

Jean Perier authored 2 years ago

Differential Revision: https://reviews.llvm.org/D144723

275c272c

[AArch64] Added tests for inserting scalar result of uaddlv neon intrinsic function into a vector · f3b8aef2

Nilanjana Basu authored 2 years ago

Inserting scalar result of 'uaddlv' neon intrinsic function to a destination vector currently makes use of the integer unit. Subsequent patches will eliminate the redundant use of the integer registers in a more generic way that will include this special case. This is an initial set of tests for this functionality.

Differential Revision: https://reviews.llvm.org/D143038

f3b8aef2

[mlir][spirv] Fix Physical32/Physical64 support for OpenCL · 85365b16

Lei Zhang authored 2 years ago

We use `use64bitIndex` in the option to decide the target device
address bitwidth. This makes it consistent with index type
conversion too.
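
The intent can be sketched as one flag driving both choices. The function names below are hypothetical and the real conversion options are not modelled; only the option name `use64bitIndex` and the Physical32/Physical64 addressing models come from the commit.

```python
def addressing_model(use_64bit_index):
    """Pick the OpenCL physical addressing model from the index option."""
    return "Physical64" if use_64bit_index else "Physical32"


def index_bitwidth(use_64bit_index):
    """Bitwidth used when converting the index type."""
    return 64 if use_64bit_index else 32
```

Deriving both from the same flag keeps the device address width and the converted index width from disagreeing.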

Reviewed By: kuhar

Differential Revision: https://reviews.llvm.org/D144827

85365b16

[mlir][spirv] Respect client API requirements for 64-bit index · 9a4c768a

Lei Zhang authored 2 years ago

Vulkan requires GPU processor ID/count builtin variables to be
32-bit scalar or vector for all the cases. Similarly there
are special requirements for OpenCL. We need to make sure those
rules are respected when converting with a 64-bit index type.

Reviewed By: kuhar

Differential Revision: https://reviews.llvm.org/D144819

9a4c768a

[mlir][python] Don't emit diagnostics when printing invalid ops · 2aa12583

Rahul Kayaith authored 2 years ago

The asm printer grew the ability to automatically fall back to the
generic format for invalid ops, so this logic doesn't need to be in the
bindings anymore. The printer already handles suppressing diagnostics
that get emitted while checking if the op is valid.

Reviewed By: mehdi_amini, stellaraccident

Differential Revision: https://reviews.llvm.org/D144805

2aa12583

Precommit test for D144777, NFC · cf491a16

Jun Zhang authored 2 years ago

Signed-off-by: Jun Zhang <jun@junz.org>

cf491a16

[Clang] Copy strictfp attribute from pattern to instantiation · 5cc91f97

Serge Pavlov authored 2 years ago

If a template function contained a pragma that made it strictfp, code
generation for such a function crashed, because the instantiation did not
have the strictfp attribute. As a solution this attribute is copied from the
template to the instantiation.

Differential Revision: https://reviews.llvm.org/D143919

5cc91f97