Skip to content

Commit af9814a

Browse files
committed
[GlobalISel] Enable legalizing non-power-of-2 sized types.
This changes the interface of how targets describe how to legalize, see the below description. 1. Interface for targets to describe how to legalize. In GlobalISel, the API in the LegalizerInfo class is the main interface for targets to specify which types are legal for which operations, and what to do to turn illegal type/operation combinations into legal ones. For each operation the type sizes that can be legalized without having to change the size of the type are specified with a call to setAction. This isn't different to how GlobalISel worked before. For example, for a target that supports 32 and 64 bit adds natively: for (auto Ty : {s32, s64}) setAction({G_ADD, 0, s32}, Legal); or for a target that needs a library call for a 32 bit division: setAction({G_SDIV, s32}, Libcall); The main conceptual change to the LegalizerInfo API, is in specifying how to legalize the type sizes for which a change of size is needed. For example, in the above example, how to specify how all types from i1 to i8388607 (apart from s32 and s64 which are legal) need to be legalized and expressed in terms of operations on the available legal sizes (again, i32 and i64 in this case). Before, the implementation only allowed specifying power-of-2-sized types (e.g. setAction({G_ADD, 0, s128}, NarrowScalar). A worse limitation was that if you'd wanted to specify how to legalize all the sized types as allowed by the LLVM-IR LangRef, i1 to i8388607, you'd have to call setAction 8388607-3 times and probably would need a lot of memory to store all of these specifications. Instead, the legalization actions that need to change the size of the type are specified now using a "SizeChangeStrategy". For example: setLegalizeScalarToDifferentSizeStrategy( G_ADD, 0, widenToLargerAndNarrowToLargest); This example indicates that for type sizes for which there is a larger size that can be legalized towards, do it by Widening the size. For example, G_ADD on s17 will be legalized by first doing WidenScalar to make it s32, after which it's legal. The "NarrowToLargest" indicates what to do if there is no larger size that can be legalized towards. E.g. G_ADD on s92 will be legalized by doing NarrowScalar to s64. Another example, taken from the ARM backend is: for (unsigned Op : {G_SDIV, G_UDIV}) { setLegalizeScalarToDifferentSizeStrategy(Op, 0, widenToLargerTypesUnsupportedOtherwise); if (ST.hasDivideInARMMode()) setAction({Op, s32}, Legal); else setAction({Op, s32}, Libcall); } For this example, G_SDIV on s8, on a target without a divide instruction, would be legalized by first doing action (WidenScalar, s32), followed by (Libcall, s32). The same principle is also followed for when the number of vector lanes on vector data types need to be changed, e.g.: setAction({G_ADD, LLT::vector(8, 8)}, LegalizerInfo::Legal); setAction({G_ADD, LLT::vector(16, 8)}, LegalizerInfo::Legal); setAction({G_ADD, LLT::vector(4, 16)}, LegalizerInfo::Legal); setAction({G_ADD, LLT::vector(8, 16)}, LegalizerInfo::Legal); setAction({G_ADD, LLT::vector(2, 32)}, LegalizerInfo::Legal); setAction({G_ADD, LLT::vector(4, 32)}, LegalizerInfo::Legal); setLegalizeVectorElementToDifferentSizeStrategy( G_ADD, 0, widenToLargerTypesUnsupportedOtherwise); As currently implemented here, vector types are legalized by first making the vector element size legal, followed by then making the number of lanes legal. The strategy to follow in the first step is set by a call to setLegalizeVectorElementToDifferentSizeStrategy, see example above. The strategy followed in the second step "moreToWiderTypesAndLessToWidest" (see code for its definition), indicating that vectors are widened to more elements so they map to natively supported vector widths, or when there isn't a legal wider vector, split the vector to map it to the widest vector supported. Therefore, for the above specification, some example legalizations are: * getAction({G_ADD, LLT::vector(3, 3)}) returns {WidenScalar, LLT::vector(3, 8)} * getAction({G_ADD, LLT::vector(3, 8)}) then returns {MoreElements, LLT::vector(8, 8)} * getAction({G_ADD, LLT::vector(20, 8)}) returns {FewerElements, LLT::vector(16, 8)} 2. Key implementation aspects. How to legalize a specific (operation, type index, size) tuple is represented by mapping intervals of integers representing a range of size types to an action to take, e.g.: setScalarAction({G_ADD, LLT:scalar(1)}, {{1, WidenScalar}, // bit sizes [ 1, 31[ {32, Legal}, // bit sizes [32, 33[ {33, WidenScalar}, // bit sizes [33, 64[ {64, Legal}, // bit sizes [64, 65[ {65, NarrowScalar} // bit sizes [65, +inf[ }); Please note that most of the code to do the actual lowering of non-power-of-2 sized types is currently missing, this is just trying to make it possible for targets to specify what is legal, and how non-legal types should be legalized. Probably quite a bit of further work is needed in the actual legalizing and the other passes in GlobalISel to support non-power-of-2 sized types. I hope the documentation in LegalizerInfo.h and the examples provided in the various {Target}LegalizerInfo.cpp and LegalizerInfoTest.cpp explains well enough how this is meant to be used. This drops the need for LLT::{half,double}...Size(). Differential Revision: https://reviews.llvm.org/D30529 llvm-svn: 317560
1 parent 2375c92 commit af9814a

File tree

14 files changed

+1140
-435
lines changed

14 files changed

+1140
-435
lines changed

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h

+302-75
Large diffs are not rendered by default.

llvm/include/llvm/Support/LowLevelTypeImpl.h

-45
Original file line numberDiff line numberDiff line change
@@ -137,51 +137,6 @@ class LLT {
137137
return scalar(getScalarSizeInBits());
138138
}
139139

140-
/// Get a low-level type with half the size of the original, by halving the
141-
/// size of the scalar type involved. For example `s32` will become `s16`,
142-
/// `<2 x s32>` will become `<2 x s16>`.
143-
LLT halfScalarSize() const {
144-
assert(!IsPointer && getScalarSizeInBits() > 1 &&
145-
getScalarSizeInBits() % 2 == 0 && "cannot half size of this type");
146-
return LLT{/*isPointer=*/false, IsVector ? true : false,
147-
IsVector ? getNumElements() : (uint16_t)0,
148-
getScalarSizeInBits() / 2, /*AddressSpace=*/0};
149-
}
150-
151-
/// Get a low-level type with twice the size of the original, by doubling the
152-
/// size of the scalar type involved. For example `s32` will become `s64`,
153-
/// `<2 x s32>` will become `<2 x s64>`.
154-
LLT doubleScalarSize() const {
155-
assert(!IsPointer && "cannot change size of this type");
156-
return LLT{/*isPointer=*/false, IsVector ? true : false,
157-
IsVector ? getNumElements() : (uint16_t)0,
158-
getScalarSizeInBits() * 2, /*AddressSpace=*/0};
159-
}
160-
161-
/// Get a low-level type with half the size of the original, by halving the
162-
/// number of vector elements of the scalar type involved. The source must be
163-
/// a vector type with an even number of elements. For example `<4 x s32>`
164-
/// will become `<2 x s32>`, `<2 x s32>` will become `s32`.
165-
LLT halfElements() const {
166-
assert(isVector() && getNumElements() % 2 == 0 && "cannot half odd vector");
167-
if (getNumElements() == 2)
168-
return scalar(getScalarSizeInBits());
169-
170-
return LLT{/*isPointer=*/false, /*isVector=*/true,
171-
(uint16_t)(getNumElements() / 2), getScalarSizeInBits(),
172-
/*AddressSpace=*/0};
173-
}
174-
175-
/// Get a low-level type with twice the size of the original, by doubling the
176-
/// number of vector elements of the scalar type involved. The source must be
177-
/// a vector type. For example `<2 x s32>` will become `<4 x s32>`. Doubling
178-
/// the number of elements in sN produces <2 x sN>.
179-
LLT doubleElements() const {
180-
return LLT{IsPointer ? true : false, /*isVector=*/true,
181-
(uint16_t)(getNumElements() * 2), getScalarSizeInBits(),
182-
IsPointer ? getAddressSpace() : 0};
183-
}
184-
185140
void print(raw_ostream &OS) const;
186141

187142
bool operator==(const LLT &RHS) const {

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

+50-24
Original file line numberDiff line numberDiff line change
@@ -173,12 +173,18 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
173173

174174
MIRBuilder.setInstr(MI);
175175

176+
int64_t SizeOp0 = MRI.getType(MI.getOperand(0).getReg()).getSizeInBits();
177+
int64_t NarrowSize = NarrowTy.getSizeInBits();
178+
176179
switch (MI.getOpcode()) {
177180
default:
178181
return UnableToLegalize;
179182
case TargetOpcode::G_IMPLICIT_DEF: {
180-
int NumParts = MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() /
181-
NarrowTy.getSizeInBits();
183+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
184+
// NarrowSize.
185+
if (SizeOp0 % NarrowSize != 0)
186+
return UnableToLegalize;
187+
int NumParts = SizeOp0 / NarrowSize;
182188

183189
SmallVector<unsigned, 2> DstRegs;
184190
for (int i = 0; i < NumParts; ++i) {
@@ -191,9 +197,12 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
191197
return Legalized;
192198
}
193199
case TargetOpcode::G_ADD: {
200+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
201+
// NarrowSize.
202+
if (SizeOp0 % NarrowSize != 0)
203+
return UnableToLegalize;
194204
// Expand in terms of carry-setting/consuming G_ADDE instructions.
195-
int NumParts = MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() /
196-
NarrowTy.getSizeInBits();
205+
int NumParts = SizeOp0 / NarrowTy.getSizeInBits();
197206

198207
SmallVector<unsigned, 2> Src1Regs, Src2Regs, DstRegs;
199208
extractParts(MI.getOperand(1).getReg(), NarrowTy, NumParts, Src1Regs);
@@ -221,9 +230,12 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
221230
if (TypeIdx != 1)
222231
return UnableToLegalize;
223232

224-
int64_t NarrowSize = NarrowTy.getSizeInBits();
225-
int NumParts =
226-
MRI.getType(MI.getOperand(1).getReg()).getSizeInBits() / NarrowSize;
233+
int64_t SizeOp1 = MRI.getType(MI.getOperand(1).getReg()).getSizeInBits();
234+
// FIXME: add support for when SizeOp1 isn't an exact multiple of
235+
// NarrowSize.
236+
if (SizeOp1 % NarrowSize != 0)
237+
return UnableToLegalize;
238+
int NumParts = SizeOp1 / NarrowSize;
227239

228240
SmallVector<unsigned, 2> SrcRegs, DstRegs;
229241
SmallVector<uint64_t, 2> Indexes;
@@ -270,12 +282,12 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
270282
return Legalized;
271283
}
272284
case TargetOpcode::G_INSERT: {
273-
if (TypeIdx != 0)
285+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
286+
// NarrowSize.
287+
if (SizeOp0 % NarrowSize != 0)
274288
return UnableToLegalize;
275289

276-
int64_t NarrowSize = NarrowTy.getSizeInBits();
277-
int NumParts =
278-
MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() / NarrowSize;
290+
int NumParts = SizeOp0 / NarrowSize;
279291

280292
SmallVector<unsigned, 2> SrcRegs, DstRegs;
281293
SmallVector<uint64_t, 2> Indexes;
@@ -330,9 +342,11 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
330342
return Legalized;
331343
}
332344
case TargetOpcode::G_LOAD: {
333-
unsigned NarrowSize = NarrowTy.getSizeInBits();
334-
int NumParts =
335-
MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() / NarrowSize;
345+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
346+
// NarrowSize.
347+
if (SizeOp0 % NarrowSize != 0)
348+
return UnableToLegalize;
349+
int NumParts = SizeOp0 / NarrowSize;
336350
LLT OffsetTy = LLT::scalar(
337351
MRI.getType(MI.getOperand(1).getReg()).getScalarSizeInBits());
338352

@@ -357,9 +371,11 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
357371
return Legalized;
358372
}
359373
case TargetOpcode::G_STORE: {
360-
unsigned NarrowSize = NarrowTy.getSizeInBits();
361-
int NumParts =
362-
MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() / NarrowSize;
374+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
375+
// NarrowSize.
376+
if (SizeOp0 % NarrowSize != 0)
377+
return UnableToLegalize;
378+
int NumParts = SizeOp0 / NarrowSize;
363379
LLT OffsetTy = LLT::scalar(
364380
MRI.getType(MI.getOperand(1).getReg()).getScalarSizeInBits());
365381

@@ -381,9 +397,11 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
381397
return Legalized;
382398
}
383399
case TargetOpcode::G_CONSTANT: {
384-
unsigned NarrowSize = NarrowTy.getSizeInBits();
385-
int NumParts =
386-
MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() / NarrowSize;
400+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
401+
// NarrowSize.
402+
if (SizeOp0 % NarrowSize != 0)
403+
return UnableToLegalize;
404+
int NumParts = SizeOp0 / NarrowSize;
387405
const APInt &Cst = MI.getOperand(1).getCImm()->getValue();
388406
LLVMContext &Ctx = MIRBuilder.getMF().getFunction()->getContext();
389407

@@ -410,9 +428,12 @@ LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
410428
// ...
411429
// AN = BinOp<Ty/N> BN, CN
412430
// A = G_MERGE_VALUES A1, ..., AN
413-
unsigned NarrowSize = NarrowTy.getSizeInBits();
414-
int NumParts =
415-
MRI.getType(MI.getOperand(0).getReg()).getSizeInBits() / NarrowSize;
431+
432+
// FIXME: add support for when SizeOp0 isn't an exact multiple of
433+
// NarrowSize.
434+
if (SizeOp0 % NarrowSize != 0)
435+
return UnableToLegalize;
436+
int NumParts = SizeOp0 / NarrowSize;
416437

417438
// List the registers where the destination will be scattered.
418439
SmallVector<unsigned, 2> DstRegs;
@@ -854,7 +875,12 @@ LegalizerHelper::fewerElementsVector(MachineInstr &MI, unsigned TypeIdx,
854875
case TargetOpcode::G_ADD: {
855876
unsigned NarrowSize = NarrowTy.getSizeInBits();
856877
unsigned DstReg = MI.getOperand(0).getReg();
857-
int NumParts = MRI.getType(DstReg).getSizeInBits() / NarrowSize;
878+
unsigned Size = MRI.getType(DstReg).getSizeInBits();
879+
int NumParts = Size / NarrowSize;
880+
// FIXME: Don't know how to handle the situation where the small vectors
881+
// aren't all the same size yet.
882+
if (Size % NarrowSize != 0)
883+
return UnableToLegalize;
858884

859885
MIRBuilder.setInstr(MI);
860886

0 commit comments

Comments
 (0)