[LAYOUTS] Allow DistributedEncoding attributes to override get[Total]ElemsPerThread() #5980
base: main
Conversation
Change ODS to provide defaultImplementation, not methodBody. Judging from the comment, this was accidental. In the get[Total]ElemsPerThread() free functions, use the DistributedEncodingTrait if it is implemented. Downstream projects (specifically, OpenXLA's [SparseDotMetaEncoding](https://github.com/openxla/xla/blob/6772834e77115e7368a418ef71024540274c93b2/xla/backends/gpu/codegen/triton/ir/triton_xla_attrs.td#L22)) want to override these functions, but this was accidentally broken by triton-lang@61b5674.
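For illustration, here is a minimal sketch of the kind of downstream override this re-enables. Everything in it is hypothetical: the attribute name, its getParent() parameter, and the halving are stand-ins for the real OpenXLA code; it only shows an attribute that implements DistributedEncodingTrait shadowing the interface's default method.

```cpp
#include "triton/Dialect/TritonGPU/IR/Dialect.h"

using namespace mlir;
using namespace mlir::triton::gpu;

// Hypothetical attribute, assumed to be declared elsewhere (e.g. via ODS)
// with DistributedEncodingTrait and a `parent` encoding parameter.
unsigned MySparseMetaEncodingAttr::getTotalElemsPerThread(
    ArrayRef<int64_t> shape) const {
  // Illustrative only: report a different count than the parent layout would,
  // e.g. because each stored value packs metadata for several elements.
  auto parent = cast<DistributedEncodingTrait>(getParent());
  return parent.getTotalElemsPerThread(shape) / 2;
}
```

The ODS distinction matters here because a methodBody is emitted unconditionally into the interface and cannot be shadowed by an implementing attribute, whereas a defaultImplementation is used only when the attribute does not provide the method itself.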
One nit, otherwise LGTM
lib/Dialect/TritonGPU/IR/Dialect.cpp
Outdated
```cpp
if (auto distLayout = mlir::dyn_cast<DistributedEncodingTrait>(layout)) {
  return distLayout.getTotalElemsPerThread(shape);
}
```
You can even assert that this is indeed a DistributedEncoding as otherwise this function does not make any sense. Either that or directly change the input argument to have the right type.
something like:
```cpp
unsigned getTotalElemsPerThread(RankedTensorType t) {
  auto layout = cast<DistributedEncodingTrait>(t.getEncoding());
  return layout.getTotalElemsPerThread(t.getShape());
}
```
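For completeness, the other option from the comment above (keep the Attribute argument but assert that it is a distributed encoding) could look roughly like this; a rough sketch, not code from the PR:

```cpp
#include <cassert>

unsigned getTotalElemsPerThread(Attribute layout, ArrayRef<int64_t> shape) {
  // dyn_cast + assert makes the precondition explicit instead of silently
  // falling back to the linear-layout path for non-distributed encodings.
  auto distLayout = mlir::dyn_cast<DistributedEncodingTrait>(layout);
  assert(distLayout && "expected a distributed encoding");
  return distLayout.getTotalElemsPerThread(shape);
}
```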
Done.
lib/Dialect/TritonGPU/IR/Dialect.cpp
Outdated
```
@@ -39,11 +39,17 @@ LinearEncodingAttr toLinearEncoding(Attribute layout, ArrayRef<int64_t> shape) {
}

unsigned getTotalElemsPerThread(Attribute layout, ArrayRef<int64_t> shape) {
  if (auto distLayout = mlir::dyn_cast<DistributedEncodingTrait>(layout)) {
    return distLayout.getTotalElemsPerThread(shape);
```
so it returns a different value than what the linear layout does? This means you have a distributed layout that is not a linear layout?
lib/Dialect/TritonGPU/IR/Dialect.cpp
Outdated
```
@@ -39,11 +39,17 @@ LinearEncodingAttr toLinearEncoding(Attribute layout, ArrayRef<int64_t> shape) {
}

unsigned getTotalElemsPerThread(Attribute layout, ArrayRef<int64_t> shape) {
  if (auto distLayout = mlir::dyn_cast<DistributedEncodingTrait>(layout)) {
    return distLayout.getTotalElemsPerThread(shape);
  }
  return toLinearEncoding(layout, shape).getTotalElemsPerThread(shape);
```
it feels like this path is not reachable, can we remove it?
Done, if you mean the one where layout is not a DistributedEncodingTrait.
lib/Dialect/TritonGPU/IR/Dialect.cpp
Outdated
```
  return toLinearEncoding(layout, shape).getTotalElemsPerThread(shape);
}

SmallVector<unsigned> getElemsPerThread(Attribute layout,
                                        ArrayRef<int64_t> shape) {
  if (auto distLayout = mlir::dyn_cast<DistributedEncodingTrait>(layout)) {
    return distLayout.getElemsPerThread(shape);
  }
  return toLinearEncoding(layout, shape).getElemsPerThread(shape);
```
ditto
@chsigg, could you clarify whether your downstream layout is a linear layout or not?
In particular, whether it's a linear layout such that all its bases are either a power of two or zero (i.e. a DistributedEncoding).
It is, but the number of elements is different. That's why we would like to overload this function.
how can the layout be a linear layout but the number of elements be different than what is calculated by linear layout?
Sorry, I'm not really able to answer your questions well. The sparse metadata converts to linear layout by converting its parent layout. Maybe this is not correct; I'm not really familiar with the code. The LLVM type converter used to go through the interface's getTotalElemsPerThread(). I really don't mind going either way with this change, I was just trying to fix a bug.
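(For context, "converting via the parent layout" could look something like the sketch below; the attribute name, its getParent() accessor, and the exact toLinearLayout signature are assumptions, not the actual XLA implementation.)

```cpp
// Assumed sketch: the metadata encoding's linear-layout conversion simply
// reuses its parent encoding's conversion for the same shape.
LinearLayout MySparseMetaEncodingAttr::toLinearLayout(
    ArrayRef<int64_t> shape) const {
  return triton::gpu::toLinearLayout(shape, getParent());
}
```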
Yes, it makes sense to have the interface overridable. Since we are moving all our layouts to be based on linear layout, having layouts that don't follow the same property is likely to have other problems, which is why I ask. If the layout is not completely representable by LinearLayout, I expect other things will break. This patch itself is fine, but I think supporting layouts that don't map to linear layout is going to be a challenge (if this is what you need).
Thanks Thomas. I think the sparse metadata encoding should work fine with your plan to move to linear layouts everywhere. My relatively random attempts to fix …