Optimize chebyshev monomial by Krishn1412 · Pull Request #2510 · google/heir

Krishn1412 · 2026-01-07T18:54:26Z

Fixes issue #2413.
Uses a cost heuristic based on MultiplicativeDepthVisitorImpl to estimate polynomial evaluation complexity.
Computes and compares the multiplicative depth of Chebyshev (PS) and monomial (Horner) evaluation DAGs.
Selects the evaluation strategy with lower multiplicative depth.

j2kun · 2026-01-08T18:05:27Z

lib/Transforms/LowerPolynomialEval/Patterns.cpp

 #include "lib/Utils/Polynomial/Horner.h"
 #include "lib/Utils/Polynomial/PatersonStockmeyer.h"
 #include "lib/Utils/Polynomial/Polynomial.h"
+#include "lib/Utils/Polynomial/PolynomialTestVisitors.h"


nit: since this file was intended for unit testing only, and now it's used outside of a unit test, the MultiplicativeDepthVisitor should be extracted to a standalone library.

j2kun · 2026-01-08T18:09:21Z

lib/Transforms/LowerPolynomialEval/Patterns.cpp

+  double chebDepth = depthVisitor.process(chebDag);
+  double monoDepth = depthVisitor.process(monoDag);
+
+  bool useMonomial = chebDepth > monoDepth;


This code is happening inside of a function that suggests the Paterson Stockmeyer Chebyshev method will be used without qualification.

In addition, if the pass option specifies that "pscheb" method must be used, I don't think the pass should second-guess it and use a different method, even if it's more efficient.

So what we should instead do here is:

Extract the construction of the DAG and the analysis into functions that are outside of the RewritePattern implementation.

In LowerPolynomialEval where the method is "auto", construct the dags, do the analysis on the depth, and then use that to determine which rewrite pattern to apply.

As I understand from the code, LowerPolynomialEval only registers the rewrite patterns, the actual rewriting happens later when MLIR applies them. At registration time we don’t have access to the polynomials, so any analysis can’t be done there? Can we instead do the analysis inside matchAndRewrite, where the polynomial.eval op is available. In automatic mode, a pattern can return failure() if it decides it’s too expensive or unstable, allowing MLIR to try the other patterns. Apologies for the delay!

Anything inside runOnOperation has access to the entire IR, but yes, you can move the code that actually mutates the IR into helper functions and then do the analysis inside a smaller number of patterns.

j2kun · 2026-01-08T18:09:55Z

tests/Transforms/lower_polynomial_eval/chebyshev_to_monomial.mlir

+
+module {
+  func.func @chebyshev(%ct: f32) -> f32 {
+    %ct_0 = polynomial.eval #poly, %ct {coefficients = [0.0, 0.75, 0.0, 0.25], domain_lower = -1.000000e+00 : f64, domain_upper = 1.000000e+00 : f64} : f32


This file needs at least one // CHECK: ... statement to assert the output is correct.

j2kun · 2026-01-08T18:29:01Z

lib/Transforms/LowerPolynomialEval/Patterns.cpp

+  double chebDepth = depthVisitor.process(chebDag);
+  double monoDepth = depthVisitor.process(monoDag);
+
+  bool useMonomial = chebDepth > monoDepth;


The second problem listed in #2413 which is not covered by this PR is numerical stability. The reason we use the Chebyshev basis instead of the monomial basis is that, for larger-degree polynomials, the monomial basis coefficients of high-degree terms will necessarily grow to be quite small (e.g., 1e-15) but cannot be ignored because of their large influence on the evaluated result, while Chebyshev basis coefficients remain relatively well normalized and small magnitude coefficients can be dropped without influencing the output.

So the other check that needs to occur to allow the monomial lowering is: will the monomial representation be unstable? While it may depend on which FHE scheme is being used and what precision is supported in that scheme, a good place to start would be to compute the condition number of polynomial evaluation for monomials, in https://epubs.siam.org/doi/10.1137/1.9780898718027.ch5 (Higham's Accuracy and Stability of Numerical Algorithms). However, that requires knowing the right value of x in advance, which may not be true in this case. We should have access to lower and upper bounds on the domain, so maybe sampling a few values in the domain would suffice.

Krishn1412 added 4 commits January 1, 2026 14:16

First draft

7f4541f

changes

ce67d91

Changes

a3a2836

Update Patterns.cpp

a7084b0

j2kun requested changes Jan 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize chebyshev monomial#2510

Optimize chebyshev monomial#2510
Krishn1412 wants to merge 4 commits intogoogle:mainfrom
Krishn1412:optimize_chebyshev_monomial

Krishn1412 commented Jan 7, 2026

Uh oh!

j2kun Jan 8, 2026

Uh oh!

j2kun Jan 8, 2026

Uh oh!

Krishn1412 Feb 5, 2026

Uh oh!

j2kun Feb 23, 2026

Uh oh!

j2kun Jan 8, 2026

Uh oh!

j2kun Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Krishn1412 commented Jan 7, 2026

Uh oh!

j2kun Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

j2kun Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Krishn1412 Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

j2kun Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

j2kun Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

j2kun Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants