Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial #915

woojucy · 2025-01-04T12:08:16Z

Description

In the current implementation of univariate::SparsePolynomial::from_coefficients_vec, duplicate terms with the same degree are not explicitly handled. As a result, if multiple terms with the same degree are provided, they remain as separate entries in the resulting sparse polynomial. This can lead to inconsistencies in polynomial operations and unnecessary redundancy.

Current Behavior:
Given an input like:

vec![
    (0, F::from(2)), 
    (0, F::from(1)), 
    (0, F::from(1)), 
    (3, F::from(5)),
    (3, F::from(3)),
    (4, F::from(0)),
]

The resulting polynomial contains duplicate terms without merging them.

Proposed Improvement:

Sort Terms by Degree: Ensure terms are sorted by degree.
Merge Duplicate Terms: For terms with the same degree, sum their coefficients into a single term.
Remove Zero Coefficient Terms: Eliminate terms where the coefficient becomes zero after merging.

Proposed Implementation Example:

fn from_coefficients_vec(mut coeffs: Vec<(usize, F)>) -> Self {
    // Sort terms by degree
    coeffs.sort_by(|(c1, _), (c2, _)| c1.cmp(c2));

    // Merge duplicate terms
    let mut merged_coeffs: Vec<(usize, F)> = Vec::new();
    for (degree, coeff) in coeffs {
        if let Some((last_degree, last_coeff)) = merged_coeffs.last_mut() {
            if *last_degree == degree {
                *last_coeff += coeff; // Combine coefficients for duplicate degrees
                continue;
            }
        }
        merged_coeffs.push((degree, coeff));
    }

    // Remove zero coefficient terms
    merged_coeffs.retain(|(_, coeff)| !coeff.is_zero());

    // Create the SparsePolynomial
    Self { coeffs: merged_coeffs }
}

Expected Behavior:
For the same input:

[(0, F::from(4)), (3, F::from(8))]

The resulting polynomial will now be consistent and free from redundant terms.

Why is this change beneficial?

Ensures clarity and consistency in sparse polynomial representations.
Prevents redundant terms, which may cause unexpected results in polynomial operations.
Aligns with expected mathematical representations of polynomials.

Before we can merge this PR, please make sure that all the following items have been
checked off. If any of the checklist items are not applicable, please leave them but
write a little note why.

Targeted PR against correct branch (master)
Linked to GitHub issue with discussion and accepted design OR have an explanation in the PR that describes this work.
Wrote unit tests
Updated relevant documentation in the code
Added a relevant changelog entry to the Pending section in CHANGELOG.md
Re-reviewed Files changed in the GitHub PR explorer

z-tech · 2025-01-04T16:39:24Z

Hi, thank you for raising this PR

Do we think this case of inputs with duplicate terms is occurring organically?
Have we considered instead that the function should reject when the vector is not already simplified?

At the least, approach 2 will force callers to become aware of situation 1, which seems to be the focus of the PR.

To do this safely, I think it would be more prudent to open a function call with a strongly defined "valid input" and then later deprecate this one so that callers can at their own pace address the potential unexpected results in polynomial operations mentioned.

What are thoughts?

woojucy · 2025-01-06T11:41:42Z

That's a good point. We need to examine how frequently duplicate terms naturally occur.

Additionally, we should also consider the issue of "consistency" in the function's behavior. For example, comparing the two polynomial cases below:

univariate::SparsePolynomial example:

let poly = univariate::SparsePolynomial::from_coefficients_vec(vec![
    (0, Fq::from(2)), 
    (0, Fq::from(1)), 
    (0, Fq::from(1)), 
    (3, Fq::from(5)),
    (3, Fq::from(3)),
    (4, Fq::from(0)),
]);

multivariate::SparsePolynomial example:

let poly2 = multivariate::SparsePolynomial::from_coefficients_vec(
    1,
    vec![
        (Fq::from(0), SparseTerm::new(vec![(0, 4)])),
        (Fq::from(3), SparseTerm::new(vec![(0, 3)])),
        (Fq::from(5), SparseTerm::new(vec![(0, 3)])),
        (Fq::from(1), SparseTerm::new(vec![])),
        (Fq::from(1), SparseTerm::new(vec![])),
        (Fq::from(2), SparseTerm::new(vec![])),
    ],
);

In the first case, it seems the function does not handle duplicate terms explicitly, while in the second case, the input is considered valid and behaves as expected. While we would expect both polynomials to behave consistently, the first function appears to lack clear input validation, resulting in inconsistency.

To address this issue, do you think introducing a new function call that enforces valid input, followed by a gradual deprecation of the current function, would be sufficient as you suggested?

I’d appreciate your thoughts on this.

Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial

a3fb3b4

woojucy requested review from a team as code owners January 4, 2025 12:08

woojucy requested review from z-tech, Pratyush and tyshko-rostyslav and removed request for a team January 4, 2025 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial #915

Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial #915

woojucy commented Jan 4, 2025

z-tech commented Jan 4, 2025

woojucy commented Jan 6, 2025

Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial #915

Are you sure you want to change the base?

Update CHANGELOG: Fix handling of duplicate terms in SparsePolynomial #915

Conversation

woojucy commented Jan 4, 2025

Description

z-tech commented Jan 4, 2025

woojucy commented Jan 6, 2025