[Bug]: PySpark reduction does not preserve root names with `nw.all()` #1780

FBruzzesi · 2025-01-10T10:47:09Z

PySpark reductions do not preserve the original root names when applied to `nw.all()`.

I am tempted to say that the issue is in this line. However, so far I have not managed to fix it without breaking code elsewhere 🙈

Running:

df.select(nw.all().n_unique())

with a LazyFrame backed by a PySpark DataFrame results in a column named `(count(DISTINCT a) + max(CAST((a IS NULL) AS INT)))` instead of `a`.
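To make the naming behaviour concrete, here is a toy, pure-Python sketch. It does not use PySpark or Narwhals: `Expr`, `n_unique`, `select_all_unaliased`, and `select_all_aliased` are hypothetical names invented for illustration. It assumes the backend names an unaliased column after its expression text, and that aliasing each reduction back to its root column would restore the original name. It is not the fix from the linked pull request.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Expr:
    """Toy stand-in for a backend column expression (hypothetical, not a PySpark class)."""

    sql: str                     # expression text, used as the default column name
    alias: Optional[str] = None  # explicit output name, if one was set

    @property
    def output_name(self) -> str:
        # Assumption: without an alias, the column is named after the expression text.
        return self.alias if self.alias is not None else self.sql


def n_unique(col: str) -> Expr:
    # Builds an expression whose text has the same shape as the name reported in the issue.
    return Expr(f"(count(DISTINCT {col}) + max(CAST(({col} IS NULL) AS INT)))")


def select_all_unaliased(columns: List[str]) -> List[str]:
    # Reported behaviour: each reduction keeps the expression-derived name.
    return [n_unique(c).output_name for c in columns]


def select_all_aliased(columns: List[str]) -> List[str]:
    # Expected behaviour: each reduction is aliased back to its root column name.
    return [Expr(n_unique(c).sql, alias=c).output_name for c in columns]


if __name__ == "__main__":
    print(select_all_unaliased(["a"]))
    print(select_all_aliased(["a"]))
```

With the root name tracked per column, the aliased variant returns the original names, which is the result the issue expects from `nw.all().n_unique()`.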

Expected results: the original column names are preserved.

Actual results: the columns are named after the PySpark expressions.

Narwhals version: latest



camriddell linked a pull request Jan 10, 2025 that will close this issue

fix root names in pyspark reduction with nw.all() #1787

