SPRKPY1016
pyspark.sql.functions.collect_set has a workaround
Last updated
pyspark.sql.functions.collect_set has a workaround
Last updated
This issue code has been deprecated since
Message: pyspark.sql.functions.collect_set has a workaround
Category: Warning.
This issue appears when the tool detects the usage of which has a workaround.
Input
Using collect_set to get the elements of colname without duplicates:
Output
SMA returns the EWI SPRKPY1016 over the line where collect_set is used, so you can use to identify where to fix.
Recommended fix
Use function array_agg, and add a second argument with the value True.
For more support, you can email us at or post an issue .