SPRKSCL1155
org.apache.spark.sql.functions.countDistinct
This issue code has been deprecated since Spark Conversion Core Version 4.3.2
Message: org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info
Category: Warning
Description
This issue appears when the SMA detects a use of the org.apache.spark.sql.functions.countDistinct function, which has a workaround.
Scenario
Input
The org.apache.spark.sql.functions.countDistinct function can be called in two ways: with column names (strings) as arguments, or with column objects.
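A minimal sketch of both call styles in Spark Scala follows. The sample data, the object name CountDistinctInput, and the local SparkSession setup are illustrative assumptions, not taken from the original page.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{col, countDistinct}

// Hypothetical driver object, for illustration only.
object CountDistinctInput {
  def main(args: Array[String]): Unit = {
    // Assumption: a local Spark session is sufficient for this example.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("countDistinct-example")
      .getOrCreate()
    import spark.implicits._

    // Illustrative sample data (assumed).
    val df = Seq(
      ("Alice", 1), ("Bob", 2), ("Alice", 3),
      ("Bob", 4), ("Alice", 1), ("Charlie", 5)
    ).toDF("name", "value")

    // Overload 1: column names passed as strings.
    val result1 = df.select(countDistinct("name", "value"))

    // Overload 2: column objects.
    val result2 = df.select(countDistinct(col("name"), col("value")))

    result1.show()
    result2.show()
    spark.stop()
  }
}
```

Both overloads count the distinct combinations of the listed columns, so the two results should be the same.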
Output
The SMA adds the EWI SPRKSCL1155 to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.
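The annotated output might look roughly like the fragment below. The exact layout of the EWI comment is an assumption built from the issue code and message shown at the top of this page, and df stands for a DataFrame defined elsewhere. This is a fragment for reading, not a compilable program.

```scala
// Fragment only: `df` is assumed to be an existing DataFrame with
// "name" and "value" columns. The comment format is an assumption.

/*EWI: SPRKSCL1155 => org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info*/
val result1 = df.select(countDistinct("name", "value"))

/*EWI: SPRKSCL1155 => org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info*/
val result2 = df.select(countDistinct(col("name"), col("value")))
```

The calls themselves are left unchanged, and the comment marks each place where the manual fix is needed.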
Recommended fix
As a workaround, you can use the count_distinct function. For the Spark overload that receives string arguments, you additionally have to convert the strings into column objects using the com.snowflake.snowpark.functions.col function.
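A minimal Snowpark Scala sketch of this workaround follows. The object name CountDistinctFix, the sample data, and the profile.properties connection file are hypothetical and used only for illustration.

```scala
import com.snowflake.snowpark.Session
import com.snowflake.snowpark.functions.{col, count_distinct}

// Hypothetical driver object, for illustration only.
object CountDistinctFix {
  def main(args: Array[String]): Unit = {
    // Assumption: connection settings live in a "profile.properties" file.
    val session = Session.builder.configFile("profile.properties").create

    // Illustrative sample data (assumed).
    val df = session.createDataFrame(Seq(
      ("Alice", 1), ("Bob", 2), ("Alice", 3),
      ("Bob", 4), ("Alice", 1), ("Charlie", 5)
    )).toDF("name", "value")

    // Former string overload: wrap each column name with col(...)
    // before passing it to count_distinct.
    val result1 = df.select(count_distinct(col("name"), col("value")))

    // Former column overload: only the function name changes.
    val result2 = df.select(count_distinct(col("name"), col("value")))

    result1.show()
    result2.show()
  }
}
```

After the fix both statements are identical, because the string overload reduces to the column overload once each name is wrapped in col.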
Additional recommendations
For more support, you can email us at sma-support@snowflake.com or post an issue in the SMA.