SPRKSCL1135
org.apache.spark.sql.functions.mean
This issue code has been deprecated since Spark Conversion Core 4.3.2
Message: org.apache.spark.sql.functions.mean has a workaround, see documentation for more info
Category: Warning
Description
This issue appears when the SMA detects a use of the org.apache.spark.sql.functions.mean function, which has a workaround.
Scenario
Input
Below is an example of the org.apache.spark.sql.functions.mean function, first used with a column name as an argument and then with a column object.
val df = Seq(1, 3, 10, 1, 3).toDF("value")
val result1 = df.select(mean("value"))
val result2 = df.select(mean(col("value")))
Output
The SMA adds the EWI SPRKSCL1135 to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.
val df = Seq(1, 3, 10, 1, 3).toDF("value")
/*EWI: SPRKSCL1135 => org.apache.spark.sql.functions.mean has a workaround, see documentation for more info*/
val result1 = df.select(mean("value"))
/*EWI: SPRKSCL1135 => org.apache.spark.sql.functions.mean has a workaround, see documentation for more info*/
val result2 = df.select(mean(col("value")))
Recommended fix
Snowpark has an equivalent mean function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.
For the overload that receives a string argument, you can convert the string into a column object using the com.snowflake.snowpark.functions.col function as a workaround.
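import com.snowflake.snowpark.functions.{col, mean}
val df = Seq(1, 3, 10, 1, 3).toDF("value")
// String overload: wrap the column name with col(...) so that mean receives a column object
val result1 = df.select(mean(col("value")))
// Column overload: supported as is, no change needed
val result2 = df.select(mean(col("value")))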
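If the string overload appears at many call sites, one possible way to apply the workaround is a small wrapper that takes the column name and delegates to the Snowpark mean function. The sketch below is only an illustration: the object name SparkCompat and the method name meanOf are hypothetical and are not part of Snowpark or the SMA. It assumes the Snowpark library is on the classpath.

```scala
import com.snowflake.snowpark.Column
import com.snowflake.snowpark.functions.{col, mean}

// Hypothetical helper, not part of Snowpark or the SMA.
object SparkCompat {
  // Mirrors the Spark overload mean(columnName: String) by converting
  // the name to a column object before calling the Snowpark mean function.
  def meanOf(columnName: String): Column = mean(col(columnName))
}
```

With this helper in scope, a call such as df.select(mean("value")) could be rewritten as df.select(SparkCompat.meanOf("value")), which is equivalent to writing df.select(mean(col("value"))) by hand.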
Additional recommendations
For more support, you can email us at [email protected] or post an issue in the SMA.