SPRKPY1042
pyspark.sql.functions.posexplode
Message: pyspark.sql.functions.posexplode has a workaround
Category: Warning
Description
This issue appears when the tool detects the usage of pyspark.sql.functions.posexplode which has a workaround.
Scenarios
There are a couple of scenarios that this method can handle depending on the type of column it is passed as a parameter, it can be a list of values
or a map/directory (keys/values)
.
Scenario 1
Input
Below is an example of the usage of posexplode
passing as a parameter of a list of values.
Output
The tool adds the EWI SPRKPY1042
indicating that a workaround can be implemented.
Recommended fix
For having the same behavior, use the method functions.flatten, drop extra columns, and rename index and value column names.
Scenario 2
Input
Below is another example of the usage of posexplode
passing as a parameter a map/dictionary (keys/values)
Output
The tool adds the EWI SPRKPY1042
indicating that a workaround can be implemented.
Recommended fix
As a workaround, you can use functions.row_number to get the position and functions.explode with the name of the field to get the value the key/value for dictionaries.
Note: using row_number is not fully equivalent, because it starts with 1 (not zero as spark method)
Additional recommendations
For more support, you can email us at sma-support@snowflake.com or post an issue in the SMA.
Last updated