SPRKPY1085

pyspark.ml.feature.VectorAssembler

Message: pyspark.ml.feature.VectorAssembler is not supported.

Category: Warning

Description

The pyspark.ml.feature.VectorAssembler is not supported.

Scenario

Input code

VectorAssembler is used to combine several columns into a single vector.

data = [
        (1, 10.0, 20.0),
        (2, 25.0, 30.0),
        (3, 50.0, 60.0)
    ]

df = SparkSession.createDataFrame(data, schema=["Id", "col1", "col2"])
vector = VectorAssembler(inputCols=["col1", "col2"], output="cols")

Output code

The tool adds this EWI SPRKPY1085 on the output code to let you know that this class is not supported by Snowpark.

data = [
        (1, 10.0, 20.0),
        (2, 25.0, 30.0),
        (3, 50.0, 60.0)
    ]

df = spark.createDataFrame(data, schema=["Id", "col1", "col2"])
#EWI: SPRKPY1085 => The pyspark.ml.feature.VectorAssembler function is not supported.

vector = VectorAssembler(inputCols=["col1", "col2"], output="cols")

Recommended fix

The pyspark.ml.feature.VectorAssembler does not have a recommended fix.

Additional recommendations

Last updated