SPRKPY1085
pyspark.ml.feature.VectorAssembler
Description
Scenario
data = [
(1, 10.0, 20.0),
(2, 25.0, 30.0),
(3, 50.0, 60.0)
]
df = SparkSession.createDataFrame(data, schema=["Id", "col1", "col2"])
vector = VectorAssembler(inputCols=["col1", "col2"], output="cols")Additional recommendations
Last updated
