Dataset
org.apache.spark.sql.Dataset[T] => com.snowflake.snowpark.DataFrame
This section describes the mappings from org.apache.spark.sql.Dataset[T] to com.snowflake.snowpark.DataFrame. These methods are mapped to the DataFrame class since there is no Dataset class available in Snowpark.
filter(Column condition)
Mapped to method in com.snowflake.snowpark.DataFrame
orderBy(String sortCol, Seq[String] sortCols)
*
orderBy(String sortCol, String... sortCols)
*
repartition(partitionExprs: Column*)
N/A
Repartition is a Spark concept that is not needed in Snowpark
repartition(numPartitions: Int, partitionExprs: Column*)
N/A
Repartition is a Spark concept that is not needed in Snowpark
repartition(numPartitions: Int)
N/A
Repartition is a Spark concept that is not needed in Snowpark
repartitionByRange(*cols: Column*): DataFrame
N/A
Repartition by range is a Spark concept that is not needed in Snowpark
repartitionByRange(numPartitions: int, *cols: Column*): DataFrame
N/A
Repartition by range is a Spark concept that is not needed in Snowpark
transform(scala.Function1<Dataset,Dataset> t)
*
unionByName(Dataset other)
unionByName ( other: DataFrame ) : DataFrame
Pending: Functional comparison
unionByName(Dataset other, boolean allowMissingColumns)
**
Last updated
Was this helpful?