Local Spark and Yarn Spark giving different results - Stack Overflow

programmer | admin | 2025-03-11 | 1 view | 0 comments

I have a function called runSparkFlow() that takes a SparkSession object and a HashMap of Dataset<Row>. This map of datasets is created by a function called getSourceDatasets(). I run both functions in local mode and on YARN, and in both cases they read the same tables, that is, exactly the same data.

But the number of output rows differs between the local run and the YARN run. I looked at the DAG visualisation for both runs but could not figure out at which point the difference is introduced.

I just want to understand conceptually: what are the possible points where such a difference can arise? For example, one case is filtering on a timestamp column, where a timezone difference between the two environments can change which rows pass the filter (although that is not the case here).
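To make the timezone case concrete, here is a minimal sketch in plain Java (no Spark involved) of how the same instants, filtered by calendar date, give different row counts depending on the zone used to interpret them. The class name `TimezoneFilterDemo`, the helper `countOnDate`, and the sample timestamps are all made up for illustration. In Spark the equivalent knobs would be the JVM default timezone on the driver and executors and the `spark.sql.session.timeZone` setting, which may differ between a local machine and cluster nodes.

```java
import java.time.Instant;
import java.time.LocalDate;
import java.time.ZoneId;
import java.util.List;

public class TimezoneFilterDemo {

    // Hypothetical helper: counts events whose calendar date,
    // as seen in the given zone, equals `day`.
    public static long countOnDate(List<Instant> events, LocalDate day, ZoneId zone) {
        return events.stream()
                .filter(ts -> ts.atZone(zone).toLocalDate().equals(day))
                .count();
    }

    public static void main(String[] args) {
        // Made-up sample data: three instants stored in UTC.
        List<Instant> events = List.of(
                Instant.parse("2025-03-10T20:30:00Z"),
                Instant.parse("2025-03-11T10:00:00Z"),
                Instant.parse("2025-03-11T12:00:00Z"));
        LocalDate day = LocalDate.of(2025, 3, 11);

        // The first instant is still 10 March in UTC, but already
        // 11 March at 02:00 in Asia/Kolkata (UTC+05:30).
        System.out.println("UTC:          " + countOnDate(events, day, ZoneId.of("UTC")));          // 2
        System.out.println("Asia/Kolkata: " + countOnDate(events, day, ZoneId.of("Asia/Kolkata"))); // 3
    }
}
```

If the two environments resolve to different zones, a date or timestamp filter near a day boundary can keep or drop rows in exactly this way, even though the underlying data is identical.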
