spark.read vs spark.sql - caching issues?
For some reason, when we bring data into a data frame in Notebooks and we use spark.read we were seeing old data that has been removed from our lakehouse table for over a few days. When we bring in data using spark.sql data is correct. Is there a way that the spark.read is caching pretty old data?
vs.
df = spark.sql("SELECT * FROM SilverLakehouse.dbo.datatable")
0
1 comment
Justin Sweet
2
spark.read vs spark.sql - caching issues?
Learn Microsoft Fabric
skool.com/microsoft-fabric
Helping passionate analysts, data engineers, data scientists (& more) to advance their careers on the Microsoft Fabric platform.
Leaderboard (30-day)
powered by