Skip to content
Advertisement

Tag: apache-spark

How does spark SQL access databases

Suppose you access a SQL database with spark SQL. With RDD spark partitions the data into many different parts that all together make the data set. My question is how does Spark SQL manages this access from the N nodes to the database. I can see several possibilities: Each nodes of the RDD access to the database and builds up

Advertisement