Hi! And thanks so much for writing this great package!
When I run the following command:
df = spark.read.format("com.elastacloud.spark.excel").option("cellAddress", "A1").load(file_location)
I get the following error:
java.lang.OutOfMemoryError: Java heap space
The excel file is 218MB roughly 750K rows with ~50 or so fields (no long text strings).
and I'm running in Azure Databricks: 9.1 LTS (includes Apache Spark 3.1.2, Scala 2.12)
Running: Standard_F4 with 8GB Memory and 4 Cores
I downloaded the following JAR and Installed on the Cluster (Above): spark_excel_3_1_2_0_1_7.jar
Any Help or Advice would be appreciated.
Hi! And thanks so much for writing this great package!
When I run the following command:
df = spark.read.format("com.elastacloud.spark.excel").option("cellAddress", "A1").load(file_location)I get the following error:
java.lang.OutOfMemoryError: Java heap space
The excel file is 218MB roughly 750K rows with ~50 or so fields (no long text strings).
and I'm running in Azure Databricks: 9.1 LTS (includes Apache Spark 3.1.2, Scala 2.12)
Running: Standard_F4 with 8GB Memory and 4 Cores
I downloaded the following JAR and Installed on the Cluster (Above): spark_excel_3_1_2_0_1_7.jar
Any Help or Advice would be appreciated.