We have a ducklake server connected to postgres + s3 parquet files all in AWS (RDS + S3). The workload was tuned and we are able to get 20 seconds performance on first time query execution when duckdb is started in the machine. The same duckdb process inside JVM for the same tpch query responds only after 60 to 70 seconds. The subsequent query executions are faster both in the machine and JVM with approximately 3 second response time.
Do we have to tune some specific setting for duckdb with ducklake and parquet files to be performant when running inside JVM?
Thanks,
Prakash