Skip to content

Ducklake slowness with embedded duckdb server. #289

@prakashbalaji

Description

@prakashbalaji

We have a ducklake server connected to postgres + s3 parquet files all in AWS (RDS + S3). The workload was tuned and we are able to get 20 seconds performance on first time query execution when duckdb is started in the machine. The same duckdb process inside JVM for the same tpch query responds only after 60 to 70 seconds. The subsequent query executions are faster both in the machine and JVM with approximately 3 second response time.

Do we have to tune some specific setting for duckdb with ducklake and parquet files to be performant when running inside JVM?

Image Image

Thanks,
Prakash

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions