feat: parallelize parquet load by row group by iamlinjunhong · Pull Request #24808 · matrixorigin/matrixone

iamlinjunhong · 2026-06-03T08:32:24Z

What type of PR is this?

Which issue(s) this PR fixes:

What this PR does / why we need it:

Plan Parquet LOAD around file and row-group fanout, carry shard metadata through ExternalScan, and keep S3 prefetch behavior bounded for sharded readers.

Harden unsupported option handling, add Parquet profile stats, and cover compile/runtime/BVT regressions for schema, conversion, rollback, and parallel-load paths.

Plan Parquet LOAD around file and row-group fanout, carry shard metadata through ExternalScan, and keep S3 prefetch behavior bounded for sharded readers. Harden unsupported option handling, add Parquet profile stats, and cover compile/runtime/BVT regressions for schema, conversion, rollback, and parallel-load paths.

qodo-code-review · 2026-06-03T08:32:28Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

Add the missing DATE32 Parquet resource used by load_data_parquet.sql so the 4.0-dev BVT can load the cherry-picked date32-to-DATETIME case instead of failing on a missing file.

iamlinjunhong requested review from XuPeng-SH, aunjgr, heni02 and ouyuanning as code owners June 3, 2026 08:32

iamlinjunhong temporarily deployed to ci June 3, 2026 08:35 — with GitHub Actions Inactive

iamlinjunhong had a problem deploying to ci June 3, 2026 08:35 — with GitHub Actions Failure

iamlinjunhong temporarily deployed to ci June 3, 2026 08:35 — with GitHub Actions Inactive

iamlinjunhong had a problem deploying to ci June 3, 2026 08:35 — with GitHub Actions Failure

iamlinjunhong had a problem deploying to ci June 3, 2026 08:35 — with GitHub Actions Error

iamlinjunhong temporarily deployed to ci June 3, 2026 08:35 — with GitHub Actions Inactive

iamlinjunhong had a problem deploying to ci June 3, 2026 08:35 — with GitHub Actions Failure

iamlinjunhong had a problem deploying to ci June 3, 2026 08:35 — with GitHub Actions Error

iamlinjunhong temporarily deployed to ci June 3, 2026 08:45 — with GitHub Actions Inactive

iamlinjunhong had a problem deploying to ci June 3, 2026 08:45 — with GitHub Actions Error

iamlinjunhong temporarily deployed to ci June 3, 2026 08:45 — with GitHub Actions Inactive

matrix-meow added the size/XXL Denotes a PR that changes 2000+ lines label Jun 3, 2026

test: add parquet date32 datetime fixture

16bcaf2

Add the missing DATE32 Parquet resource used by load_data_parquet.sql so the 4.0-dev BVT can load the cherry-picked date32-to-DATETIME case instead of failing on a missing file.

iamlinjunhong temporarily deployed to ci June 3, 2026 10:06 — with GitHub Actions Inactive

iamlinjunhong had a problem deploying to ci June 3, 2026 10:06 — with GitHub Actions Failure

ouyuanning approved these changes Jun 3, 2026

View reviewed changes

heni02 approved these changes Jun 3, 2026

View reviewed changes

iamlinjunhong temporarily deployed to ci June 3, 2026 10:49 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: parallelize parquet load by row group#24808

feat: parallelize parquet load by row group#24808
iamlinjunhong wants to merge 2 commits into
matrixorigin:4.0-devfrom
iamlinjunhong:d4-24254

iamlinjunhong commented Jun 3, 2026

Uh oh!

qodo-code-review Bot commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

iamlinjunhong commented Jun 3, 2026

What type of PR is this?

Which issue(s) this PR fixes:

What this PR does / why we need it:

Uh oh!

qodo-code-review Bot commented Jun 3, 2026

Qodo reviews are paused for this user.

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants