Add project-context.md for datastream-to-spanner by aasthabharill · Pull Request #3902 · GoogleCloudPlatform/DataflowTemplates

aasthabharill · 2026-06-09T12:08:47Z

b/521743991

codecov · 2026-06-09T12:15:59Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 62.04%. Comparing base (3b2def5) to head (da5aded).
⚠️ Report is 68 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff              @@
##               main    #3902      +/-   ##
============================================
+ Coverage     53.73%   62.04%   +8.30%     
+ Complexity     6743     2971    -3772     
============================================
  Files          1087      532     -555     
  Lines         66794    32141   -34653     
  Branches       7478     3515    -3963     
============================================
- Hits          35890    19941   -15949     
+ Misses        28477    11179   -17298     
+ Partials       2427     1021    -1406

Components	Coverage Δ
spanner-templates	`80.43% <ø> (+7.59%)`	⬆️
spanner-import-export	`∅ <ø> (∅)`
spanner-live-forward-migration	`90.16% <ø> (+9.21%)`	⬆️
spanner-live-reverse-replication	`84.37% <ø> (+7.27%)`	⬆️
spanner-bulk-migration	`92.58% <ø> (+1.47%)`	⬆️
gcs-spanner-dv	`90.39% <ø> (+4.63%)`	⬆️
see 657 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

gemini-code-assist · 2026-06-10T04:46:39Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces documentation and architectural diagrams for the datastream-to-spanner Dataflow template. The goal is to provide a centralized source of truth regarding the pipeline's design, technical constraints, and operational guidelines to assist future development and maintenance.

Highlights

Project Documentation: Added a comprehensive project-context.md file to provide AI agents and developers with architectural insights, coding standards, and best practices for the datastream-to-spanner template.
Architecture Visualization: Included an updated architecture diagram in both .dot and .svg formats to reflect the current data flow and component interactions.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds comprehensive architectural documentation for the Datastream to Spanner Dataflow template, including a Graphviz DOT file, its corresponding SVG diagram, and a detailed project context markdown file. Feedback on the architecture diagram points out a misleading label 'Write To DLQ2' and suggests renaming it to 'Write To Severe DLQ' to correctly represent that severe errors are routed to a subdirectory of the main DLQ rather than a separate resource.

darshan-sj · 2026-06-17T05:17:53Z

+## Technical Details
+
+*   **Tech Stack & Versions:**
+    *   **Languages:** Java 17


This information needs to be derived from the main readme page. How does AI agent knows to do that? Where is that instruction?

darshan-sj · 2026-06-17T05:21:05Z

+*   **Coding Standards & Best Practices:**
+    *   Individual CEs are processed separately for parallel scaling, rather than grouping them into the original source transactions. Consistency is managed using lateness checks on the Shadow Tables.
+    *   **Avoid Serial Processing:** Do not attempt to group events by transaction or serially order them. The approach relies on parallel workers, taking advantage of Cloud Dataflow's scale.
+    *   **Avoid GroupBy:** Do not use `GroupByKey` or internal worker state to filter stale events before writing. It doesn't scale well and complicates state recovery. Always use Shadow Tables for the lateness check.


I'm not very sure about this point. I don't understand what it is trying to say.

pull-request-size Bot added the size/XXL label Jun 9, 2026

Add project context for datastream-to-spanner

5595f80

aasthabharill force-pushed the datastream-to-spanner-context branch from 610c543 to 5595f80 Compare June 9, 2026 12:11

pull-request-size Bot added size/L and removed size/XXL labels Jun 9, 2026

aasthabharill added the addition New feature or request label Jun 9, 2026

VardhanThigle reviewed Jun 9, 2026

View reviewed changes

Comment thread v2/datastream-to-spanner/project-context.md Outdated

Clarify Datastream/Dataflow boundary

a114714

aasthabharill requested a review from darshan-sj June 10, 2026 04:46

aasthabharill marked this pull request as ready for review June 10, 2026 04:46

aasthabharill requested a review from a team as a code owner June 10, 2026 04:46

aasthabharill requested a review from sm745052 June 10, 2026 04:46

gemini-code-assist Bot reviewed Jun 10, 2026

View reviewed changes

Comment thread v2/datastream-to-spanner/architecture.dot Outdated

shreyakhajanchi reviewed Jun 10, 2026

View reviewed changes

Comment thread v2/datastream-to-spanner/project-context.md Outdated

aasthabharill added 3 commits June 15, 2026 17:15

review1

450d57c

readme call

b56244d

reference improvements

da5aded

darshan-sj reviewed Jun 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add project-context.md for datastream-to-spanner#3902

Add project-context.md for datastream-to-spanner#3902
aasthabharill wants to merge 5 commits into
GoogleCloudPlatform:mainfrom
aasthabharill:datastream-to-spanner-context

aasthabharill commented Jun 9, 2026

Uh oh!

codecov Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

Uh oh!

gemini-code-assist Bot commented Jun 10, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

darshan-sj Jun 17, 2026

Uh oh!

darshan-sj Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

aasthabharill commented Jun 9, 2026

Uh oh!

codecov Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

gemini-code-assist Bot commented Jun 10, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

darshan-sj Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

darshan-sj Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov Bot commented Jun 9, 2026 •

edited

Loading