Skip to content

Recipe Modularization, DAG Integration, and BigQuery Integration#2715

Open
DannyLiCom wants to merge 1 commit intomainfrom
lidanny/pw-DAG-integrate2
Open

Recipe Modularization, DAG Integration, and BigQuery Integration#2715
DannyLiCom wants to merge 1 commit intomainfrom
lidanny/pw-DAG-integrate2

Conversation

@DannyLiCom
Copy link
Collaborator

@DannyLiCom DannyLiCom commented Nov 19, 2025

Description

When integrating Recipe Modularization, DAG Integration, and BigQuery, the DAG will pass in a command that includes multiple command-line flags.

Tests

Users can use commands similar to the following to run tests

python3 -m benchmarks.recipes.pw_mcjax_benchmark_recipe \
--user='user' \
--cluster_name='pw-scale-test-v5e-32' \
--project='cloud-tpu-multipod-dev' \
--zone='us-south1-a' \
--device_type='v5litepod-32' \
--benchmark_steps=20 \
--num_slices_list='2' \
--server_image='gcr.io/tpu-prod-env-one-vm/lidanny/unsanitized_server:latest' \
--proxy_image='gcr.io/tpu-prod-env-one-vm/lidanny/unsanitized_proxy_server:latest' \
--runner='gcr.io/tpu-prod-env-one-vm/lidanny_latest:latest' \
--selected_model_framework='pathways' \
--selected_model_names='llama3_1_8b_8192_v5e_256' \
--priority='medium' \
--max_restarts=0 \
--bq_enable=True \
--bq_db_project='cloud-tpu-multipod-dev' \
--bq_db_dataset="chzheng_test_100steps"

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch from a961644 to f2a46a6 Compare November 19, 2025 05:46
@DannyLiCom DannyLiCom marked this pull request as ready for review November 19, 2025 06:05
@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch from f2a46a6 to 3840670 Compare November 23, 2025 16:39
@DannyLiCom DannyLiCom requested a review from notabee as a code owner November 23, 2025 16:39
@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch 2 times, most recently from 8adde2d to b08ea95 Compare November 28, 2025 02:17


def handle_cmd_args(cluster_config: XpkClusterConfig, *actions: str, **kwargs) -> bool:
def handle_cmd_args(cluster_config: XpkClusterConfig, is_delete: bool, user: str, **kwargs) -> bool:
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since the original recipe could only accept the --delete flag, this section had to be modified to allow it to accept multiple flags.

parser_utils.add_arguments(parser)
args = parser.parse_args()

if len(sys.argv) > 2:
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This logic is implemented to determine whether multiple flags are being used.

@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch 3 times, most recently from 0303bcf to 78b8073 Compare December 17, 2025 09:02
@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch 3 times, most recently from ce0fc18 to c37edc0 Compare December 29, 2025 02:19
@codecov
Copy link

codecov bot commented Dec 29, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

@github-actions
Copy link

This PR has been automatically marked as stale because it has not had recent activity. It will be closed soon if no further activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale Automatically applied to stale PRs. label Jan 28, 2026
@github-actions
Copy link

github-actions bot commented Feb 4, 2026

This PR was closed because it has been inactive for a while. Please reopen it if you are still working on it.

@github-actions github-actions bot removed the stale Automatically applied to stale PRs. label Feb 5, 2026
@SujeethJinesh
Copy link
Collaborator

LGTM, but I would just verify that the imports work properly at the top of the files. Since the MaxText modularization refactor, we may need to check that since a lot of these changes were from before that.

Copy link
Collaborator

@aireenmei aireenmei left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the work! I run the PR through AI and got these comments: https://paste.googleplex.com/6085130735190016. I looked at some points and they make sense to me, so I left inline comments. But there are more points, could you take a look?

@github-actions
Copy link

github-actions bot commented Feb 5, 2026

🤖 Hi @aireenmei, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

Copy link

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

## 📋 Review Summary

This pull request introduces significant refactoring to the benchmarking recipes, modularizing argument parsing and improving configuration management. The changes make the recipes more flexible and easier to use from the command line, especially for DAG integration.

🔍 General Feedback

  • The introduction of parser_utils.py is a good step towards centralizing command-line argument handling.
  • The use of a UserConfig class helps in managing configurations cleanly.
  • The addition of check_and_create_bucket is a useful utility, though its error handling could be improved in the calling code.

@github-actions
Copy link

github-actions bot commented Feb 5, 2026

🤖 I'm sorry @aireenmei, but I was unable to process your request. Please see the logs for more details.

@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch from 9bb39cd to a57612a Compare February 6, 2026 01:04
@DannyLiCom
Copy link
Collaborator Author

Hi @aireenmei I noticed a lot of these errors in Pylint.
E0611: No name 'multimodal' in module 'MaxText' (no-name-in-module)
And they were all from code that was just merged
https://screenshot.googleplex.com/8ddsSFkx9RuLo6u

@DannyLiCom
Copy link
Collaborator Author

@aireenmei Also, these failures don't seem to be related to my code at all.
https://screenshot.googleplex.com/4BRE3k8xXhoft4s

@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch from a57612a to 2e66a21 Compare February 6, 2026 01:20
@ycchenzheng ycchenzheng requested a review from aireenmei February 6, 2026 16:01
@DannyLiCom DannyLiCom force-pushed the lidanny/pw-DAG-integrate2 branch from 2e66a21 to cd65cbd Compare February 7, 2026 01:52
@DannyLiCom
Copy link
Collaborator Author

Hi @aireenmei I noticed a lot of these errors in Pylint. E0611: No name 'multimodal' in module 'MaxText' (no-name-in-module) And they were all from code that was just merged https://screenshot.googleplex.com/8ddsSFkx9RuLo6u

Resolved

@DannyLiCom
Copy link
Collaborator Author

@aireenmei Also, these failures don't seem to be related to my code at all. https://screenshot.googleplex.com/4BRE3k8xXhoft4s

Resolved

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants