Titration analysis with CPJUMP1 profiles by axiomcura · Pull Request #5 · WayScience/Buscar-benchmark-analysis

axiomcura · 2026-06-10T16:28:56Z

This pull request introduces a titration analysis for the top 3 compounds across both cell types in the cpjump1 analysis module.

Key updates include:

Added a new Jupyter notebook (5.cpjump1-titration-analysis.ipynb) and its corresponding Python script.
Updated the main analysis shell script to include the titration analysis pipeline.
Added necessary project dependencies to pyproject.toml.
Included output results (parquet and plot files) for the titration analysis.
Updated pre-commit configurations and lock files to support the new workflow.

review-notebook-app · 2026-06-10T16:29:02Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

Copilot

Pull request overview

This PR adds a CPJUMP1 titration analysis to quantify how Buscar score stability changes as the number of pooled perturbed cells decreases, and wires it into the CPJUMP1 analysis workflow.

Changes:

Added a new CPJUMP1 titration analysis notebook and its nbconverted Python script, producing parquet + plot outputs.
Updated the CPJUMP1 runner shell script to execute the new titration step.
Added dependencies (e.g., pycytominer, requests, tqdm) and updated lock/pre-commit configuration.

Reviewed changes

Copilot reviewed 5 out of 8 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
`uv.lock`	Locks new dependencies needed for titration analysis and related tooling.
`pyproject.toml`	Adds build-system config and new runtime dependencies for the analysis workflow.
`notebooks/3.cpjump1-analysis/run-cpjump1-buscar-analysus.sh`	Adds execution of the titration analysis script (but currently references incorrect script filenames in the pipeline).
`notebooks/3.cpjump1-analysis/nbconverted/5.cpjump1-titration-analysis.py`	Implements the titration analysis pipeline and writes results/plots (but currently has import/path and avoidable per-iteration I/O issues).
`notebooks/3.cpjump1-analysis/5.cpjump1-titration-analysis.ipynb`	Source notebook for the titration analysis.
`.pre-commit-config.yaml`	Updates Ruff pre-commit revision.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

wli51

LGTM! I don't think there are any fatal flaws with this experiment but maybe the analysis can be made more robust with sampling with replacement, see comments for details.

wli51 · 2026-06-11T00:44:00Z

+            .mean()
+            .alias("mean_abs_score_error_from_1"),
+        )
+        .sort(["mean_abs_score_error_from_1", "var_on_score"])


i like the ranking method here and I think it makes sense. I have one concern being that if two perturbations have respective mean error from 1 of 0.001 and 0.002 but the one with less err had an insanely huge variance compared to the second it would still rank first which I guess is what we don't want.

Maybe the variance should be normalized by error to indicate true consistency?

either way, i think you have to show the actual rank df and see the real values of variance of top ranked compounds. currently your notebook is not ran and that can't be seen

wli51 · 2026-06-11T00:49:22Z

+
+
+# setting random seed for reproducibility
+np.random.seed(rng_seed)


having a standalone random state e.g. rng = np.random.RandomState(rng_seed) and calling rng.choice when you need to sample is always better than relying with global np random

If Cameron were to review this PR he would suggest hasjing something like the cell line name or some CPJUMP1 experiment specific ID to get a always reproducible seed to use as rng for both cell types.

wli51 · 2026-06-11T00:52:39Z

+    selected_plate_id = np.random.choice(plate_ids)
+
+    for treatment in tqdm(
+        selected_treatments, desc=f"{cell_type} treatments", unit="treatment"
+    ):


If we ever want k fold we should just exhuast over all plate_ids, setting the one as reference and all others as titration sample pool. maybe ask greg if he thinks kfold titration would worth the effort?

wli51 · 2026-06-11T00:55:52Z

+                ).sample(
+                    fraction=negcon_subsample_fraction,
+                    seed=iter_seed,
+                    with_replacement=False,


for smaller number of single cell i think with replacement will probably make the titration readouts more realistic, because with small sample sizes at larger titration proportions you essentially get the exact same sample, which can cause artificial stability.

axiomcura added 5 commits June 9, 2026 12:04

added notebook and updated pre-commit

a153f7f

added support repo utils module

ce2987b

added titration anlaysis

2418faf

removed checkpoint file

2ca322c

updated notebook and shell script

1c3a60b

axiomcura requested review from Copilot and wli51 June 10, 2026 16:29

Copilot started reviewing on behalf of axiomcura June 10, 2026 16:29 View session

Copilot AI reviewed Jun 10, 2026

View reviewed changes

axiomcura added 2 commits June 10, 2026 11:02

fixed bug in shell script

ed50fba

updates

da6c28e

wli51 approved these changes Jun 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Titration analysis with CPJUMP1 profiles #5

Titration analysis with CPJUMP1 profiles #5
axiomcura wants to merge 7 commits into
WayScience:mainfrom
axiomcura:titration-analysis

axiomcura commented Jun 10, 2026

Uh oh!

review-notebook-app Bot commented Jun 10, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wli51 left a comment

Uh oh!

wli51 Jun 11, 2026

Uh oh!

wli51 Jun 11, 2026

Uh oh!

wli51 Jun 11, 2026

Uh oh!

wli51 Jun 11, 2026

Uh oh!

wli51 Jun 11, 2026

Uh oh!

wli51 Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		# setting random seed for reproducibility
		np.random.seed(rng_seed)

Conversation

axiomcura commented Jun 10, 2026

Uh oh!

review-notebook-app Bot commented Jun 10, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wli51 left a comment

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

wli51 Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants