Skip to content

Best practice for Milo with mixed-effects models and multiple comparisons #381

Description

@nmdrf

Hi,

I am using Milo with mixed-effects models and have a question about best practices when performing multiple pairwise comparisons.

When testing several contrasts between experimental conditions, there seem to be two possible approaches:

  1. Build the kNN graph on the full dataset
    • Construct the graph and neighborhoods using all conditions.
    • Run differential abundance testing for each contrast separately.

  2. Build the graph separately for each comparison
    • Subset the dataset to only the conditions involved in a given contrast.
    • Construct the graph and neighborhoods on that subset before running the test.

In my analysis, I observed that these two approaches produce noticeably different results. When building the graph on the full dataset, the results for specific contrasts become less interpretable, whereas building the graph separately for each comparison produces results that appear more consistent with expectations.

My questions are:
• Is there a recommended best practice for this scenario?
• Should the neighborhood graph ideally be built globally across all conditions, or separately for each comparison?
• How is this situation conceptually handled within the Milo framework?

Any guidance or experience with this type of analysis would be greatly appreciated.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions