Skip to content

Clarification on attention mask "order of prediction" and location of iterative selection code #338

@nc1m

Description

@nc1m

Hello, and thanks for the great work on scGPT.

Context:

  • In "Attention mask for generative pretraining." you state that attention scores define the order of prediction: "To address this challenge, we developed a specialized attention-masking mechanism for scGPT that defines the order of prediction based on attention scores."
  • Later you state it is based on prediction confidence: "In each iteration, scGPT selects the top 1/K genes from the unknown set with the highest prediction confidence to be included as known genes in the next iteration i + 1."
  • I couldn't find any iterative code in the repo.

Questions:

  • How is the order of prediction specified in the current implementation: is it determined by attention scores, prediction confidence, or something else?
  • Where is the relevant iterative code that selects top 1/K genes by prediction confidence and updates the known/unknown sets across iterations? Please point to the file(s), function(s), and, if possible, line ranges.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions