Skip to content

Question about extracting cross-chain information using ESM-PPI #14

Description

@wyattxuanyang

Thank you for your great work!

As a user I have some questions. Suppose that there are four or five chains (eg. the TCR-pMHC complex, with the chains of A, B, M, N, P), could you please tell me whether I can still extract cross-chain information using ESM-PPI, since I notice TCR-pMHC complex is included in the PPI training process. And what are the exact meaning of the final inputs in the provided code here? (the output keys are (['labl', 'mask', 'pred', 'sfea', 'pfea']))

Thank you! :)

import torch
import tfold

# Download the pre-trained model
model_path = tfold.model.esm_ppi_650m_ab()

# Load the model
model = tfold.model.PPIModel.restore(model_path)

# Prepare antibody sequences (can be single or multiple sequences)
data = [
        'QVQLVQSGAEVKKPGASVKVSCKASGYPFTSYGISWVRQAPGQGLEWMGWISTYNGNTNYAQKFQGRVTMTTDTSTTTGYMELRRLRSDDTAVYYCARDYTRGAWFGESLIGGFDNWGQGTLVTVSS', # Heavy chain
        'EIVLTQSPGTLSLSPGERATLSCRASQTVSSTSLAWYQQKPGQAPRLLIYGASSRATGIPDRFSGSGSGTDFTLTISRLEPEDFAVYYCQQHDTSLTFGGGTKVEIK' # Light chain
]
ppi_output = model(data)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions