ERROR: Too many blocks, the phasing process must be wrong.

Hi,

I'm working on FFPE tumour samples (RNAseq and WES) and I'm getting an error during the phasing process.

```
Num of hete small variant is 26 in HLA_DQB1.
11 blocks after phasing.
Start link blocks with database...
Num of all possible haps is 1024.
ERROR: Too many blocks, the phasing process must be wrong.
```


This error leads to missing files during the annotation process, resulting in no results.

Do you have a suggestion for bypassing the phasing process for problematic HLA genes?

Many thanks for your help

```
start annotation...
parameter:	sample:D240219	dir:specHLA//D240219	pop:Unknown	wxs:exon	G_nom:0
specHLA//D240219/hla.allele.1.HLA_DQB1.fasta: No such file or directory
specHLA//D240219/hla.allele.1.HLA_DQB1.fasta: No such file or directory
BLAST engine error: Warning: Sequence contains no data 
specHLA//D240219/hla.allele.2.HLA_DQB1.fasta: No such file or directory
specHLA//D240219/hla.allele.2.HLA_DQB1.fasta: No such file or directory
BLAST engine error: Warning: Sequence contains no data 
mpileup: invalid option -- 't'

Usage: samtools mpileup [options] in1.bam [in2.bam [...]]

Input options:
  -6, --illumina1.3+      quality is in the Illumina-1.3+ encoding
  -A, --count-orphans     do not discard anomalous read pairs
  -b, --bam-list FILE     list of input BAM filenames, one per line
  -B, --no-BAQ            disable BAQ (per-Base Alignment Quality)
  -C, --adjust-MQ INT     adjust mapping quality; recommended:50, disable:0 [0]
  -d, --max-depth INT     max per-file depth; avoids excessive memory usage [8000]
  -E, --redo-BAQ          recalculate BAQ on the fly, ignore existing BQs
  -f, --fasta-ref FILE    faidx indexed reference sequence file
  -G, --exclude-RG FILE   exclude read groups listed in FILE
  -l, --positions FILE    skip unlisted positions (chr pos) or regions (BED)
  -q, --min-MQ INT        skip alignments with mapQ smaller than INT [0]
  -Q, --min-BQ INT        skip bases with baseQ/BAQ smaller than INT [13]
  -r, --region REG        region in which pileup is generated
  -R, --ignore-RG         ignore RG tags (one BAM = one sample)
  --rf, --incl-flags STR|INT  required flags: include reads with any of the mask bits set []
  --ff, --excl-flags STR|INT  filter flags: skip reads with any of the mask bits set
                                            [UNMAP,SECONDARY,QCFAIL,DUP]
  -x, --ignore-overlaps   disable read-pair overlap detection
  -X, --customized-index  use customized index files

Output options:
  -o, --output FILE        write output to FILE [standard output]
  -O, --output-BP          output base positions on reads, current orientation
      --output-BP-5        output base positions on reads, 5' to 3' orientation
  -M, --output-mods        output base modifications
  -s, --output-MQ          output mapping quality
      --output-QNAME       output read names
      --output-extra STR   output extra read fields and read tag values
      --output-sep CHAR    set the separator character for tag lists [,]
      --output-empty CHAR  set the no value character for tag lists [*]
      --no-output-ins      skip insertion sequence after +NUM
                           Use twice for complete insertion removal
      --no-output-ins-mods don't display base modifications within insertions
      --no-output-del      skip deletion sequence after -NUM
                           Use twice for complete deletion removal
      --no-output-ends     remove ^MQUAL and $ markup in sequence column
      --reverse-del        use '#' character for deletions on the reverse strand
  -a                       output all positions (including zero depth)
  -a -a (or -aa)           output absolutely all positions, including unused ref. sequences

Generic options:
      --input-fmt-option OPT[=VAL]
               Specify a single input file format option in the form
               of OPTION or OPTION=VALUE
      --reference FILE
               Reference sequence FASTA FILE [null]
      --verbosity INT
               Set level of verbosity

Note that using "samtools mpileup" to generate BCF or VCF files has been
removed.  To output these formats, please use "bcftools mpileup" instead.
No such file or directory
Start G group resolution annotation...
The region with low read depth is masked by N. The cutoff is specified by -k.
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_A.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_A.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_B.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_B.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_C.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_C.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_DPA1.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_DPA1.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_DPB1.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_DPB1.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.1.HLA_DQA1.fasta
The ratio of N (masked) is 0.0 for the allele specHLA//D240219/hla.allele.2.HLA_DQA1.fasta
Traceback (most recent call last):
  File "SpecHLA/script/whole/g_group_annotation.py", line 244, in <module>
    g_ann.main()
  File "SpecHLA/script/whole/g_group_annotation.py", line 146, in main
    n_ratio = count_n_ratio(infer_hap_file) # cal the ratio of N in the fasta
  File "SpecHLA/script/whole/g_group_annotation.py", line 23, in count_n_ratio
    with open(fasta_file, 'r') as f:
FileNotFoundError: [Errno 2] No such file or directory: 'specHLA//D240219/hla.allele.1.HLA_DQB1.fasta'
Clean output dir.
# version: IPD-IMGT/HLA 3.58.0
Sample	HLA_A_1	HLA_A_2	HLA_B_1	HLA_B_2	HLA_C_1	HLA_C_2	HLA_DPA1_1	HLA_DPA1_2	HLA_DPB1_1	HLA_DPB1_2	HLA_DQA1_1	HLA_DQA1_2	HLA_DQB1_1	HLA_DQB1_2	HLA_DRB1_1	HLA_DRB1_2
D240219 is done.
```


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ERROR: Too many blocks, the phasing process must be wrong. #47

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

ERROR: Too many blocks, the phasing process must be wrong. #47

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions