deepfold.data.tools.hhblits.HHBlits

class deepfold.data.tools.hhblits.HHBlits(*, binary_path: str, databases: Sequence[str], n_cpu: int = 4, n_iter: int = 3, e_value: float = 0.001, maxseq: int = 1000000, realign_max: int = 100000, maxfilt: int = 100000, min_prefilter_hits: int = 1000, all_seqs: bool = False, alt: int | None = None, p: int = 20, z: int = 500)[source]

Python wrapper of the HHblits binary.

__init__(*, binary_path: str, databases: Sequence[str], n_cpu: int = 4, n_iter: int = 3, e_value: float = 0.001, maxseq: int = 1000000, realign_max: int = 100000, maxfilt: int = 100000, min_prefilter_hits: int = 1000, all_seqs: bool = False, alt: int | None = None, p: int = 20, z: int = 500)[source]

Initializes the Python HHblits wrapper.

Parameters:
  • binary_path – The path to the HHblits executable.

  • databases – A sequence of HHblits database paths. This should be the common prefix for the database files (i.e. up to but not including _hhm.ffindex etc.)

  • n_cpu – The number of CPUs to give HHblits.

  • n_iter – The number of HHblits iterations.

  • e_value – The E-value, see HHblits docs for more details.

  • maxseq – The maximum number of rows in an input alignment. Note that this parameter is only supported in HHBlits version 3.1 and higher.

  • realign_max – Max number of HMM-HMM hits to realign. HHblits default: 500.

  • maxfilt – Max number of hits allowed to pass the 2nd prefilter. HHblits default: 20000.

  • min_prefilter_hits – Min number of hits to pass prefilter. HHblits default: 100.

  • all_seqs – Return all sequences in the MSA / Do not filter the result MSA. HHblits default: False.

  • alt – Show up to this many alternative alignments.

  • p – Minimum Prob for a hit to be included in the output hhr file. HHblits default: 20.

  • z – Hard cap on number of hits reported in the hhr file. HHblits default: 500. NB: The relevant HHblits flag is -Z not -z.

Raises:

RuntimeError – If HHblits binary not found within the path.

Methods

__init__(*, binary_path, databases[, n_cpu, ...])

Initializes the Python HHblits wrapper.

query(input_fasta_path)

Queries the database using HHblits.