DustMasker Module Documentation

DustMasker is a program that identifies and masks low-complexity regions of a genome using an improved implementation of the DUST algorithm.

Morgulis A, Gertz EM, Schaffer AA, Agarwala R. A Fast and Symmetric DUST Implementation to Mask Low-Complexity DNA Sequences.

ensembl.tools.anno.repeat_annotation.dust.run_dust(genome_file: PathLike, output_dir: Path, dust_bin: Path = PosixPath('dustmasker'), num_threads: int = 1) -> None
Run Dust on genomic slices with multiprocessing.
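The "slices plus worker pool" pattern named in the summary can be sketched as follows. This is only an illustration under stated assumptions, not the code used by run_dust: the helper names (make_slices, process_slice, run_on_slices), the slice size, and the 1-based inclusive coordinates are all hypothetical, and the worker is a placeholder that reports the slice length instead of invoking dustmasker.

```python
from multiprocessing import Pool
from typing import Dict, List, Tuple

# Hypothetical slice representation: (sequence name, start, end), 1-based inclusive.
Slice = Tuple[str, int, int]


def make_slices(seq_lengths: Dict[str, int], slice_size: int) -> List[Slice]:
    """Cut every sequence into consecutive, non-overlapping slices."""
    if slice_size <= 0:
        raise ValueError("slice_size must be positive")
    slices: List[Slice] = []
    for name, length in seq_lengths.items():
        for start in range(1, length + 1, slice_size):
            # The last slice of a sequence may be shorter than slice_size.
            end = min(start + slice_size - 1, length)
            slices.append((name, start, end))
    return slices


def process_slice(region: Slice) -> Tuple[str, int]:
    """Placeholder worker: a real worker would run the masking tool on the slice."""
    name, start, end = region
    return name, end - start + 1


def run_on_slices(
    seq_lengths: Dict[str, int], slice_size: int, num_threads: int = 1
) -> List[Tuple[str, int]]:
    """Distribute the slices over a pool of num_threads worker processes."""
    slices = make_slices(seq_lengths, slice_size)
    with Pool(processes=num_threads) as pool:
        # pool.map returns results in the same order as the input slices.
        return pool.map(process_slice, slices)
```

On platforms that start worker processes by spawning a fresh interpreter, run_on_slices should be called from code guarded by `if __name__ == "__main__":`.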
Parameters:
    genome_file (PathLike): Genome file path.
    output_dir (Path): Working directory path.
    dust_bin (Path, default dustmasker): Dust software path.
    num_threads (int, default 1): Number of threads.

Returns:
    None
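A call to run_dust might look like the sketch below. The wrapper name mask_low_complexity and its pre-flight checks are hypothetical additions for illustration; only the run_dust import path, parameter names, and defaults come from the signature documented above. The import is done lazily inside the function so that the module can be loaded even where the ensembl package is not installed.

```python
import shutil
from pathlib import Path


def mask_low_complexity(
    genome_file: Path,
    output_dir: Path,
    dust_bin: Path = Path("dustmasker"),
    num_threads: int = 1,
) -> None:
    """Hypothetical wrapper: check the inputs, then delegate to run_dust."""
    if not genome_file.is_file():
        raise FileNotFoundError(f"Genome file not found: {genome_file}")
    # shutil.which resolves both bare executable names and explicit paths.
    if shutil.which(str(dust_bin)) is None:
        raise FileNotFoundError(f"Dust executable not found: {dust_bin}")
    # Assumption: creating the working directory up front is harmless.
    output_dir.mkdir(parents=True, exist_ok=True)

    # Lazy import of the documented function.
    from ensembl.tools.anno.repeat_annotation.dust import run_dust

    run_dust(
        genome_file=genome_file,
        output_dir=output_dir,
        dust_bin=dust_bin,
        num_threads=num_threads,
    )
```

For example, `mask_low_complexity(Path("genome.fa"), Path("dust_output"), num_threads=4)` would run Dust with four workers, assuming both the genome file and the dustmasker executable are available.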