DustMasker Module Documentation¶
DustMasker is a program that identifies and masks out low complexity parts of a genome using a new and improved DUST algorithm.
Morgulis A, Gertz EM, Schaffer AA, Agarwala R. A Fast and Symmetric DUST Implementation to Mask Low-Complexity DNA Sequences.
- ensembl.tools.anno.repeat_annotation.dust.run_dust(genome_file: PathLike, output_dir: Path, dust_bin: Path = PosixPath('dustmasker'), num_threads: int = 1) None [source]¶
- Run Dust on genomic slices with mutiprocessing
- param genome_file:
Genome file path.
- type genome_file:
PathLike
- param output_dir:
Working directory path.
- type output_dir:
Path
- param dust_bin:
Dust software path.
- type dust_bin:
Path, default dustmasker
- param num_threads:
Number of threads.
- type num_threads:
int, default 1
- return:
None
- rtype:
None