overlaps
ensembl.io.genomio.gff3.overlaps
¶
Scan a GFF3 file to detect overlapping SeqFeature objects. Default object level => gene.
get_intervals(record, genes_dict, seq_dict, seq_name)
¶
Extract start/stop feature coordinates for use in creating intervaltree object.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
record
|
SeqRecord
|
Individual sequence record. |
required |
genes_dict
|
dict
|
Genes. |
required |
seq_dict
|
dict
|
Sequences. |
required |
seq_name
|
str
|
Feature sequence name. |
required |
Source code in src/python/ensembl/io/genomio/gff3/overlaps.py
152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 |
|
identify_feature_overlaps(gff_in, output_file, isolate_feature)
¶
Detect overlapping GFF3 SeqFeature objects and dump to a report.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
gff_in
|
Path
|
User supplied GFF3 input file. |
required |
output_file
|
Path
|
Output file to write feature overlaps. |
required |
isolate_feature
|
str
|
Sequence feature type to filter by. |
required |
Source code in src/python/ensembl/io/genomio/gff3/overlaps.py
56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 |
|
main()
¶
Module entry-point.
Source code in src/python/ensembl/io/genomio/gff3/overlaps.py
183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 |
|
scan_tree(feature_intervals)
¶
Construct an interval tree using supplied genomic intervals, check all elements on the tree against itself and return any that hit 2 or more intervals (i.e. itself + 1 other)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
feature_intervals
|
list
|
Genome features to examine for coordinate (start/end) overlaps. |
required |
Return
Set of intervals identified in the input GFF3 file that overlap with 2 or more intervals.
Source code in src/python/ensembl/io/genomio/gff3/overlaps.py
93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 |
|
summarize_feature_stats(gff_in)
¶
Analyse a GFF3 file and produce a summary of its feature types.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
gff_in
|
Path
|
User supplied GFF3 input file. |
required |
Source code in src/python/ensembl/io/genomio/gff3/overlaps.py
41 42 43 44 45 46 47 48 49 50 51 52 53 |
|