getTIRs Class
getTIRs
getTIRs(
fasta_file: str,
flankdist: int = 10,
minid: float = 80,
minterm: int = 10,
minseed: int = 5,
diagfactor: float = 0.3,
mites: bool = False,
report: str = 'split',
temp: Optional[str] = None,
keeptemp: bool = False,
alignTool: str = 'nucmer',
verbose: bool = True,
) -> Generator[SeqRecord, None, None]
Align elements to self and attempt to identify TIRs.
Processes sequences to identify Terminal Inverted Repeats (TIRs) by performing self-alignment. Can optionally construct synthetic Miniature Inverted-repeat Transposable Elements (MITEs).
| PARAMETER | DESCRIPTION |
|---|---|
fasta_file
|
Path to the multifasta file containing sequence records.
TYPE:
|
flankdist
|
Maximum distance from element start for TIR candidates, by default 10.
TYPE:
|
minid
|
Minimum identity between terminal repeat pairs, by default 80.
TYPE:
|
minterm
|
Minimum length for a terminal repeat to be considered, by default 10.
TYPE:
|
minseed
|
Minimum seed length for nucmer, by default 5.
TYPE:
|
diagfactor
|
Diagonal factor for nucmer, by default 0.3.
TYPE:
|
mites
|
Whether to attempt to construct synthetic MITEs, by default False.
TYPE:
|
report
|
Reporting mode for TIRs ('split', 'external', 'internal', 'all'), by default 'split'.
TYPE:
|
temp
|
Path to the temporary directory, by default None.
TYPE:
|
keeptemp
|
Whether to keep the temporary directory after processing, by default False.
TYPE:
|
alignTool
|
Alignment tool to use ('nucmer' or 'blastn'), by default 'nucmer'.
TYPE:
|
verbose
|
Whether to print verbose output, by default True.
TYPE:
|
| YIELDS | DESCRIPTION |
|---|---|
SeqRecord
|
Segments of the sequence based on the reporting mode: - 'split': TIRs and internal regions separately - 'external': Only TIRs - 'internal': Only internal regions - 'all': Original sequences plus all segments |
Notes
When mites=True, the function will also yield synthetic MITEs constructed by joining the identified TIRs.
options: members: - init
getLTRs Class
getLTRs
getLTRs(
fasta_file: str,
flankdist: int = 10,
minid: float = 80,
minterm: int = 10,
minseed: int = 5,
diagfactor: float = 0.3,
report: str = 'split',
temp: Optional[str] = None,
keeptemp: bool = False,
alignTool: str = 'nucmer',
verbose: bool = True,
) -> Generator[SeqRecord, None, None]
Align elements to self and attempt to identify LTRs.
Processes sequences to identify Long Terminal Repeats (LTRs) by performing self-alignment and filtering results.
| PARAMETER | DESCRIPTION |
|---|---|
fasta_file
|
Path to the multifasta file containing sequence records.
TYPE:
|
flankdist
|
Maximum distance from element start for LTR candidates, by default 10.
TYPE:
|
minid
|
Minimum identity between terminal repeat pairs, by default 80.
TYPE:
|
minterm
|
Minimum length for a terminal repeat to be considered, by default 10.
TYPE:
|
minseed
|
Minimum seed length for nucmer, by default 5.
TYPE:
|
diagfactor
|
Diagonal factor for nucmer, by default 0.3.
TYPE:
|
report
|
Reporting mode for LTRs ('split', 'external', 'internal', 'all'), by default 'split'.
TYPE:
|
temp
|
Path to the temporary directory, by default None.
TYPE:
|
keeptemp
|
Whether to keep the temporary directory after processing, by default False.
TYPE:
|
alignTool
|
Alignment tool to use ('nucmer' or 'blastn'), by default 'nucmer'.
TYPE:
|
verbose
|
Whether to print verbose output, by default True.
TYPE:
|
| YIELDS | DESCRIPTION |
|---|---|
SeqRecord
|
Segments of the sequence based on the reporting mode: - 'split': LTRs and internal regions separately - 'external': Only LTRs - 'internal': Only internal regions - 'all': Original sequences plus all segments |
options: members: - init