The fetch_nucleotide_chains module uses the yielded chaining results (tab-delimited table) to extract nucleotide chains in FASTA-format from the genomic sequence(s).
The fetch_nucleotide_chains module requires the FASTA-file(s) to extract nucleotide chained sequences using the tab-delimited ouptut table of one of the chaining modules, for example, from SegMantX’s chain_self_alignments or chain_alignments module:
Click here to visit an example table
SegMantX fetch_nucleotide_chains --input_file tests/NZ_AP022172.1.chains.tsv --fasta_file_query tests/NZ_AP022172.1.fasta --output_file tests/NZ_AP022172.1.chains.fasta
-i or --input_file: Output file from chaining results as input.-fq or --fasta_file_query: Fasta file to extract from chains as nucleotide sequences.-o, --output_file: Output file: Fasta file containing nucleotide chains.SegMantX fetch_nucleotide_chains --input_file tests/NZ_CP018634.1_vs_NZ_CP022004.1.chains.tsv --fasta_file_query tests/NZ_CP018634.1.fasta --fasta_file_subject tests/NZ_CP022004.1.fasta --output_file tests/NZ_CP018634.1_vs_NZ_CP022004.1.chains.fasta
-i or --input_file: Output file from chaining results as input.-fq or --fasta_file_query: Fasta file to extract from chains as nucleotide sequences.-fs or --fasta_file_subject: Fasta file to extract from chains as nucleotide sequences.-o, --output_file: Output file: Fasta file containing nucleotide chains.Output of fetching nucleotide chains module:
| (Default) output filename | Description |
|---|---|
| chains.fasta | A fasta file containing the nucleotide sequences of yielded chains. |
For example:
>1_query NZ_AP022172.1:19940-79291 (+)
ATGCCGCATCAGCGGCATGGAAGGCGGCACTCTGTTGTTTCATATGATACAGGAGTAAAA
CCGCCGAAGCCCGGCGTAAGCCGGTACTGATTGATAGATTTCACCTTACCCATCCCCAGC
CCTGCCAGACCATACCCGCTTTCAGCCATGAGAGAGCTTCTGTGCGCGGTCGGAGTGGTC
CCGACGAGGGTTTACCCGAAGTCGGGGCGTATCTCCGCGTTAGCGGGCCGTGAGGGCCGC
TTACGAGCGTGTACTGAGAACTTCCAGCGAGAAGACTGACAGCGATGAAGATGTAGTTAC
...
>1_subject NZ_AP022172.1:89352-147930 (+)
ATGCCGCGCCAGCGGCATGGAAGGCGGCACGCTGTGGTTACATGTGATACCGGAGTAAAA
CCGCCGAAGCCCGGCGTCAGCCGGTACTGATTGATAGATTTCACCTTACCCATCCCTTGC
CCTGCCAGACCATACCTGTTTTCAGCCATGAGAGAGCTTCTGTGCGCGGTCGGAGTGGTC
CCGACGAGGGTTTACCCGAAGTCGGGGCGTATCTCCGCGTTAGCGGGCCGTCAGGGCCGC
TTACGAGCGTGTACTGAGGACTTCCAGCCAGAACACTGACAGCGATGACGGAGTATAGTT
...