Off topic:Python Script to Calculate Total Number of genes
1
0
Entering edit mode
5.6 years ago
anasjamshed ▴ 120

This is the HCV Genome Sequence:

>GU294484.1 Hepatitis C virus isolate PK-1, complete genome
ACCTGCCTCTTACGAGGCGACACTCCACCATGGATCACTCCCCTGTGAGGAACTACTGTCTTCACGCAGAAAGCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATAAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGCAACATGAGCACACTTCCTAAACCTCAAAGAAAAACCAAAAGAAAACCCATCCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGACAGATCGTTGGTGGAGTATACGTGTTGCCGCGCAGGGGCCCACGATTGGGTGTGCGCGCGACGCGTAAGGCTTCTGAACGGTCACAGCCTCGCGAACGACGACAGCCTATCCCCAAGGCGCGTCGGAGCGAAGGCCGGTCCTGGGCTCAGCCTGGGTACCCTTGGCCCCTCTATGGTAATGAGGGCTGCGGGTGGGCAGGGTGGCTCCTGTCCCCCCGCGGCTCCCGTCCATCTTGGGGCCCAAACGACCCCCGGCGAAGATCCCGCAACTTGGGTAAAGTCATCGATACCCTTACGTGCGGATTCGCCGACCTCATGGGGTACATCCCGCTCGTCGGCGCTCCCGTAGGAGGCGTCGCAAGAGCCCTCGCGCATGGCGTGAGGGCCCTTGAAGACGGGATAAATTTTGCGACAGGGAACTTGCCCGGTTGCTCCTTTTCTATCTTCCTTCTTGCTCTACTCTCTTGCTTAATTCATCCAGCAGCCAGTCTAGAGTGGCGGAATACGTCTGGTCTCTATGTCCTTACCAACGCCCGTTCCAACAGCAGTATAGTGTACGAGGCCGACGACGTTATCCTGCACACACCCGGCTGTATACCTTGTGTTCAGACCGGCAACACATCCAAGTGCTGGACCCCAATGACACCCACGGTGGCAGTTAAGTATGTCGGAGCAACCACCGCTTCGATACGCGGTCATGTGGACCTGTTAGTGGGCGCAGCCACGATGTGTTCTGCGCTCTACGTGGGTGATGTGTGCGGAGCCGTCTTCCTCGTGGGGCAAGCCTTCACGTTCAGGCCGCGACGCCATCAAACGGTCCAGACCTGCAACTGCTCGCTGTACCCAGGCCATCTCACAGGACATCGAATGGCTTGGGATATGATGATGAACTGGTCCCCTGCTGTTGGCATGGTGGTGGCGCACATCTTACGCCTACCCCAGACCCTGTTTGATATAATAGCCGGGGCCCATTGGGGCGTCTTGGCGGGTCTAGCCTACTATACCATGCAGGGCAACTGGGCCAAGGTCGCAATCATCATGGTTATGTTCTCAGGGGTCGATGCCGTTACGTACATCACTGGGGGCACTGCAGCTCGTGGGGGCCAAGGGCTGGCTAGCCTAATCGTCCGGGGGCCTGAGCAGCGCCTGGAGCTGATCAACACCCATGGCTCGTGGCACATCAACAGTACTGTCCTCCACTGCAATGAGTCCATAAACACAGGGTTTATAGCTGGGTTGTTTTATTATCATAAGTTCAACTTACTGGATGTCCCGAAGGCTCAGCAGCTGCAAGCCCATCACTTTCTTCAGGCAGGGGTGGGGCCCCTTGACAGATGCCAACATCCACCGGCCCTTCTGATGACAACCGTACTGCTGGCATACGCACCTAGACCTTGTGACAGCGTAAAGCAGCACGTGTCTCCGGTCCTGTGTATGCTTCCACACCATCGCCCAGTGGTGGTAGGCACTACTGATCCTAAGGGCGCTCCCACCTATAACTGGGGCGAGAATGAGACAGACGTGTTCCTGCTGAATCCCTGCGGCCTCCTAGTGGTCGGTGGTTTGGGTGGCACGTGGGAGGAACTCCACCGGGGTTTGTCAAGACGTGCGGAGGTTCCCCCTTGTGACATCTATGGGGGTGGGGGGGAGATCCACCAATGGTTCAGACCTCTTCTGCCCCACCGACTGCTTCAGGAAACATCCCGAGGCCACATACAGCCGGTGCGGCTCGGGGCCCTGGTTGACACCTCGATGCATGGTCGACTATCCATACCGGCTTTGGCATTACCCATGTACAGTCAATTTTACACTGTTCAAGGTGAGGATGTTTGTGGGTGGGTTTGGCATCGGTTTACCGCCGCTTGCAACTGGACTAGGGGGGAGCGCTGCGATATCGAGGATCGTGACCGCAGCGAGCAACATCCCCTGCTGCATTCAACAACTGAGCTTGCCATACTGCCTTGCTCTTTCACGCCCATGCCCGCATTGTCAACAGGGTTAATACACCTCCACCAAAACATCGTGGATGTCCAATACCTTTATGGCGTTGGATCTGGCATGGTGGGATGGGCGTTGAAATGGGAATTTGTCATCCTCGTTTTCCTCCTCCTAGCAGATGCACGCGTGTGCGTTGCCCTTTGGCTGATGCTGATGATATCACAAGCAGAAGCAGCCTGGAGAACTTGTCACGCTGAACGCCGTCTCTGCTGCCGGGACACATGGTATCGGCTGGTACCTGGTAGCATTTTGCGCGGCGTGGTACGTGCGGGGAAACTCGTCCCGCTGGTGACCTACAGCCTGACGGGTCTTTGGTCCCTAGCATTGCTCGTCCTTCTACTCCCCCAGCGGGCGTATGCTTGGTCGGGTGAAGACAGTGCCACCCTCGGCGCTGGGATCTTGGTCCTCTTCGGCTTCTTTACCCTGTCACCTTGGTATAAGCACTGGATCAGCCGCCTCATGTGGTGGAACCAGTACGCCATATGTAGGTGTGAGTCTGCTCTCCAAGTATGGGTCCCCCCCCTACTTGCCCGCGGGAGTAGGGACGGTGTTATCCTGCTAACAAGCCTGCTTTATCCATCATTAGTTTTTGACATCGCTAAGCTGCTGATAGCCGTAATAGGCCCATTATATCTAATACAGGCCGCCATCACTACTACCCCCTACTTTGTGCGTGCGCATGTTCTGGTCCGCCTTTGCATGTTCGTGCGCTCCGTGACGGGGGGAAAGTACTTCCAGATGGCCATACTGAGCGTCGGCAGATGGTTTAACACCTACCTATATGACCACCTTGCACCGATGCAACACTGGGCCGCAGCAGGCCTCAAAGACCTGGCAGTAGCCACTGAACCTGTAATATTCAGTCCCATGGAAATCAAGGTCATCACTTGGGGCGCGGACACGGCAGCTTGCGGAGATATCCTATGCGGGCTGCCCGTCTCTGCACGATTAGGCCGTGAGGTGTTGTTGGGACCTGCTGATGACTATCGGGAGATGGGCTGGCGTCTGTTGGCCCCGATTACAGCATACGCCCAGCAAACTAGGCGTCTTTTTGGGACTATTGTGACCAGCTTGACTGGCAGGGACAAGAACGTGGTGGCCGGCGAAGTGCAGGTGCTTTCTACGGCTACCCAGACCTTCCTAGGTACAACATTGGGAGGGGTTATGTGGACTGTTTACCATGGAGCAGGTTCGAGAACACTTGCGGGCGTCAAACATCCTGCGCTCCAAATGTACACAAATGTAGATCAGGACCTCGTTGGATGGCCAGCTCCTCCGGGGGCTAAGTCTCTTGAACCGTGCACCTGCGGGTCTGCGGACTTGTACTTGGTTACCCGCGAAGCTGATGTCATCCCTGCTAGACGCAGGGGGGACTCCACAGCGAGCTTGCTCAGTCCTAGGCCTCTCGCCTGTCTCAAAGGTTCCTCTGGAGGTCCTGTTATGTGCCCTTCGGGCCACGTAGCGGGGATCTTTAGGGCTGCTGTGTGCACCAGAGGTGTAGCAAAAGCCCTACAGTTCATACCAGTGGAAACCCTTAGCACACAGGCTAGGTCTCCATCCTTTTCTGACAATTCAACTCCTCCTGCTGTTCCACAGAGCTATCAAGTAGGGTACCTTCATGCCCCGACCGGCAGCGGTAAGAGCACAAAGGTCCCGGCCGCTTATGTAGCACAAGGATATAATGTTCTCGTGTTGAATCCATCAGTGGCGGCCACACTAGGCTTCGGCTCTTTCATGTCGCGAGCTTATGGGATCGACCCCAACATCCGCACCGGGAACGGCACGGTTACAACTGGTGCTAATCTGACCTATTCCACCTATGGTAAGTTTCTCGCGGACGGGGGTTGCTCGGGGGGAGCATATGATGTGATTATCTGTGATGAGTGTCATGCCCAAGACTCTACTAGCATACTGGGTATAGGCACGGTCCTAGATCAGGCTGAAACGGCTGGGGTGAGGCTGACGGTTTTAGCAACAGCAACTCCCCCAGGCAGCATCATTGGGCCCCATTCTAACCTCAAAGAAGTGGCCCTTGGTTCTGAGGGGGAGATCCCTTTCTTCGGCAAGGCCATACCGCTAGCCCTGCTAAGGGGGAAAGGCACCTTATTTTTTTCCATTCCAAGAAAAAATGTGATGAGATGGCATCCAAACTCAGAGGCATGGGGCTCAACGCTGAAGGAGTACTACAGGGGTCTTGATGTGTCCGTCATACCAACATCAGGAGACGTTGTAGTTTGCGCTACTGACGCCCTCATGACTGGATTCACCGGAGACTTCGACTCTGTCATAGATTGCAACGTGGCTGTTGAACAGTACGTTGATTTCAGCTTGGACCCCACCTTTTCCATTGAGACTCGCACTGCTCCCCAAGACGCGGTTTCCCGCAGTCAACGTCGTGGCCGTACGGGCCGAGGTAGACTCGGCACGTACCGATATGTCACCCCCGGTGAAAGACCGTCTGGGATGTTTGACTCGGCTGTTCTCTGTGAGTGCTATGACGCGGGCTGCTCGTGGTACGACTTGCAGCCCGCCGAGACCACAGTCAGACTAAGAGCTTACTTGTCCACGCCGGGGTTACCTGTCTGCCAAGACCACTTGGAATTTTGGGAGAGCGTCTTCACTGGACTAACTCACATAGATGCCCACTTTCTATCACAGACCAAGCAGCAGGGACTCAACTTCCCATACCTAGCTGCCTACCAAGCCACTGTGTGCGCTCGCGCGCAAGCTCCTCCCCCAAGTTGGGACGAGACATGGAAGTGTCTCGTGCGGCTTAAGCCAACACTACATGGACCTACACCCCTTCGATATCGGCCGGGGCCTGTCCAAAATGAAACCTGCTTGACACACCCCATCACAAAATACCTCATGGCATGCATGTCAGCCGATCTGGAAGTAACCACCAGCACCTGGAGCACCTGGGTGTTGCTCGGAGGGGTCCTCGCGGCCCTGGCAGCCTACTGCTTGTCGGTCGGCTGCGTAGTCATTGTGGGCCACATTGAGCTGGGGGGCAAGCCGGCGCTCGTTCCTGACAAAGAAGTGTTGTATCAACAATACGATGAGATGGAGGAGTGCTCACAAGCTGCCCCATATATCGAACAAGCTCAAGTAATAGCCCACCAGTTCAAGGAAAAAGTCCTTGGATTGCTACAGCGAGCTACCCAACAACAAGCTGTCATTGAGCCCATAGTAGTTACCAACTGGCAAAAGCTTGAGGCCTTCTGGCACAAGCACATGTGGAACTTTGTGAGTGGGATTCAGTACCTAGCAGGTCTCTCCACTTTGCCCGGCAACCCCGCTGTGGCGTCTCTTATGGCGTTCGCTGCTTCAGTCACCAGTCCCCTGACGACCAATCAAACTATGTTTTTTAACATACTCGGGGGATGGGTTGCTACTCATTTGGCAGGGCCCCAGAGCTCTTCCGCATTCGTGGTAAGCGGCTTGGCCGGCGCTGCCATAGGGGGCATAGGCCTGGGCAGGGTCTTACTTGACATCCTGGCAGGATACGGAGCTGGTGTCTCAGGCGCCTTGGTGGCTTTCAAAATCATGGGGGGGGAACTCCCCAATGCCGAGGACGTGGTCAATCTGTTGCCCGCCATACTATCTCCGGGTGCTCTCGTCGTCGGGGTGATATGCGCTGCCCTACTACGTCGGCACGTGGGACCTGGGGAGGGAGCGGTACAGTGGATGAACAGGCTCATCGCGTTCGCATCCCGGGGCAACCACGTCTCACCGACGCACTATGTTCCCGAGAGCGATGCTGCGGCAAGGGTCACCGCATTGCTGAGTTCTCTAACTGTCACAAGTCTGCTCCGGCGGTTACACCAGTGGATCAATGAAGACTACCCAAGCCCTTGTAGCGACGATTGGCTACGTACCATCTGGGACTGGGTCTGCATGGTGTTGCTCGACTTCAAGACATGGCTGTCTGCTAAGATCATGCCATTGCTCCCTGGGTTGCCCTTCATTTCCTGTCAAAAGGGATATAAGGGCGTTTGGCAGGGGGACGGCGTGGTGTCCACTCGCTGTCCTTGCGGAGCAGTGATAACCGGTCATGTGAAGAACGGGTCCATGCGGCTTGCAGGACCACGTACATGTGCTAACATGTGGCACGGCACCTTCCCCATCAACGAGTACACCACCGGACCCAGCACACCTTGCCCATCACCCAACTACACTCGTGCACTGTGGCGCGTGGCTGCCAACAGCTACGTCGAAGTGCGACGGGTGGGAGACTTCCACTACATCACGGGGGCCACAGAAGATGAGCTCAAGTGTCCGTGCCAAGTGCCGGCTGCTGAGTTCTTTACTGAAGTGGATGGGGTGAGACTTCACCGTTACGCCCCTCCATGCAGGCCCCTGTTGAGGGATGAGATCACTTTCGTAGTAGGGCTGAATTCTTACGCGATAGGATCCCAACTCCCTTGTGAGCCCGAACCGGACGTCTCTGTGCTGACCTCGATGTTGAGAGACCCTTCCCATATCACCGCCGAGACGGCAGCGCGCCGCCTTGCACGCGGGTCCCCTCCATCAGAGGCAAGCTCATCCGCCAGTCAACTATCGGCTCCATCGTTGAAGGCCACTTGCCAAACGCATAGGCCTCATCCCGACGCGAGCTGGTGGACGCCAACTTGTGTTTGGCGACAAGAGATGGGCAGCAACATCACACGGGTAGAGTCCGAAACAAAGGTTGTGATTCTTGACTCATTCGAACCTCTGAGGGCCGAGACTGATGACACCGAGCTCTCGGTAGCAGCAGAGTGTTTCAAGAAACCTCCCAAGTATCCTCCAGCCCTCCCTATCTGGGCTAGGCCAGACTACAACCCTCCACTGTTGGATCGTTGGAAATCACCGGATTATGAACCACCAATTGTTCATGGGTGCGCCTTACCACCACAGGGTACTCCACCGGTGCCTCCCCCTCGGAGGAAAAGAACAATCCAGCTGGACGGCTCCAATGTGTCCGCGGCGCTAGCTGCGCTAGCGGAAAAATCATTCCCGGCCTCAAAACCGTTGGAAGCGGGTAGCTCATCCTCAGGGGTCGATACACAGTCCAGCACTACTTCCAAGGTGCCTCCCTCTTCGGAGAGAGAGTCCGACACAGAATCGTGCTCGTCCATGCCTCCTCTCGAGGGGGAGCCGGGCGATCCAGACTTGAGTTGCGACTCTTGGTCCACTGTTAGTGACAGCGAGGAGCAGAGCGTGGTCTGCTGCTCTATGTCGTATTCTTGGACCGACGCCCTGATAACACCATGTAGTGCTGAGGGAGAGAACTGCCCATCAGCCCACTCAGCAATCTTGGAGAGACATCACAACCTAATCTATTCAACGTCGTCTAGAATCGCTTCTCAACGTCAGAAGAAGGTCACCTTCGACAGGCTGCAGGTGCTCGACGACCATTACAAAACTGCATTAAAGGAGATAAAGGAGCGAGCGTCAAGGGTAAAGGCTCGCATGCTCACCATCGAGGAAGCGTGCGCGCTCGTCCCTCCTCACTCTGCTCGGTCAAAGTTCGGGTATAGTGCGAAGGACGCTCGCTCCCTGTCCAGCAAGGCCATTAACCAGATCCGCTCCGTCTGGGAGGACTTGCTGGAAGACACCACAACTCCAATTCCAACCACCATCATGGCGAAGAGCGAGGTTTTTTGTGTGGATCCTACTAAAGGAGGCCGTTTTTTGCTCGTCTCATTGTCTACCCTGACCTGGGGGTGCGCATCTGTGAGAACGTGCCCTATATGGCGTGATACAGAAGTGGGGAGTGGGGACGATGGGTCCTGCCTATGGATTCCAATACTCGCCTCAACAGCGGGTCGAACGTCTGCTGAAGATGTGGACCTCAAAGAAAGCCCAGTTGGGGTTCTCGTATGGTACCCGCTGCTTTGGCTCGACTGCCACTGGACAGGACATCAGGGTGGAAGAGGAGATATACCAATGCTGGAGCCTTGGACCGGAGGCCAGGAAAGTGATCTCCTCCCTCACGGAGCGGCTTTACTGCGGAGGCCCTATGTTCAACAGCAAGGGGGCCCAGTGTGGTTATCGCCGTTGCCGTGCCAGTGGAGTTCTGCCTACCAGCTTCGGCAACACGATCACTTGTTACATCAAGGCCACAGCGGCTGCAAAGGCCGCAAACCTCCGGAAGCCTGGCTTCTTGTTTGGGGAGGATGGATCCTGGTCGTATTACCTGAGGACCGAATGGGGTTCGAATGGAGATACGGGCAGTCCTGGGAGAGCCTTCACCGGAGGCTATGGACCAGGTATTTCTTGCTTCCACCCGGAGATGGCCCCCACAGGCCAACCCTACGACCTTTGGGCTCATTACATCTGGCTCCTCCAACGTCTCCGTGGCGCGGGACGATACGGGGAAGAGGTATTATTACCTCACTCGTGGTGCCACCACCCCCCTGGCCCGTGCTGCTTGGGAGACAGCTCGTCACACTCCAGTTACCTCCTGGCTGGGGAACATCATCATGTACGCGCCTACTATTTGGGTGCGCATGGTGGTGGTGGCACACTTTTTCTCCATACTCCAATCCCAGGAGATACTTGGTCGCCCCCTTGGCTTTGGAATGTACGGGGCCACTTACTCTGTCACTCCGCTGGATTTACCAGCAATCATTGGAAGACTCCATGGTCTACGCGCGTTTACGCTCCATATTTACTCTCCAGCAGAGCTCAATACGGTCGCGGGGACACTCAGGAAGCTTGTGCCCCCCCCTACGAGCTTGGAGACATCGGGCACGAGCAGTGCGCGCTATGCTTATCGCCCAGGGAGGGAAGGCCAGGATTTGTGGGCTTTATCACTTCAATTGGGCGGTACGCACCAAGACCACCCTCACTCCACTGCCAGCCGCTGGCCAGTTGGATTTATCCATCTGGTTTACGGTTGGTGTCGGCGGGAACGACATTCTCGCAGCGTGTCACGCGCCCGAACCCGCCATTTGCTGCTTTGCCTACTCCCTACTAACAGTAGGGGTAGGCATCTTTCTCTTGCCAGCTCGATGAGCTGGTAAGATAACACTCCATTTCTTTTTTGTTTTTTTTTTTTTTTTTTTT

I want to write Python script to Calculate Total Genes present in this 9474 bp sequence. The Start Codon is (ATG) and I want to calculate the genes from all 3 reading frames

Hepatatis-C python • 3.5k views
ADD COMMENT
This thread is not open. No new answers may be added
Traffic: 2423 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6