Rhiza Labs FluTracker Forum

The place to discuss the flu

All times are UTC - 5 hours [ DST ]




Posted: Tue Jul 29, 2014 9:57 am

Joined: Wed Aug 19, 2009 10:42 am
Posts: 56044
Location: Pittsburgh, PA USA
The Viral Hemorrhagic Fever Consortium has released 84 new "deep" Ebola sequences from samples collected June 2 to June 12 from Sierra Leone cases.

_________________
www.twitter.com/hniman


 
Posted: Tue Jul 29, 2014 9:58 am

LOCUS KM233035 18912 bp cRNA linear VRL 25-JUL-2014
DEFINITION Zaire ebolavirus strain EBOV_1, partial genome.
ACCESSION KM233035
VERSION KM233035.1 GI:667852489
KEYWORDS .
SOURCE Zaire ebolavirus (ZEBOV)
ORGANISM Zaire ebolavirus
Viruses; ssRNA negative-strand viruses; Mononegavirales;
Filoviridae; Ebolavirus.
REFERENCE 1 (bases 1 to 18912)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Deep sequencing analysis of Ebola virus transmission in Sierra
Leone
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 18912)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Direct Submission
JOURNAL Submitted (25-JUL-2014) Infectious Disease Initiative, Broad
Institute of MIT and Harvard, 75 Ames St., Cambridge, MA 02142, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Novoalign v. v.3
Sequencing Technology :: Illumina; Nextera
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..18912
/organism="Zaire ebolavirus"
/mol_type="viral cRNA"
/strain="EBOV_1"
/host="Homo sapiens"
/db_xref="taxon:186538"
/country="Sierra Leone"
/collection_date="02-Jun-2014"
gene 12..2982
/gene="NP"
mRNA 12..2982
/gene="NP"
/product="nucleoprotein"
misc_signal 12..23
/gene="NP"
/note="putative transcription start signal"
CDS 426..2645
/gene="NP"
/note="encapsidation of genomic RNA"
/codon_start=1
/product="nucleoprotein"
/protein_id="AIG95884.1"
/db_xref="GI:667852490"
/translation="MDSRPQKVWMTPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
GFRFEVKKCDGVKRLEELLPAVSSGRNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
DTTIPDVVVDPDDGGYGEYQSYSENGMSAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
KNSQKGQHTEGRQTQSTPTQNVTGPRRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
QQDQDHIQEARNQDSDNTQPEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
GKEYTYPDSLEEEYPPWLTEKEAMNDENRFVTLDGQQFYWPVMNHRNKFMAILQHHQ"
polyA_signal 2971..2982
/gene="NP"
gene 2988..4363
/gene="VP35"
mRNA 2988..4363
/gene="VP35"
/product="VP35 matrix protein"
misc_signal 2988..2999
/gene="VP35"
/note="putative transcription start signal"
CDS 3085..4107
/gene="VP35"
/note="polymerase complex protein"
/codon_start=1
/product="VP35 matrix protein"
/protein_id="AIG95885.1"
/db_xref="GI:667852491"
/translation="MTTRTKGRGHTVATTQNDRMPGPELSGWISEQLMTGRIPVNDIF
CDIENNPGLCYASQMQQTKPNPKMRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLDSTTSLTEENFGKPD
ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
TLGLKI"
gene 4346..5850
/gene="VP40"
mRNA 4346..5850
/gene="VP40"
/product="matrix protein"
misc_signal 4346..4357
/gene="VP35"
/note="transcription start signal"
polyA_signal 4353..4363
/gene="VP35"
CDS 4435..5415
/gene="VP40"
/codon_start=1
/product="matrix protein"
/protein_id="AIG95886.1"
/db_xref="GI:667852492"
/translation="MRRVILPTAPPEYMEAIYPARSNSTIARGGNSNTGFLTPESVNG
DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVVEK"
polyA_signal 5839..5850
/gene="VP40"
gene 5856..8261
/gene="GP"
mRNA 5856..8261
/gene="GP"
/product="ssGP"
/note="unedited mRNA"
misc_signal 5856..5867
/gene="GP"
/note="putative transcription start signal"
CDS join(5995..6879,6879..8024)
/gene="GP"
/ribosomal_slippage
/note="additional a residue inserted during transcription;
encodes two disulfide linked subunits GP1 and GP2;
receptor binding and fusion"
/codon_start=1
/product="virion spike glycoprotein precursor"
/protein_id="AIG95887.1"
/db_xref="GI:667852493"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNGPKNISGQSPARTSSDPETNT
TNEDHKIMASENSSAMVQVHSQGRKAAVSHLTTLATISTSPQPPTTKTGPDNSTHNTP
VYKLDISEATQVGQHHRRADNDSTASDTPPATTAAGPLKAENTNTSKSADSLDLATTT
SPQNYSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREVIVNAQ
PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETT
QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
CDS 5995..7089
/gene="GP"
/note="small non-structural secreted glycoprotein; sGP
secreted as an anti-parallel oriented homodimer"
/codon_start=1
/product="sGP"
/protein_id="AIG95888.1"
/db_xref="GI:667852494"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTDPKTSVVRVRRELLPTQRPTQ
QMKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
CDS join(5995..6879,6881..6889)
/gene="GP"
/ribosomal_slippage
/note="second non-structural secreted glycoprotein;
secreted in a monomeric form; one a residue is deleted or
two additional a residues are inserted at the editing site
during transcription of the GP gene"
/codon_start=1
/product="ssGP"
/protein_id="AIG95889.1"
/db_xref="GI:667852495"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKPH"
gene 8244..9696
/gene="VP30"
mRNA 8244..9696
/gene="VP30"
/product="VP30 minor nucleoprotein"
misc_signal 8244..8255
/gene="VP30"
/note="putative transcription start signal"
polyA_signal 8251..8261
/gene="VP30"
CDS 8465..9331
/gene="VP30"
/note="minor nucleoprotein; polymerase complex protein"
/codon_start=1
/product="VP30 minor nucleoprotein"
/protein_id="AIG95890.1"
/db_xref="GI:667852496"
/translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
PGTCSWSDEGTP"
polyA_signal 9686..9696
/gene="VP30"
/note="putative"
gene 9841..11474
/gene="VP24"
/note="putative"
mRNA 9841..11452
/gene="VP24"
/product="VP24 membrane-associated protein"
misc_signal 9841..9852
/gene="VP24"
/note="transcription start signal"
CDS 10301..11056
/gene="VP24"
/note="membrane-associated protein"
/codon_start=1
/product="VP24 membrane-associated protein"
/protein_id="AIG95891.1"
/db_xref="GI:667852497"
/translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRKKPGPAK
FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
polyA_signal 11441..11452
/gene="VP24"
/note="putative"
gene 11457..18238
/gene="L"
mRNA 11457..18238
/gene="L"
/product="polymerase"
misc_signal 11457..11468
/gene="VP24"
/note="transcription start signal"
polyA_signal 11464..11474
/gene="VP24"
/note="putative"
CDS 11537..18175
/gene="L"
/note="polymerase; synthesis of viral RNAs;
transcriptional RNA editing"
/codon_start=1
/product="polymerase"
/protein_id="AIG95892.1"
/db_xref="GI:667852498"
/translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPILLKALSGNGFCPVEPRCQQFLDE
IIKYTMQDALFLKYYLKNVGAQEDCVDDHFQEKILSSIQGNEFLHQMFFWYDLAILTR
RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISLLPLNTQGIPHAAMDWYQTSVF
KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEVEDPVCSDYPNFKIVSML
YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
TEIRALKPSQAHKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
PHNLTLENRNNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
GDNQCITVLSVFPLETDAGEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVVWLKP
YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
LIYDNNPLKGGLNCNISFDNPFFQGKQLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLHQIVELLVHDSSRHQAF
KTTINDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKDLTRNSSTGSSTNNS
DGHIKRSQEQTTRDPHDGTERSLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
ANPKLNFDRSRHNVKSQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIENFKSAVTLAEGEGAGAL
LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
ITDITNPTWFKDQRARLPRQVEVITMDAETTENINRSKLYEAVHKLILHHVDPSVLKA
VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCDLHLSYIRLGFPSLEKVLYHRYNL
VDSKRGPLVSVTQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
VKLIERLTGLLSLFPDGLYRFD"
polyA_signal 18228..18238
/gene="L"
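
As a rough sanity check on the CDS coordinates in the feature table above, the sketch below (Python, not part of the GenBank record) parses location strings such as 426..2645 or join(5995..6879,6879..8024) and counts the codons they span. The function names are made up for illustration, and it only understands plain spans and join(); complement() and partial locations are not handled specially.

```python
import re


def parse_location(location):
    """Return a list of (start, end) pairs, 1-based and inclusive.

    Handles 'a..b' and 'join(a..b,c..d)' as they appear in the record.
    Hypothetical helper for illustration only.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def location_length(location):
    """Total number of bases covered by all spans in the location."""
    return sum(end - start + 1 for start, end in parse_location(location))


def codon_count(location):
    """Number of codons (including the stop codon) in a CDS location."""
    length = location_length(location)
    if length % 3 != 0:
        raise ValueError(f"{location}: length {length} is not a multiple of 3")
    return length // 3


if __name__ == "__main__":
    # CDS locations copied from the feature table in the post above.
    for gene, loc in [
        ("NP", "426..2645"),
        ("VP35", "3085..4107"),
        ("VP40", "4435..5415"),
        ("GP", "join(5995..6879,6879..8024)"),
        ("sGP", "5995..7089"),
        ("ssGP", "join(5995..6879,6881..6889)"),
        ("VP30", "8465..9331"),
        ("VP24", "10301..11056"),
        ("L", "11537..18175"),
    ]:
        print(gene, location_length(loc), "nt,", codon_count(loc), "codons")
```

For NP, 426..2645 covers 2220 bases, i.e. 740 codons (739 residues plus the stop). For the full-length GP the join uses base 6879 twice, which appears to be how the record represents the extra A added at the editing site, giving 2031 bases or 677 codons.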



 
Posted: Tue Jul 29, 2014 9:59 am

1 cgaataacta tgaggaagat taataatttt cctctcattg aaatttatat cggaatttaa
61 attgaaattg ttactgtaat catacctggt ttgtttcaga gccatatcac caagatagag
121 aacaacctag gtctccggag ggggcaaggg catcagtgtg ctcagttgaa aatcccttgt
181 caacatctag gccttatcac atcacaagtt ccgccttaaa ctctgcaggg tgatccaaca
241 accttaatag caacattatt gttaaaggac agcattagtt cacagtcaaa caagcaagat
301 tgagaattaa ctttgatttt gaacctgaac acccagagga ctggagactc aacaacccta
361 aagcctgggg taaaacatta gaaatagttt aaagacaaat tgctcggaat cacaaaattc
421 cgagtatgga ttctcgtcct cagaaagtct ggatgacgcc gagtctcact gaatctgaca
481 tggattacca caagatcttg acagcaggtc tgtccgttca acaggggatt gttcggcaaa
541 gagtcatccc agtgtatcaa gtaaacaatc ttgaggaaat ttgccaactt atcatacagg
601 cctttgaagc tggtgttgat tttcaagaga gtgcggacag tttccttctc atgctttgtc
661 ttcatcatgc gtaccaagga gattacaaac ttttcttgga aagtggcgca gtcaagtatt
721 tggaagggca cgggttccgt tttgaagtca agaagtgtga tggagtgaag cgccttgagg
781 aattgctgcc agcagtatct agtgggagaa acattaagag aacacttgct gccatgccgg
841 aagaggagac gactgaagct aatgccggtc agttcctctc ctttgcaagt ctattccttc
901 cgaaattggt agtaggagaa aaggcttgcc ttgagaaggt tcaaaggcaa attcaagtac
961 atgcagagca aggactgata caatatccaa cagcttggca atcagtagga cacatgatgg
1021 tgattttccg tttgatgcga acaaattttt tgatcaaatt tcttctaata caccaaggga
1081 tgcacatggt tgccggacat gatgccaacg atgctgtgat ttcaaattca gtggctcaag
1141 ctcgtttttc aggtctattg attgtcaaaa cagtacttga tcatatccta caaaagacag
1201 aacgaggagt tcgtctccat cctcttgcaa ggaccgccaa ggtaaaaaat gaggtgaact
1261 ccttcaaggc tgcactcagc tccctggcca agcatggaga gtatgctcct ttcgcccgac
1321 ttttgaacct ttctggagta aataatcttg agcatggtct tttccctcaa ctgtcggcaa
1381 ttgcactcgg agtcgccaca gcccacggga gcaccctcgc aggagtaaat gttggagaac
1441 agtatcaaca gctcagagag gcagccactg aggctgagaa gcaactccaa caatatgcgg
1501 agtctcgtga acttgaccat cttggacttg atgatcagga aaagaaaatt cttatgaact
1561 tccatcagaa aaagaacgaa atcagcttcc agcaaacaaa cgcgatggta actctaagaa
1621 aagagcgcct ggccaagctg acagaagcta tcactgctgc atcactgccc aaaacaagtg
1681 gacattacga tgatgatgac gacattccct ttccaggacc catcaatgat gacgacaatc
1741 ctggccatca agatgatgat ccgactgact cacaggatac gaccattccc gatgtggtag
1801 ttgaccccga tgatggaggc tacggcgaat accaaagtta ctcggaaaac ggcatgagtg
1861 caccagatga cttggtccta ttcgatctag acgaggacga cgaggacacc aagccagtgc
1921 ctaacagatc gaccaagggt ggacaacaga aaaacagtca aaagggccag catacagagg
1981 gcagacagac acaatccacg ccaactcaaa acgtcacagg ccctcgcaga acaatccacc
2041 atgccagtgc tccactcacg gacaatgaca gaagaaacga accctccggc tcaaccagcc
2101 ctcgcatgct gaccccaatc aacgaagagg cagacccact ggacgatgcc gacgacgaga
2161 cgtctagcct tccgccctta gagtcagatg atgaagaaca ggacagggac ggaacttcta
2221 accgcacacc cactgtcgcc ccaccggctc ccgtatacag agatcactcc gaaaagaaag
2281 aactcccgca agatgaacaa caagatcagg accacattca agaggccagg aaccaagaca
2341 gtgacaacac ccagccagaa cattcttttg aggagatgta tcgccacatt ctaagatcac
2401 aggggccatt tgatgccgtt ttgtattatc atatgatgaa ggatgagcct gtagttttca
2461 gtaccagtga tggtaaagag tacacgtatc cggactccct tgaagaggaa tatccaccat
2521 ggctcactga aaaagaggcc atgaatgatg agaatagatt tgttacactg gatggtcaac
2581 aattttattg gccagtaatg aatcacagga ataaattcat ggcaatcctg caacatcatc
2641 agtgaatgag catgtaataa tgggatgatt taatcgacaa atagctaaca ttaaatagtc
2701 aaggaacgca aacaggaaga atttttgatg tctaaggtgt gaattattat cacaataaaa
2761 gtgattctta gttttgaatt taaagctagc ttattattac tagccgtttt tcaaagttca
2821 atttgagtct taatgcaaat aagcgttaag ccacagttat agccataatg gtaactcaat
2881 atcttagcca gcgatttatc taaattaaat tacattatgc ttttataact tacctactag
2941 cctgcccaac atttacacga tcgttttata attaagaaaa aactaatgat gaagattaaa
3001 accttcatca tccttacgtc aattgaattc tctagcacta gaagcttatt gtcttcaatg
3061 taaaagaaaa gctggcctaa caagatgaca actagaacaa agggcagggg ccatactgtg
3121 gccacgactc aaaacgacag aatgccaggc cctgagcttt cgggctggat ctctgagcag
3181 ctaatgaccg gaaggattcc tgtaaacgac atcttctgtg atattgagaa caatccagga
3241 ttatgctacg catcccaaat gcaacaaacg aagccaaacc cgaagatgcg caacagtcaa
3301 acccaaacgg acccaatttg caatcatagt tttgaggagg tagtacaaac attggcttca
3361 ttggctactg ttgtgcaaca acaaaccatc gcatcagaat cattagaaca acgcattacg
3421 agtcttgaga atggtctaaa gccagtttat gatatggcaa aaacaatctc ctcattgaac
3481 agggtttgtg ctgagatggt tgcaaaatat gatcttctgg tgatgacaac cggtcgggca
3541 acagcaaccg ctgcggcaac tgaggcttat tgggctgaac atggtcaacc accacctgga
3601 ccatcacttt atgaagaaag tgcgattcgg ggtaagattg aatctagaga tgagactgtc
3661 cctcaaagtg ttagggaggc attcaacaat ctagacagta ccacttcact aactgaggaa
3721 aattttggga aacctgacat ttcggcaaag gatttgagaa acattatgta tgatcacttg
3781 cctggttttg gaactgcttt ccaccaatta gtacaagtga tttgtaaatt gggaaaagat
3841 agcaattcat tggacattat tcatgctgag ttccaggcca gcctggctga aggagactcc
3901 cctcaatgtg ccctaattca aattacaaaa agagttccaa tcttccaaga tgctgctcca
3961 cctgtcatcc acatccgctc tcgaggtgac attccccgag cttgccagaa gagcttgcgt
4021 ccagtcccac catcacccaa gattgatcga ggttgggtat gtgtttttca gcttcaagat
4081 ggtaaaacac ttggactcaa aatttgagcc aatctctttt ccctccgaaa gaggcaacta
4141 atagcagagg cttcaactgc tgaactatag ggtatgttac attaatgata cacttgtgag
4201 tatcagccct agataatata agtcaattaa acaaccaaga taaaattgtt catatcccgc
4261 tagcagcttt aaagataaat gtaataggag ctatacctct gacagtatta taattaattg
4321 ttattaagta acccaaacca aaaatgatga agattaagaa aaacctacct cgactgagag
4381 agtgtttttt cattaacctt catcttgtaa acgttgagca aaattgttaa aaatatgagg
4441 cgggttatat tgcctactgc tcctcctgaa tatatggagg ccatataccc tgccaggtca
4501 aattcaacaa ttgctagggg tggcaacagc aatacaggct tcctgacacc ggagtcagtc
4561 aatggagaca ctccatcgaa tccactcagg ccaattgctg atgacaccat cgaccatgcc
4621 agccacacac caggcagtgt gtcatcagca ttcatcctcg aagctatggt gaatgtcata
4681 tcgggcccca aagtgctaat gaagcaaatt ccaatttggc ttcctctagg tgtcgctgat
4741 caaaagacct acagctttga ctcaactacg gccgccatca tgcttgcttc atatactatc
4801 acccatttcg gcaaggcaac caatccgctt gtcagagtca atcggctggg tcctggaatc
4861 ccggatcacc ccctcaggct cctgcgaatt ggaaaccagg ctttcctcca ggagttcgtt
4921 cttccaccag tccaactacc ccagtatttc acctttgatt tgacagcact caaactgatc
4981 actcaaccac tgcctgctgc aacatggacc gatgacactc caactggatc aaatggagcg
5041 ttgcgtccag gaatttcatt tcatccaaaa cttcgcccca ttcttttacc caacaaaagt
5101 gggaagaagg ggaacagtgc cgatctaaca tctccggaga aaatccaagc aataatgact
5161 tcactccagg actttaagat cgttccaatt gatccaacca aaaatatcat gggtatcgaa
5221 gtgccagaaa ctctggtcca caagctgacc ggtaagaagg tgacttccaa aaatggacaa
5281 ccaatcatcc ctgttctttt gccaaagtac attgggttgg acccggtggc tccaggagac
5341 ctcaccatgg taatcacaca ggattgtgac acgtgtcatt ctcctgcaag tcttccagct
5401 gtggttgaga agtaattgca ataattgact cagatccagt tttacagaat cttctcaggg
5461 atagtgataa catcttttta ataatccgtc tactagaaga gatacttcta attgatcaat
5521 atactaaagg tgctttacac cattgtctct tttctctcct aaatgtagag cttaacaaaa
5581 gactcataat atacctgttt ttaaaagatt gattgatgaa agatcatgac taataacatt
5641 acaaacaatc ctactataat caatacggtg attcaaatgt caatctttct cattgcacat
5701 actctttgtc cttatcctca aattgcctac atgcttacat ctgaggacag ccagtgtgac
5761 ttggattgga gatgtggagg aaaaatcggg gcccatttct aagttgttca caatctaagt
5821 acagacattg ctcttctaat taagaaaaaa tcggcgatga agattaagcc gacagtgagc
5881 gtaatcttca tctctcttag attatttgtc ttccagagta ggggtcatca ggtccttttc
5941 aattggataa ccaaaataag cttcactaga aggatattgt gaggcgacaa cacaatgggt
6001 gttacaggaa tattgcagtt acctcgtgat cgattcaaga ggacatcatt ctttctttgg
6061 gtaattatcc ttttccaaag aacattttcc atcccgcttg gagttatcca caatagtaca
6121 ttacaggtta gtgatgtcga caaactagtt tgtcgtgaca aactgtcatc cacaaatcaa
6181 ttgagatcag ttggactgaa tctcgagggg aatggagtgg caactgacgt gccatctgtg
6241 actaaaagat ggggcttcag gtccggtgtc ccaccaaagg tggtcaatta tgaagctggt
6301 gaatgggctg aaaactgcta caatcttgaa atcaaaaaac ctgacgggag tgagtgtcta
6361 ccagcagcgc cagacgggat tcggggcttc ccccggtgcc ggtatgtgca caaagtatca
6421 ggaacgggac catgtgccgg agactttgcc ttccacaaag agggtgcttt cttcctgtat
6481 gatcgacttg cttccacagt tatctaccga ggaacgactt tcgctgaagg tgtcgttgca
6541 tttctgatac tgccccaagc taagaaggac ttcttcagct cacacccctt gagagagccg
6601 gtcaatgcaa cggaggaccc gtcgagtggc tattattcta ccacaattag atatcaggct
6661 accggttttg gaactaatga gacagagtac ttgttcgagg ttgacaattt gacctacgtc
6721 caacttgaat caagattcac accacagttt ctgctccagc tgaatgagac aatatatgca
6781 agtgggaaga ggagcaacac cacgggaaaa ctaatttgga aggtcaaccc cgaaattgat
6841 acaacaatcg gggagtgggc cttctgggaa actaaaaaaa cctcactaga aaaattcgca
6901 gtgaagagtt gtctttcaca gctgtatcaa acggacccaa aaacatcagt ggtcagagtc
6961 cggcgcgaac ttcttccgac ccagagacca acacaacaaa tgaagaccac aaaatcatgg
7021 cttcagaaaa ttcctctgca atggttcaag tgcacagtca aggaaggaaa gctgcagtgt
7081 cgcatctgac aacccttgcc acaatctcca cgagtcctca acctcccaca accaaaacag
7141 gtccggacaa cagcacccat aatacacccg tgtataaact tgacatctct gaggcaactc
7201 aagttggaca acatcaccgt agagcagaca acgacagcac agcctccgac actccccccg
7261 ccacgaccgc agccggaccc ttaaaagcag agaacaccaa cacgagtaag agcgctgact
7321 ccctggacct cgccaccacg acaagccccc aaaactacag cgagactgct ggcaacaaca
7381 acactcatca ccaagatacc ggagaagaga gtgccagcag cgggaagcta ggcttaatta
7441 ccaatactat tgctggagta gcaggactga tcacaggcgg gagaaggact cgaagagaag
7501 taattgtcaa tgctcaaccc aaatgcaacc ccaatttaca ttactggact actcaggatg
7561 aaggtgctgc aatcggattg gcctggatac catatttcgg gccagcagcc gaaggaattt
7621 acacagaggg gctaatgcac aaccaagatg gtttaatctg tgggttgagg cagctggcca
7681 acgaaacgac tcaagctctc caactgttcc tgagagccac aactgagctg cgaacctttt
7741 caatcctcaa ccgtaaggca attgacttcc tgctgcagcg atggggtggc acatgccaca
7801 ttttgggacc ggactgctgt atcgaaccac atgattggac caagaacata acagacaaaa
7861 ttgatcagat tattcatgat tttgttgata aaacccttcc ggaccagggg gacaatgaca
7921 attggtggac aggatggaga caatggatac cggcaggtat tggagttaca ggtgttataa
7981 ttgcagttat cgctttattc tgtatatgca aatttgtctt ttagtctttc ttcagattgt
8041 ttcacggcaa aactcaacct caaatcaatg aaactaggat ttaattatat gaatcacttg
8101 aatctaagat tacttgacaa atgataacat aatacactgg agcttcaaac atagccaatg
8161 tgattctaac tcctttaaac tcacagttaa tcataaacaa ggtttgacat caatctagct
8221 atatctttaa gaatgataaa cttgatgaag attaagaaaa aggtaatctt tcgattatct
8281 ttagtcttca tccttgattc tacaatcatg acagttgtct ttaatgaaaa aggaaaaaag
8341 cctttttatt aagttgtaat aatcagatct gcaaaccggt agaatttagt tgtaacctaa
8401 cacacacaaa gcattggtaa aaaagtcaat agaaatttaa acagtgagtg cagacaactc
8461 ttaaatggaa gcttcatatg agagaggacg cccccgagct gccagacagc attcaaggga
8521 tggacacgac caccatgttc gagcacgatc atcatccaga gagaattatc gaggtgagta
8581 ccgtcaatca aggagcgcct cacaagtgcg cgttcctact gtatttcata agaagagagt
8641 tgaaccatta acagttcctc cagcacctaa agacatatgt ccgaccttga aaaaaggatt
8701 tttgtgtgac agtagttttt gcaaaaaaga ccaccagtta gaaagtttaa ctgataggga
8761 attactccta ctaatcgccc gtaagacttg tggatcagta gaacaacaat taaatataac
8821 tgcacccaag gactcgcgct tagcaaatcc aacggctgat gatttccagc aagaggaagg
8881 tcccaaaatt accttgttga cactgatcaa gacggcagaa cactgggcga gacaagacat
8941 ccgaaccata gaggattcca aattaagggc attgttaact ctatgtgctg tgatgacgag
9001 gaaattctca aaatcccagc tgagtctttt gtgtgagaca cacctaaggc gcgaagggct
9061 tgggcaagat caggcagaac ccgttctcga agtatatcaa cgattacaca gtgataaagg
9121 aggcagtttt gaagctgcac tatggcaaca atgggaccga caatccctaa ttatgtttat
9181 cactgcattc ttgaatatcg ctctccagtt accgtgtgaa agttctgctg tcgttgtttc
9241 agggttaaga acattggttc ctcaatcaga taatgaggaa gcttcaacca acccggggac
9301 atgctcatgg tctgatgagg gtacccctta ataaggctga ctaaaacact atataacctt
9361 ctacttgatc acaatactcc gtatacctat catcatatat ttaatcaaga cgatatcctt
9421 taaaacttat tcagtactat aatcactctc atttcaaatt gataagatat gcataattgc
9481 cttaatatat aaagaggtat gatataaccc aaacattgac caaagaaaat cataatctcg
9541 tatcgctcgc aatataacct gccaagcata cctcttgcac aaagtgattc ttgtacacaa
9601 ataatgtttg actctacagg aggtagcaac gatccatctc atcaaaaaat aagtatttta
9661 tgatttacta atgatctctt aaaatattaa gaaaaactga cggaacataa attctttctg
9721 cttcaagttg tggaggaggt ctatggtatt cgctattgtt atattacaat caataacaag
9781 cttgtaaaaa tattgttctt gtttcaggag gtatattgtg accggaaaag ctaaactaat
9841 gatgaagatt aatgcggagg tctgatgaga ataaacctta ttattcagat taggccccaa
9901 gaggcattct tcatctcctt ttagcaaaat actatttcag gatagtccag ctagtgacac
9961 gtcttttagc tgtataccag ttgcccctga gatacgccac aaaagtgtct ctgagctaaa
10021 gtggtctgta cacatctcat acattgtatt aggggcaata atatctaatt gaacttagcc
10081 atttaaaatt tagtgcataa atctgggcta actccaccag gtcaactcca ttggctgaaa
10141 agaagcccac ctacaacgaa cattactttg agcaccctca caattaaaaa ataagagcgt
10201 cgttccaaca atcgagcgca aggttacaag gttgaactga gagtgtctag acaacaaaat
10261 atcgatactc cagacaccaa gcaagacctg agaaaaaacc atggccaaag ctacgggacg
10321 atacaatcta atatcgccca aaaaggacct ggagaaaggg gttgtcttaa gcgacctctg
10381 taacttctta gttagtcaaa ctattcaagg gtggaaagtt tattgggctg gtattgagtt
10441 tgatgtgact cacaaaggaa tggccctatt gcatagactg aaaactaatg actttgcccc
10501 tgcatggtca atgacaagga acctatttcc ccatttattt caaaatccga attccactat
10561 tgaatcaccg ctgtgggcac tgagagtcat ccttgcagca gggatacagg accagttaat
10621 tgaccagtct ttgattgaac ccttagcagg agcccttggt ctgatctctg attggctgct
10681 aacaaccaac actaaccact tcaacatgcg aacacaacgt gtcaaggaac aattgagcct
10741 aaaaatgctg tcgttgattc gatccaatat tctcaagttt attaacaaat tggatgctct
10801 acatgtcgtg aactacaatg gattattgag cagtattgaa attggaactc aaaatcatac
10861 aatcatcata actcgaacta acatgggttt tctggtggag ctccaagaac ccgacaaatc
10921 ggcaatgaac cgcaagaagc ctgggccggc gaaattttcc ctccttcatg agtccacact
10981 gaaagcattt acacaagggt cctcgacacg aatgcaaagt ttaattcttg aattcaatag
11041 ctctcttgct atctaactaa gatggaatac ttcatattgg gctaactcat atatgctgac
11101 tcaatagtta acttgacatc tctgccttca taatcagata tataagcata ataaataaat
11161 actcatattt cttgataatt tgtttaacca cagataaatc ctcactgtaa gccagcttcc
11221 aagttgacac ccttacaaaa accaggactc agaatccctc aaataagaga ttccaagaca
11281 acatcataga attgctttat tatattaata agcattttat cactagaaat ccaatatacg
11341 aaatggttaa ttgtaactaa acccgcaggt catgtgtgtt aggtttcaca aattatatat
11401 attactaact ccatactcgt aactaacatt agataagtag gttaagaaaa aagcttgagg
11461 aagattaaga aaaactgctt attgggtctt tccgtgtttt agatgaagca gttgacattc
11521 ttcctcttga tattaaatgg ctacacaaca tacccaatac ccagacgcca ggttatcatc
11581 accaattgta ttggaccaat gtgaccttgt cactagagct tgcgggttgt attcatcata
11641 ctcccttaat ccgcaactac gcaactgtaa actcccgaaa catatatacc gtttaaaata
11701 tgatgtaact gttaccaagt tcttaagtga tgtaccagtg gcgacattgc ccatagattt
11761 catagtccca attcttctca aggcactatc aggcaatggg ttctgtcctg ttgagccgcg
11821 gtgccaacag ttcttagatg aaattattaa gtacacaatg caagatgctc tcttcctgaa
11881 atattatctc aaaaatgtgg gtgctcaaga agactgtgtt gatgaccact ttcaagaaaa
11941 aatcttatct tcaattcagg gcaatgaatt tttacatcaa atgtttttct ggtatgacct
12001 ggctatttta actcgaaggg gtagattaaa tcgaggaaac tctagatcaa cgtggtttgt
12061 tcatgatgat ttaatagaca tcttaggcta tggggactat gttttttgga agatcccaat
12121 ttcactgtta ccactgaaca cacaaggaat cccccatgct gctatggatt ggtatcagac
12181 atcagtattc aaagaagcgg ttcaagggca tacacacatt gtttctgttt ctactgccga
12241 tgtcttgata atgtgcaaag atttaattac atgtcgattc aacacaactc taatctcaaa
12301 aatagcagag gttgaggacc cagtttgctc tgattatccc aattttaaga ttgtgtctat
12361 gctttaccag agcggagatt acttactctc catattaggg tctgatgggt ataaaatcat
12421 taagtttctc gaaccattgt gcttggctaa aattcaattg tgctcaaagt acaccgagag
12481 gaagggccga ttcttaacac aaatgcattt agctgtaaat cacaccctgg aagaaattac
12541 agaaatacgt gcactaaagc cttcacaggc tcacaagatc cgtgaattcc atagaacatt
12601 gataaggctg gagatgacgc cacaacaact ttgtgagcta ttttccatac aaaaacactg
12661 ggggcatcct gtgctacata gtgaaacagc aatccaaaaa gttaaaaaac atgctacggt
12721 gctaaaagca ttacgcccta tcgtgatttt cgagacatat tgtgttttta aatatagcat
12781 tgcaaaacat tattttgata gtcaaggatc ttggtacagt gttacctcag atagaaatct
12841 aacaccaggt cttaattctt atatcaaaag aaatcaattc cctccgttgc caatgattaa
12901 agaactgcta tgggaatttt accaccttga ccatcctcca cttttctcaa ccaaaattat
12961 tagtgactta agtattttta taaaagacag agctactgca gtagaaagga catgctggga
13021 tgcagtattc gagcctaatg ttctgggata taatccacct cacaaattca gtaccaaacg
13081 tgtaccggaa caatttttag agcaagaaaa cttttctatt gagaatgttc tttcctacgc
13141 gcaaaaactc gagtatctac taccacaata tcggaatttt tctttctcat tgaaagagaa
13201 agagttgaat gtaggtagaa ctttcggaaa attgccttat ccgactcgca atgttcaaac
13261 actttgtgaa gctctgttag ctgatggtct tgctaaagca tttcctagca atatgatggt
13321 agttacggaa cgtgaacaaa aagaaagctt attgcatcaa gcatcatggc accacacaag
13381 tgatgatttc ggtgagcatg ccacagttag agggagtagc tttgtaactg atttagagaa
13441 atacaatctt gcatttaggt atgagtttac agcacctttt atagaatatt gcaaccgttg
13501 ctatggtgtt aagaatgttt ttaattggat gcattataca atcccacagt gttatatgca
13561 tgtcagtgat tattataatc caccgcataa cctcacactg gaaaatcgaa acaacccccc
13621 tgaagggcct agttcataca ggggtcatat gggagggatt gaaggactgc aacaaaaact
13681 ctggacaagt atttcatgtg ctcaaatttc tttagttgaa attaagactg gttttaagtt
13741 gcgctcagct gtgatgggtg acaatcagtg cattaccgtt ttatcagtct tccccttaga
13801 gactgatgca ggcgagcagg aacagagcgc cgaggacaat gcagcgaggg tggccgccag
13861 cctagcaaaa gttacaagtg cctgtggaat ctttttaaaa cctgatgaaa catttgtaca
13921 ttcaggtttt atctattttg gaaaaaaaca atatttgaat ggggtccaat tgcctcagtc
13981 ccttaaaacg gctacaagaa tggcaccatt gtctgatgca atttttgatg atcttcaagg
14041 gaccctggct agtataggta ctgcttttga gcgatccatc tctgagacac gacatatctt
14101 tccttgcaga ataaccgcag ctttccatac gttcttttcg gtgagaatct tgcaatatca
14161 tcacctcgga tttaataaag gttttgacct tggacagtta acactcggca aacctctgga
14221 tttcggaaca atatcattgg cactagcggt accgcaggtg cttggagggt tatccttctt
14281 gaatcctgag aaatgtttct accggaatct aggagatcca gttacctcag gtttattcca
14341 gttaaaaact tatctccgaa tgattgagat ggatgattta ttcttacctt taattgcgaa
14401 gaaccctggg aactgcactg ccattgactt tgtgctaaat cctagcggat taaatgttcc
14461 tgggtcgcaa gacttaactt catttctgcg ccagattgta cgtaggacta tcaccctaag
14521 tgcgaaaaac aaacttatta ataccttatt tcatgcatca gctgacttcg aagacgaaat
14581 ggtttgtaag tggctcttat catcaactcc tgttatgagt cgtttcgcag ccgatatatt
14641 ttcacgcacg ccgagcggga agcgattgca aattctagga tacttggaag gaacacgcac
14701 attattagcc tctaagatca tcaacaataa tacagagacg ccggttttgg acagactgag
14761 gaagataaca ttgcaaaggt ggagtctatg gtttagttat cttgatcatt gtgataatat
14821 cctggcggag gctttaaccc aaataacttg cacagttgat ttagcacaga tcctgaggga
14881 atattcatgg gcacatattt tagaggggag acctcttatt ggagccacac tcccatgtat
14941 gattgagcaa ttcaaagtgg tttggctgaa accctacgaa caatgtccgc agtgttcaaa
15001 tgccaagcaa cctggtggga aaccattcgt gtcagtagca gtcaagaaac atattgttag
15061 tgcatggcca aatgcatccc gaataagctg gactatcggg gatggaatcc catacattgg
15121 atcaaggaca gaagataaga tagggcaacc tgctattaaa ccaaaatgtc cttccgcagc
15181 cttaagagag gccattgaat tggcgtcccg tttaacatgg gtaactcaag gcagttcgaa
15241 cagtgacttg ctaataaaac catttttgga agcacgagta aatttaagtg ttcaagaaat
15301 acttcaaatg accccttcac attactcggg aaatattgtt cataggtaca acgatcaata
15361 cagtcctcat tctttcatgg ccaatcgtat gagtaactca gcaacgcgat tgattgtttc
15421 tacaaacact ttaggtgagt tttcaggagg tggccaatcg gcacgcgaca gcaatattat
15481 tttccagaat gttataaatt atgcagttgc actgttcgat attaaattta gaaacactga
15541 ggctacagat atccagtata atcgtgctca ccttcatcta actaagtgtt gcacccggga
15601 ggtaccagct cagtacttaa catacacatc tacattggat ttagatttaa caagataccg
15661 agaaaatgaa ttgatttatg acaataatcc tctaaaagga ggactcaatt gcaatatctc
15721 atttgataac ccatttttcc aaggcaaaca gctgaacatt atagaagatg accttattcg
15781 actgcctcac ttatctggat gggagctagc taagaccatc atgcaatcaa ttatttcaga
15841 tagcaataat tcgtctacag acccaattag cagtggagaa acaagatcat tcactaccca
15901 tttcttaact tatcccaaaa taggacttct gtacagtttt ggggcctttg taagttatta
15961 tcttggcaat acaattcttc ggactaagaa attaacactt gacaattttt tatattactt
16021 aactacccaa attcataatc taccacatcg ctcattgcga atacttaagc caacattcaa
16081 acatgcaagc gttatgtcac gattaatgag tattgatccc catttttcta tttacatagg
16141 cggtgctgca ggtgacagag gactctcaga tgcggccagg ttatttttga gaacgtccat
16201 ttcatctttt cttacatttg taaaggaatg gataattaat cgcggaacaa ttgtcccttt
16261 atggatagta tatccattag agggtcaaaa tccaacacct gttaataatt tcctccatca
16321 gatcgtagaa ctgctggtgc atgattcatc aagacaccag gcttttaaaa ctaccataaa
16381 tgatcatgta catcctcacg acaatcttgt ttacacatgt aagagtacag ccagcaattt
16441 cttccatgcg tcattggcgt actggaggag caggcacaga aacagcaacc gaaaagactt
16501 gacaagaaac tcttcaactg gatcaagcac aaacaacagt gatggtcata ttaagagaag
16561 tcaagaacaa accaccagag atccacatga tggcactgaa cggagtctag tcctgcaaat
16621 gagccatgaa ataaaaagaa cgacaattcc acaagagaac acgcaccagg gtccgtcgtt
16681 ccagtcattt ctaagtgact ctgcttgcgg tacagcaaac ccaaaactaa atttcgatag
16741 atcgagacac aatgtgaaat ctcaggatca taactcagca tccaagaggg aaggtcatca
16801 aataatctca catcgtctag tcctaccttt ctttacatta tctcaaggga cacgccaatt
16861 aacgtcatcc aatgagtcac aaacccaaga tgagatatca aagtacttac ggcaattgag
16921 atccgtcatt gataccacag tttattgtag gtttaccggt atagtctcgt ccatgcatta
16981 caaacttgat gaggtccttt gggaaataga gaattttaag tcggctgtga cgctggcaga
17041 gggagaaggt gctggtgcct tactattgat tcagaaatac caagttaaga ccttattctt
17101 caacacgcta gctactgagt ccagtataga gtcagaaata gtatcaggaa tgactactcc
17161 taggatgctt ctacctgtta tgtcaaaatt ccataatgac caaattgaga ttattcttaa
17221 caactcagca agccaaataa cagacataac aaatcctact tggtttaaag accaaagagc
17281 aaggctacct aggcaagtcg aggttataac catggatgca gagacgacag agaatataaa
17341 cagatcgaaa ttgtacgaag ctgtacataa attgatctta caccatgttg atcccagcgt
17401 attgaaagca gtggtcctta aagtctttct aagtgatacc gagggtatgt tatggctaaa
17461 tgataatcta gccccgtttt ttgccactgg gtatttaatt aagccaataa cgtcaagtgc
17521 caggtctagt gagtggtatc tttgtctgac gaacttctta tcaactacac gtaagatgcc
17581 acaccaaaac catctcagtt gtaagcaggt aatacttacg gcattgcaac tgcaaattca
17641 acggagccca tactggctaa gtcatttaac tcagtatgct gactgcgatt tacatttaag
17701 ctatatccgc cttggttttc catcattaga gaaagtacta taccacaggt ataaccttgt
17761 cgattcaaaa agaggtccac tagtctctgt cactcagcac ttagcacatc ttagggcaga
17821 gattcgagaa ttgaccaatg attataatca acagcgacaa agtcggactc aaacatatca
17881 ctttattcgt actgcaaaag gacgaatcac aaaactagtc aatgattatt taaaattctt
17941 tcttattgta caagcattaa aacataatgg gacatggcaa gctgagttta agaaattacc
18001 agagttgatt agtgtgtgca ataggttcta tcatattaga gattgtaatt gtgaagaacg
18061 tttcttagtt caaaccttat atttacatag aatgcaggat tctgaagtta agcttatcga
18121 aaggctgaca gggcttctga gtttatttcc agatggtctc tacaggttcg attgaataac
18181 cgtgcatagt attttgatac ttgtaaaggt tggttatcaa catacagatt ataaaaaact
18241 cataaattgc tctcatacat catcttgatc tgatttcaat aaataactat ttagataacg
18301 aaaggagtcc ttacattata cactatattt ggcctctctc cctgcgtgat aatcaaaaaa
18361 ttcacaatac agcatgtgtg acatattact gctgcaatga gtctaacgca acataataaa
18421 ctccgcactc tttataatta agctttaacg ataggtctgg gctcatattg ttattgatat
18481 agtaatgttg tatcaatatc ttgccagatg gaatagtgct ttggttgata acacgacttc
18541 ttaaaacaaa actgatcttt aagattaagt tttttataat tgtcattgct ttaatttgtc
18601 gatttaaaaa tggtgatagc cttaatcttt gtgtaaaata agagattagg tgtaataact
18661 ttaacatttt tgtctagtaa gctactattc cattcagaat gataaaatta aaagaaaaga
18721 catgactgta aaatcagaaa taccttcttt acaatatagc agactagata ataatcttcg
18781 tgttaatgat aattaaggca ttgaccacgc tcatcagaag gctcactaga ataaacgttg
18841 caaaaaggat ccctggaaaa atggtcgcac acaaaaattt aaaaataaat ctatttcttc
18901 ttttttgtgt gt
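For anyone copying the sequence out of these posts: each sequence line starts with a 1-based coordinate followed by blocks of up to 10 bases, so the numbers and spaces have to be stripped before the bases can be used. Below is a minimal Python sketch of that cleanup. The function name `parse_origin_lines` is made up for illustration and is not part of any library. The example uses the last two lines of the EBOV_1 record above and checks that the final coordinate matches the 18912 bp stated in its LOCUS line.

```python
def parse_origin_lines(lines):
    """Join GenBank-style sequence lines into one string.

    Each non-empty line is assumed to look like '18901 ttttttgtgt gt':
    a 1-based start coordinate, then whitespace-separated blocks of bases.
    Returns (first_coordinate, sequence).
    """
    first_coord = None
    next_coord = None
    chunks = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        coord = int(fields[0])
        bases = "".join(fields[1:]).lower()
        if first_coord is None:
            first_coord = coord
        elif coord != next_coord:
            # A mismatch means a line was lost or duplicated while copying.
            raise ValueError(f"expected coordinate {next_coord}, got {coord}")
        next_coord = coord + len(bases)
        chunks.append(bases)
    if first_coord is None:
        raise ValueError("no sequence lines found")
    return first_coord, "".join(chunks)


# Last two sequence lines of KM233035 (EBOV_1) as posted above.
tail = [
    "18841 caaaaaggat ccctggaaaa atggtcgcac acaaaaattt aaaaataaat ctatttcttc",
    "18901 ttttttgtgt gt",
]
start, seq = parse_origin_lines(tail)
print(start, len(seq), start + len(seq) - 1)  # 18841 72 18912
```

The coordinate check is there because forum copy and paste can drop or repeat a line, and a silent gap would shift every downstream feature position.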

PostPosted: Tue Jul 29, 2014 10:09 am 
LOCUS KM233036 18873 bp cRNA linear VRL 25-JUL-2014
DEFINITION Zaire ebolavirus strain EBOV_2, partial genome.
ACCESSION KM233036
VERSION KM233036.1 GI:667852500
KEYWORDS .
SOURCE Zaire ebolavirus (ZEBOV)
ORGANISM Zaire ebolavirus
Viruses; ssRNA negative-strand viruses; Mononegavirales;
Filoviridae; Ebolavirus.
REFERENCE 1 (bases 1 to 18873)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Deep sequencing analysis of Ebola virus transmission in Sierra
Leone
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 18873)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Direct Submission
JOURNAL Submitted (25-JUL-2014) Infectious Disease Initiative, Broad
Institute of MIT and Harvard, 75 Ames St., Cambridge, MA 02142, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Novoalign v. v.3
Sequencing Technology :: Illumina; Nextera
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..18873
/organism="Zaire ebolavirus"
/mol_type="viral cRNA"
/strain="EBOV_2"
/host="Homo sapiens"
/db_xref="taxon:186538"
/country="Sierra Leone"
/collection_date="02-Jun-2014"
gene 8..2978
/gene="NP"
mRNA 8..2978
/gene="NP"
/product="nucleoprotein"
misc_signal 8..19
/gene="NP"
/note="putative transcription start signal"
CDS 422..2641
/gene="NP"
/note="encapsidation of genomic RNA"
/codon_start=1
/product="nucleoprotein"
/protein_id="AIG95893.1"
/db_xref="GI:667852501"
/translation="MDSRPQKVWMTPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
GFRFEVKKCDGVKRLEELLPAVSSGRNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
DTTIPDVVVDPDDGGYGEYQSYSENGMSAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
KNSQKGQHTEGRQTQSTPTQNVTGPRRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
QQDQDHIQEARNQDSDNTQPEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
GKEYTYPDSLEEEYPPWLTEKEAMNDENRFVTLDGQQFYWPVMNHRNKFMAILQHHQ"
polyA_signal 2967..2978
/gene="NP"
gene 2984..4359
/gene="VP35"
mRNA 2984..4359
/gene="VP35"
/product="VP35 matrix protein"
misc_signal 2984..2995
/gene="VP35"
/note="putative transcription start signal"
CDS 3081..4103
/gene="VP35"
/note="polymerase complex protein"
/codon_start=1
/product="VP35 matrix protein"
/protein_id="AIG95894.1"
/db_xref="GI:667852502"
/translation="MTTRTKGRGHTVATTQNDRMPGPELSGWISEQLMTGRIPVNDIF
CDIENNPGLCYASQMQQTKPNPKMRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLDSTTSLTEENFGKPD
ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
TLGLKI"
gene 4342..5846
/gene="VP40"
mRNA 4342..5846
/gene="VP40"
/product="matrix protein"
misc_signal 4342..4353
/gene="VP35"
/note="transcription start signal"
polyA_signal 4349..4359
/gene="VP35"
CDS 4431..5411
/gene="VP40"
/codon_start=1
/product="matrix protein"
/protein_id="AIG95895.1"
/db_xref="GI:667852503"
/translation="MRRVILPTAPPEYMEAIYPARSNSTIARGGNSNTGFLTPESVNG
DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVVEK"
polyA_signal 5835..5846
/gene="VP40"
gene 5852..8257
/gene="GP"
mRNA 5852..8257
/gene="GP"
/product="ssGP"
/note="unedited mRNA"
misc_signal 5852..5863
/gene="GP"
/note="putative transcription start signal"
CDS join(5991..6875,6875..8020)
/gene="GP"
/ribosomal_slippage
/note="additional a residue inserted during transcription;
encodes two disulfide linked subunits GP1 and GP2;
receptor binding and fusion"
/codon_start=1
/product="virion spike glycoprotein precursor"
/protein_id="AIG95896.1"
/db_xref="GI:667852504"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNGPKNISGQSPARTSSDPETNT
TNEDHKIMASENSSAMVQVHSQGRKAAVSHLTTLATISTSPQPPTTKTGPDNSTHNTP
VYKLDISEATQVGQHHRRADNDSTASDTPPATTAAGPLKAENTNTSKSADSLDLATTT
SPQNYSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREVIVNAQ
PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETT
QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
CDS 5991..7085
/gene="GP"
/note="small non-structural secreted glycoprotein; sGP
secreted as an anti-parallel oriented homodimer"
/codon_start=1
/product="sGP"
/protein_id="AIG95897.1"
/db_xref="GI:667852505"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTDPKTSVVRVRRELLPTQRPTQ
QMKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
CDS join(5991..6875,6877..6885)
/gene="GP"
/ribosomal_slippage
/note="second non-structural secreted glycoprotein;
secreted in a monomeric form; one a residue is deleted or
two additional a residues are inserted at the editing site
during transcription of the GP gene"
/codon_start=1
/product="ssGP"
/protein_id="AIG95898.1"
/db_xref="GI:667852506"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKPH"
gene 8240..9692
/gene="VP30"
mRNA 8240..9692
/gene="VP30"
/product="VP30 minor nucleoprotein"
misc_signal 8240..8251
/gene="VP30"
/note="putative transcription start signal"
polyA_signal 8247..8257
/gene="VP30"
CDS 8461..9327
/gene="VP30"
/note="minor nucleoprotein; polymerase complex protein"
/codon_start=1
/product="VP30 minor nucleoprotein"
/protein_id="AIG95899.1"
/db_xref="GI:667852507"
/translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
PGTCSWSDEGTP"
polyA_signal 9682..9692
/gene="VP30"
/note="putative"
gene 9837..11470
/gene="VP24"
/note="putative"
mRNA 9837..11448
/gene="VP24"
/product="VP24 membrane-associated protein"
misc_signal 9837..9848
/gene="VP24"
/note="transcription start signal"
CDS 10297..11052
/gene="VP24"
/note="membrane-associated protein"
/codon_start=1
/product="VP24 membrane-associated protein"
/protein_id="AIG95900.1"
/db_xref="GI:667852508"
/translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRKKPGPAK
FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
polyA_signal 11437..11448
/gene="VP24"
/note="putative"
gene 11453..18234
/gene="L"
mRNA 11453..18234
/gene="L"
/product="polymerase"
misc_signal 11453..11464
/gene="VP24"
/note="transcription start signal"
polyA_signal 11460..11470
/gene="VP24"
/note="putative"
CDS 11533..18171
/gene="L"
/note="polymerase; synthesis of viral RNAs;
transcriptional RNA editing"
/codon_start=1
/product="polymerase"
/protein_id="AIG95901.1"
/db_xref="GI:667852509"
/translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPILLKALSGNGFCPVEPRCQQFLDE
IIKYTMQDALFLKYYLKNVGAQEDCVDDHFQEKILSSIQGNEFLHQMFFWYDLAILTR
RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISLLPLNTQGIPHAAMDWYQTSVF
KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEVEDPVCSDYPNFKIVSML
YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
TEIRALKPSQAHKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
PHNLTLENRNNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
GDNQCITVLSVFPLETDAGEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVVWLKP
YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
LIYDNNPLKGGLNCNISFDNPFFQGKQLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLHQIVELLVHDSSRHQAF
KTTINDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKDLTRNSSTGSSTNNS
DGHIKRSQEQTTRDPHDGTERSLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
ANPKLNFDRSRHNVKSQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIENFKSAVTLAEGEGAGAL
LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
ITDITNPTWFKDQRARLPRQVEVITMDAETTENINRSKLYEAVHKLILHHVDPSVLKA
VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCDLHLSYIRLGFPSLEKVLYHRYNL
VDSKRGPLVSVTQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
VKLIERLTGLLSLFPDGLYRFD"
polyA_signal 18224..18234
/gene="L"
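The three GP coding regions in the feature table above share the same start (5991) but use different locations around the editing site at 6875. A quick way to sanity-check them is to add up the segment lengths and confirm each total is a whole number of codons. The Python sketch below does that. The helper names `parse_location` and `coding_length` are hypothetical, and the regex only handles plain `M..N` segments and `join(...)`, not `complement(...)` or partial (`<`, `>`) locations.

```python
import re


def parse_location(location):
    """Return the (start, end) segments of a simple GenBank location.

    Coordinates are 1-based and inclusive, as in the feature table.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def coding_length(location):
    """Total number of nucleotides covered by all segments."""
    return sum(end - start + 1 for start, end in parse_location(location))


# CDS locations copied from the KM233036 (EBOV_2) GP gene above.
gp_cds = {
    "GP": "join(5991..6875,6875..8020)",
    "sGP": "5991..7085",
    "ssGP": "join(5991..6875,6877..6885)",
}

for name, loc in gp_cds.items():
    nt = coding_length(loc)
    codons, remainder = divmod(nt, 3)
    # The final codon is the stop, so the protein is one residue shorter.
    print(name, nt, remainder, codons - 1)
# GP 2031 0 676
# sGP 1095 0 364
# ssGP 894 0 297
```

The protein lengths agree with the posted translations (676, 364 and 297 residues). In the GP join, position 6875 appears in both segments, which is how the record represents the extra "a" described in its note; in the ssGP join, position 6876 is skipped.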

PostPosted: Tue Jul 29, 2014 10:09 am 
1 taactatgag gaagattaat aattttcctc tcattgaaat ttatatcgga atttaaattg
61 aaattgttac tgtaatcata cctggtttgt ttcagagcca tatcaccaag atagagaaca
121 acctaggtct ccggaggggg caagggcatc agtgtgctca gttgaaaatc ccttgtcaac
181 atctaggcct tatcacatca caagttccgc cttaaactct gcagggtgat ccaacaacct
241 taatagcaac attattgtta aaggacagca ttagttcaca gtcaaacaag caagattgag
301 aattaacttt gattttgaac ctgaacaccc agaggactgg agactcaaca accctaaagc
361 ctggggtaaa acattagaaa tagtttaaag acaaattgct cggaatcaca aaattccgag
421 tatggattct cgtcctcaga aagtctggat gacgccgagt ctcactgaat ctgacatgga
481 ttaccacaag atcttgacag caggtctgtc cgttcaacag gggattgttc ggcaaagagt
541 catcccagtg tatcaagtaa acaatcttga ggaaatttgc caacttatca tacaggcctt
601 tgaagctggt gttgattttc aagagagtgc ggacagtttc cttctcatgc tttgtcttca
661 tcatgcgtac caaggagatt acaaactttt cttggaaagt ggcgcagtca agtatttgga
721 agggcacggg ttccgttttg aagtcaagaa gtgtgatgga gtgaagcgcc ttgaggaatt
781 gctgccagca gtatctagtg ggagaaacat taagagaaca cttgctgcca tgccggaaga
841 ggagacgact gaagctaatg ccggtcagtt cctctccttt gcaagtctat tccttccgaa
901 attggtagta ggagaaaagg cttgccttga gaaggttcaa aggcaaattc aagtacatgc
961 agagcaagga ctgatacaat atccaacagc ttggcaatca gtaggacaca tgatggtgat
1021 tttccgtttg atgcgaacaa attttttgat caaatttctt ctaatacacc aagggatgca
1081 catggttgcc ggacatgatg ccaacgatgc tgtgatttca aattcagtgg ctcaagctcg
1141 tttttcaggt ctattgattg tcaaaacagt acttgatcat atcctacaaa agacagaacg
1201 aggagttcgt ctccatcctc ttgcaaggac cgccaaggta aaaaatgagg tgaactcctt
1261 caaggctgca ctcagctccc tggccaagca tggagagtat gctcctttcg cccgactttt
1321 gaacctttct ggagtaaata atcttgagca tggtcttttc cctcaactgt cggcaattgc
1381 actcggagtc gccacagccc acgggagcac cctcgcagga gtaaatgttg gagaacagta
1441 tcaacagctc agagaggcag ccactgaggc tgagaagcaa ctccaacaat atgcggagtc
1501 tcgtgaactt gaccatcttg gacttgatga tcaggaaaag aaaattctta tgaacttcca
1561 tcagaaaaag aacgaaatca gcttccagca aacaaacgcg atggtaactc taagaaaaga
1621 gcgcctggcc aagctgacag aagctatcac tgctgcatca ctgcccaaaa caagtggaca
1681 ttacgatgat gatgacgaca ttccctttcc aggacccatc aatgatgacg acaatcctgg
1741 ccatcaagat gatgatccga ctgactcaca ggatacgacc attcccgatg tggtagttga
1801 ccccgatgat ggaggctacg gcgaatacca aagttactcg gaaaacggca tgagtgcacc
1861 agatgacttg gtcctattcg atctagacga ggacgacgag gacaccaagc cagtgcctaa
1921 cagatcgacc aagggtggac aacagaaaaa cagtcaaaag ggccagcata cagagggcag
1981 acagacacaa tccacgccaa ctcaaaacgt cacaggccct cgcagaacaa tccaccatgc
2041 cagtgctcca ctcacggaca atgacagaag aaacgaaccc tccggctcaa ccagccctcg
2101 catgctgacc ccaatcaacg aagaggcaga cccactggac gatgccgacg acgagacgtc
2161 tagccttccg cccttagagt cagatgatga agaacaggac agggacggaa cttctaaccg
2221 cacacccact gtcgccccac cggctcccgt atacagagat cactccgaaa agaaagaact
2281 cccgcaagat gaacaacaag atcaggacca cattcaagag gccaggaacc aagacagtga
2341 caacacccag ccagaacatt cttttgagga gatgtatcgc cacattctaa gatcacaggg
2401 gccatttgat gccgttttgt attatcatat gatgaaggat gagcctgtag ttttcagtac
2461 cagtgatggt aaagagtaca cgtatccgga ctcccttgaa gaggaatatc caccatggct
2521 cactgaaaaa gaggccatga atgatgagaa tagatttgtt acactggatg gtcaacaatt
2581 ttattggcca gtaatgaatc acaggaataa attcatggca atcctgcaac atcatcagtg
2641 aatgagcatg taataatggg atgatttaat cgacaaatag ctaacattaa atagtcaagg
2701 aacgcaaaca ggaagaattt ttgatgtcta aggtgtgaat tattatcaca ataaaagtga
2761 ttcttagttt tgaatttaaa gctagcttat tattactagc cgtttttcaa agttcaattt
2821 gagtcttaat gcaaataagc gttaagccac agttatagcc ataatggtaa ctcaatatct
2881 tagccagcga tttatctaaa ttaaattaca ttatgctttt ataacttacc tactagcctg
2941 cccaacattt acacgatcgt tttataatta agaaaaaact aatgatgaag attaaaacct
3001 tcatcatcct tacgtcaatt gaattctcta gcactagaag cttattgtct tcaatgtaaa
3061 agaaaagctg gcctaacaag atgacaacta gaacaaaggg caggggccat actgtggcca
3121 cgactcaaaa cgacagaatg ccaggccctg agctttcggg ctggatctct gagcagctaa
3181 tgaccggaag gattcctgta aacgacatct tctgtgatat tgagaacaat ccaggattat
3241 gctacgcatc ccaaatgcaa caaacgaagc caaacccgaa gatgcgcaac agtcaaaccc
3301 aaacggaccc aatttgcaat catagttttg aggaggtagt acaaacattg gcttcattgg
3361 ctactgttgt gcaacaacaa accatcgcat cagaatcatt agaacaacgc attacgagtc
3421 ttgagaatgg tctaaagcca gtttatgata tggcaaaaac aatctcctca ttgaacaggg
3481 tttgtgctga gatggttgca aaatatgatc ttctggtgat gacaaccggt cgggcaacag
3541 caaccgctgc ggcaactgag gcttattggg ctgaacatgg tcaaccacca cctggaccat
3601 cactttatga agaaagtgcg attcggggta agattgaatc tagagatgag actgtccctc
3661 aaagtgttag ggaggcattc aacaatctag acagtaccac ttcactaact gaggaaaatt
3721 ttgggaaacc tgacatttcg gcaaaggatt tgagaaacat tatgtatgat cacttgcctg
3781 gttttggaac tgctttccac caattagtac aagtgatttg taaattggga aaagatagca
3841 attcattgga cattattcat gctgagttcc aggccagcct ggctgaagga gactcccctc
3901 aatgtgccct aattcaaatt acaaaaagag ttccaatctt ccaagatgct gctccacctg
3961 tcatccacat ccgctctcga ggtgacattc cccgagcttg ccagaagagc ttgcgtccag
4021 tcccaccatc acccaagatt gatcgaggtt gggtatgtgt ttttcagctt caagatggta
4081 aaacacttgg actcaaaatt tgagccaatc tcttttccct ccgaaagagg caactaatag
4141 cagaggcttc aactgctgaa ctatagggta tgttacatta atgatacact tgtgagtatc
4201 agccctagat aatataagtc aattaaacaa ccaagataaa attgttcata tcccgctagc
4261 agctttaaag ataaatgtaa taggagctat acctctgaca gtattataat taattgttat
4321 taagtaaccc aaaccaaaaa tgatgaagat taagaaaaac ctacctcgac tgagagagtg
4381 ttttttcatt aaccttcatc ttgtaaacgt tgagcaaaat tgttaaaaat atgaggcggg
4441 ttatattgcc tactgctcct cctgaatata tggaggccat ataccctgcc aggtcaaatt
4501 caacaattgc taggggtggc aacagcaata caggcttcct gacaccggag tcagtcaatg
4561 gagacactcc atcgaatcca ctcaggccaa ttgctgatga caccatcgac catgccagcc
4621 acacaccagg cagtgtgtca tcagcattca tcctcgaagc tatggtgaat gtcatatcgg
4681 gccccaaagt gctaatgaag caaattccaa tttggcttcc tctaggtgtc gctgatcaaa
4741 agacctacag ctttgactca actacggccg ccatcatgct tgcttcatat actatcaccc
4801 atttcggcaa ggcaaccaat ccgcttgtca gagtcaatcg gctgggtcct ggaatcccgg
4861 atcaccccct caggctcctg cgaattggaa accaggcttt cctccaggag ttcgttcttc
4921 caccagtcca actaccccag tatttcacct ttgatttgac agcactcaaa ctgatcactc
4981 aaccactgcc tgctgcaaca tggaccgatg acactccaac tggatcaaat ggagcgttgc
5041 gtccaggaat ttcatttcat ccaaaacttc gccccattct tttacccaac aaaagtggga
5101 agaaggggaa cagtgccgat ctaacatctc cggagaaaat ccaagcaata atgacttcac
5161 tccaggactt taagatcgtt ccaattgatc caaccaaaaa tatcatgggt atcgaagtgc
5221 cagaaactct ggtccacaag ctgaccggta agaaggtgac ttccaaaaat ggacaaccaa
5281 tcatccctgt tcttttgcca aagtacattg ggttggaccc ggtggctcca ggagacctca
5341 ccatggtaat cacacaggat tgtgacacgt gtcattctcc tgcaagtctt ccagctgtgg
5401 ttgagaagta attgcaataa ttgactcaga tccagtttta cagaatcttc tcagggatag
5461 tgataacatc tttttaataa tccgtctact agaagagata cttctaattg atcaatatac
5521 taaaggtgct ttacaccatt gtctcttttc tctcctaaat gtagagctta acaaaagact
5581 cataatatac ctgtttttaa aagattgatt gatgaaagat catgactaat aacattacaa
5641 acaatcctac tataatcaat acggtgattc aaatgtcaat ctttctcatt gcacatactc
5701 tttgtcctta tcctcaaatt gcctacatgc ttacatctga ggacagccag tgtgacttgg
5761 attggagatg tggaggaaaa atcggggccc atttctaagt tgttcacaat ctaagtacag
5821 acattgctct tctaattaag aaaaaatcgg cgatgaagat taagccgaca gtgagcgtaa
5881 tcttcatctc tcttagatta tttgtcttcc agagtagggg tcatcaggtc cttttcaatt
5941 ggataaccaa aataagcttc actagaagga tattgtgagg cgacaacaca atgggtgtta
6001 caggaatatt gcagttacct cgtgatcgat tcaagaggac atcattcttt ctttgggtaa
6061 ttatcctttt ccaaagaaca ttttccatcc cgcttggagt tatccacaat agtacattac
6121 aggttagtga tgtcgacaaa ctagtttgtc gtgacaaact gtcatccaca aatcaattga
6181 gatcagttgg actgaatctc gaggggaatg gagtggcaac tgacgtgcca tctgtgacta
6241 aaagatgggg cttcaggtcc ggtgtcccac caaaggtggt caattatgaa gctggtgaat
6301 gggctgaaaa ctgctacaat cttgaaatca aaaaacctga cgggagtgag tgtctaccag
6361 cagcgccaga cgggattcgg ggcttccccc ggtgccggta tgtgcacaaa gtatcaggaa
6421 cgggaccatg tgccggagac tttgccttcc acaaagaggg tgctttcttc ctgtatgatc
6481 gacttgcttc cacagttatc taccgaggaa cgactttcgc tgaaggtgtc gttgcatttc
6541 tgatactgcc ccaagctaag aaggacttct tcagctcaca ccccttgaga gagccggtca
6601 atgcaacgga ggacccgtcg agtggctatt attctaccac aattagatat caggctaccg
6661 gttttggaac taatgagaca gagtacttgt tcgaggttga caatttgacc tacgtccaac
6721 ttgaatcaag attcacacca cagtttctgc tccagctgaa tgagacaata tatgcaagtg
6781 ggaagaggag caacaccacg ggaaaactaa tttggaaggt caaccccgaa attgatacaa
6841 caatcgggga gtgggccttc tgggaaacta aaaaaacctc actagaaaaa ttcgcagtga
6901 agagttgtct ttcacagctg tatcaaacgg acccaaaaac atcagtggtc agagtccggc
6961 gcgaacttct tccgacccag agaccaacac aacaaatgaa gaccacaaaa tcatggcttc
7021 agaaaattcc tctgcaatgg ttcaagtgca cagtcaagga aggaaagctg cagtgtcgca
7081 tctgacaacc cttgccacaa tctccacgag tcctcaacct cccacaacca aaacaggtcc
7141 ggacaacagc acccataata cacccgtgta taaacttgac atctctgagg caactcaagt
7201 tggacaacat caccgtagag cagacaacga cagcacagcc tccgacactc cccccgccac
7261 gaccgcagcc ggacccttaa aagcagagaa caccaacacg agtaagagcg ctgactccct
7321 ggacctcgcc accacgacaa gcccccaaaa ctacagcgag actgctggca acaacaacac
7381 tcatcaccaa gataccggag aagagagtgc cagcagcggg aagctaggct taattaccaa
7441 tactattgct ggagtagcag gactgatcac aggcgggaga aggactcgaa gagaagtaat
7501 tgtcaatgct caacccaaat gcaaccccaa tttacattac tggactactc aggatgaagg
7561 tgctgcaatc ggattggcct ggataccata tttcgggcca gcagccgaag gaatttacac
7621 agaggggcta atgcacaacc aagatggttt aatctgtggg ttgaggcagc tggccaacga
7681 aacgactcaa gctctccaac tgttcctgag agccacaact gagctgcgaa ccttttcaat
7741 cctcaaccgt aaggcaattg acttcctgct gcagcgatgg ggtggcacat gccacatttt
7801 gggaccggac tgctgtatcg aaccacatga ttggaccaag aacataacag acaaaattga
7861 tcagattatt catgattttg ttgataaaac ccttccggac cagggggaca atgacaattg
7921 gtggacagga tggagacaat ggataccggc aggtattgga gttacaggtg ttataattgc
7981 agttatcgct ttattctgta tatgcaaatt tgtcttttag tctttcttca gattgtttca
8041 cggcaaaact caacctcaaa tcaatgaaac taggatttaa ttatatgaat cacttgaatc
8101 taagattact tgacaaatga taacataata cactggagct tcaaacatag ccaatgtgat
8161 tctaactcct ttaaactcac agttaatcat aaacaaggtt tgacatcaat ctagctatat
8221 ctttaagaat gataaacttg atgaagatta agaaaaaggt aatctttcga ttatctttag
8281 tcttcatcct tgattctaca atcatgacag ttgtctttaa tgaaaaagga aaaaagcctt
8341 tttattaagt tgtaataatc agatctgcaa accggtagaa tttagttgta acctaacaca
8401 cacaaagcat tggtaaaaaa gtcaatagaa atttaaacag tgagtgcaga caactcttaa
8461 atggaagctt catatgagag aggacgcccc cgagctgcca gacagcattc aagggatgga
8521 cacgaccacc atgttcgagc acgatcatca tccagagaga attatcgagg tgagtaccgt
8581 caatcaagga gcgcctcaca agtgcgcgtt cctactgtat ttcataagaa gagagttgaa
8641 ccattaacag ttcctccagc acctaaagac atatgtccga ccttgaaaaa aggatttttg
8701 tgtgacagta gtttttgcaa aaaagaccac cagttagaaa gtttaactga tagggaatta
8761 ctcctactaa tcgcccgtaa gacttgtgga tcagtagaac aacaattaaa tataactgca
8821 cccaaggact cgcgcttagc aaatccaacg gctgatgatt tccagcaaga ggaaggtccc
8881 aaaattacct tgttgacact gatcaagacg gcagaacact gggcgagaca agacatccga
8941 accatagagg attccaaatt aagggcattg ttaactctat gtgctgtgat gacgaggaaa
9001 ttctcaaaat cccagctgag tcttttgtgt gagacacacc taaggcgcga agggcttggg
9061 caagatcagg cagaacccgt tctcgaagta tatcaacgat tacacagtga taaaggaggc
9121 agttttgaag ctgcactatg gcaacaatgg gaccgacaat ccctaattat gtttatcact
9181 gcattcttga atatcgctct ccagttaccg tgtgaaagtt ctgctgtcgt tgtttcaggg
9241 ttaagaacat tggttcctca atcagataat gaggaagctt caaccaaccc ggggacatgc
9301 tcatggtctg atgagggtac cccttaataa ggctgactaa aacactatat aaccttctac
9361 ttgatcacaa tactccgtat acctatcatc atatatttaa tcaagacgat atcctttaaa
9421 acttattcag tactataatc actctcattt caaattgata agatatgcat aattgcctta
9481 atatataaag aggtatgata taacccaaac attgaccaaa gaaaatcata atctcgtatc
9541 gctcgcaata taacctgcca agcatacctc ttgcacaaag tgattcttgt acacaaataa
9601 tgtttgactc tacaggaggt agcaacgatc catctcatca aaaaataagt attttatgat
9661 ttactaatga tctcttaaaa tattaagaaa aactgacgga acataaattc tttctgcttc
9721 aagttgtgga ggaggtctat ggtattcgct attgttatat tacaatcaat aacaagcttg
9781 taaaaatatt gttcttgttt caggaggtat attgtgaccg gaaaagctaa actaatgatg
9841 aagattaatg cggaggtctg atgagaataa accttattat tcagattagg ccccaagagg
9901 cattcttcat ctccttttag caaaatacta tttcaggata gtccagctag tgacacgtct
9961 tttagctgta taccagttgc ccctgagata cgccacaaaa gtgtctctga gctaaagtgg
10021 tctgtacaca tctcatacat tgtattaggg gcaataatat ctaattgaac ttagccattt
10081 aaaatttagt gcataaatct gggctaactc caccaggtca actccattgg ctgaaaagaa
10141 gcccacctac aacgaacatt actttgagca ccctcacaat taaaaaataa gagcgtcgtt
10201 ccaacaatcg agcgcaaggt tacaaggttg aactgagagt gtctagacaa caaaatatcg
10261 atactccaga caccaagcaa gacctgagaa aaaaccatgg ccaaagctac gggacgatac
10321 aatctaatat cgcccaaaaa ggacctggag aaaggggttg tcttaagcga cctctgtaac
10381 ttcttagtta gtcaaactat tcaagggtgg aaagtttatt gggctggtat tgagtttgat
10441 gtgactcaca aaggaatggc cctattgcat agactgaaaa ctaatgactt tgcccctgca
10501 tggtcaatga caaggaacct atttccccat ttatttcaaa atccgaattc cactattgaa
10561 tcaccgctgt gggcactgag agtcatcctt gcagcaggga tacaggacca gttaattgac
10621 cagtctttga ttgaaccctt agcaggagcc cttggtctga tctctgattg gctgctaaca
10681 accaacacta accatttcaa catgcgaaca caacgtgtca aggaacaatt gagcctaaaa
10741 atgctgtcgt tgattcgatc caatattctc aagtttatta acaaattgga tgctctacat
10801 gtcgtgaact acaatggatt attgagcagt attgaaattg gaactcaaaa tcatacaatc
10861 atcataactc gaactaacat gggttttctg gtggagctcc aagaacccga caaatcggca
10921 atgaaccgca agaagcctgg gccggcgaaa ttttccctcc ttcatgagtc cacactgaaa
10981 gcatttacac aagggtcctc gacacgaatg caaagtttaa ttcttgaatt caatagctct
11041 cttgctatct aactaagatg gaatacttca tattgggcta actcatatat gctgactcaa
11101 tagttaactt gacatctctg ccttcataat cagatatata agcataataa ataaatactc
11161 atatttcttg ataatttgtt taaccacaga taaatcctca ctgtaagcca gcttccaagt
11221 tgacaccctt acaaaaacca ggactcagaa tccctcaaat aagagattcc aagacaacat
11281 catagaattg ctttattata ttaataagca ttttatcact agaaatccaa tatacgaaat
11341 ggttaattgt aactaaaccc gcaggtcatg tgtgttaggt ttcacaaatt atatatatta
11401 ctaactccat actcgtaact aacattagat aagtaggtta agaaaaaagc ttgaggaaga
11461 ttaagaaaaa ctgcttattg ggtctttccg tgttttagat gaagcagttg acattcttcc
11521 tcttgatatt aaatggctac acaacatacc caatacccag acgccaggtt atcatcacca
11581 attgtattgg accaatgtga ccttgtcact agagcttgcg ggttgtattc atcatactcc
11641 cttaatccgc aactacgcaa ctgtaaactc ccgaaacata tataccgttt aaaatatgat
11701 gtaactgtta ccaagttctt aagtgatgta ccagtggcga cattgcccat agatttcata
11761 gtcccaattc ttctcaaggc actatcaggc aatgggttct gtcctgttga gccgcggtgc
11821 caacagttct tagatgaaat tattaagtac acaatgcaag atgctctctt cctgaaatat
11881 tatctcaaaa atgtgggtgc tcaagaagac tgtgttgatg accactttca agaaaaaatc
11941 ttatcttcaa ttcagggcaa tgaattttta catcaaatgt ttttctggta tgacctggct
12001 attttaactc gaaggggtag attaaatcga ggaaactcta gatcaacgtg gtttgttcat
12061 gatgatttaa tagacatctt aggctatggg gactatgttt tttggaagat cccaatttca
12121 ctgttaccac tgaacacaca aggaatcccc catgctgcta tggattggta tcagacatca
12181 gtattcaaag aagcggttca agggcataca cacattgttt ctgtttctac tgccgatgtc
12241 ttgataatgt gcaaagattt aattacatgt cgattcaaca caactctaat ctcaaaaata
12301 gcagaggttg aggacccagt ttgctctgat tatcccaatt ttaagattgt gtctatgctt
12361 taccagagcg gagattactt actctccata ttagggtctg atgggtataa aatcattaag
12421 tttctcgaac cattgtgctt ggctaaaatt caattgtgct caaagtacac cgagaggaag
12481 ggccgattct taacacaaat gcatttagct gtaaatcaca ccctggaaga aattacagaa
12541 atacgtgcac taaagccttc acaggctcac aagatccgtg aattccatag aacattgata
12601 aggctggaga tgacgccaca acaactttgt gagctatttt ccatacaaaa acactggggg
12661 catcctgtgc tacatagtga aacagcaatc caaaaagtta aaaaacatgc tacggtgcta
12721 aaagcattac gccctatcgt gattttcgag acatattgtg tttttaaata tagcattgca
12781 aaacattatt ttgatagtca aggatcttgg tacagtgtta cctcagatag aaatctaaca
12841 ccaggtctta attcttatat caaaagaaat caattccctc cgttgccaat gattaaagaa
12901 ctgctatggg aattttacca ccttgaccat cctccacttt tctcaaccaa aattattagt
12961 gacttaagta tttttataaa agacagagct actgcagtag aaaggacatg ctgggatgca
13021 gtattcgagc ctaatgttct gggatataat ccacctcaca aattcagtac caaacgtgta
13081 ccggaacaat ttttagagca agaaaacttt tctattgaga atgttctttc ctacgcgcaa
13141 aaactcgagt atctactacc acaatatcgg aatttttctt tctcattgaa agagaaagag
13201 ttgaatgtag gtagaacttt cggaaaattg ccttatccga ctcgcaatgt tcaaacactt
13261 tgtgaagctc tgttagctga tggtcttgct aaagcatttc ctagcaatat gatggtagtt
13321 acggaacgtg aacaaaaaga aagcttattg catcaagcat catggcacca cacaagtgat
13381 gatttcggtg agcatgccac agttagaggg agtagctttg taactgattt agagaaatac
13441 aatcttgcat ttaggtatga gtttacagca ccttttatag aatattgcaa ccgttgctat
13501 ggtgttaaga atgtttttaa ttggatgcat tatacaatcc cacagtgtta tatgcatgtc
13561 agtgattatt ataatccacc gcataacctc acactggaaa atcgaaacaa cccccctgaa
13621 gggcctagtt catacagggg tcatatggga gggattgaag gactgcaaca aaaactctgg
13681 acaagtattt catgtgctca aatttcttta gttgaaatta agactggttt taagttgcgc
13741 tcagctgtga tgggtgacaa tcagtgcatt accgttttat cagtcttccc cttagagact
13801 gatgcaggcg agcaggaaca gagcgccgag gacaatgcag cgagggtggc cgccagccta
13861 gcaaaagtta caagtgcctg tggaatcttt ttaaaacctg atgaaacatt tgtacattca
13921 ggttttatct attttggaaa aaaacaatat ttgaatgggg tccaattgcc tcagtccctt
13981 aaaacggcta caagaatggc accattgtct gatgcaattt ttgatgatct tcaagggacc
14041 ctggctagta taggtactgc ttttgagcga tccatctctg agacacgaca tatctttcct
14101 tgcagaataa ccgcagcttt ccatacgttc ttttcggtga gaatcttgca atatcatcac
14161 ctcggattta ataaaggttt tgaccttgga cagttaacac tcggcaaacc tctggatttc
14221 ggaacaatat cattggcact agcggtaccg caggtgcttg gagggttatc cttcttgaat
14281 cctgagaaat gtttctaccg gaatctagga gatccagtta cctcaggttt attccagtta
14341 aaaacttatc tccgaatgat tgagatggat gatttattct tacctttaat tgcgaagaac
14401 cctgggaact gcactgccat tgactttgtg ctaaatccta gcggattaaa tgttcctggg
14461 tcgcaagact taacttcatt tctgcgccag attgtacgta ggactatcac cctaagtgcg
14521 aaaaacaaac ttattaatac cttatttcat gcatcagctg acttcgaaga cgaaatggtt
14581 tgtaagtggc tcttatcatc aactcctgtt atgagtcgtt tcgcagccga tatattttca
14641 cgcacgccga gcgggaagcg attgcaaatt ctaggatact tggaaggaac acgcacatta
14701 ttagcctcta agatcatcaa caataataca gagacgccgg ttttggacag actgaggaag
14761 ataacattgc aaaggtggag tctatggttt agttatcttg atcattgtga taatatcctg
14821 gcggaggctt taacccaaat aacttgcaca gttgatttag cacagatcct gagggaatat
14881 tcatgggcac atattttaga ggggagacct cttattggag ccacactccc atgtatgatt
14941 gagcaattca aagtggtttg gctgaaaccc tacgaacaat gtccgcagtg ttcaaatgcc
15001 aagcaacctg gtgggaaacc attcgtgtca gtagcagtca agaaacatat tgttagtgca
15061 tggccaaatg catcccgaat aagctggact atcggggatg gaatcccata cattggatca
15121 aggacagaag ataagatagg gcaacctgct attaaaccaa aatgtccttc cgcagcctta
15181 agagaggcca ttgaattggc gtcccgttta acatgggtaa ctcaaggcag ttcgaacagt
15241 gacttgctaa taaaaccatt tttggaagca cgagtaaatt taagtgttca agaaatactt
15301 caaatgaccc cttcacatta ctcgggaaat attgttcata ggtacaacga tcaatacagt
15361 cctcattctt tcatggccaa tcgtatgagt aactcagcaa cgcgattgat tgtttctaca
15421 aacactttag gtgagttttc aggaggtggc caatcggcac gcgacagcaa tattattttc
15481 cagaatgtta taaattatgc agttgcactg ttcgatatta aatttagaaa cactgaggct
15541 acagatatcc agtataatcg tgctcacctt catctaacta agtgttgcac ccgggaggta
15601 ccagctcagt acttaacata cacatctaca ttggatttag atttaacaag ataccgagaa
15661 aatgaattga tttatgacaa taatcctcta aaaggaggac tcaattgcaa tatctcattt
15721 gataacccat ttttccaagg caaacagctg aacattatag aagatgacct tattcgactg
15781 cctcacttat ctggatggga gctagctaag accatcatgc aatcaattat ttcagatagc
15841 aataattcgt ctacagaccc aattagcagt ggagaaacaa gatcattcac tacccatttc
15901 ttaacttatc ccaaaatagg acttctgtac agttttgggg cctttgtaag ttattatctt
15961 ggcaatacaa ttcttcggac taagaaatta acacttgaca attttttata ttacttaact
16021 acccaaattc ataatctacc acatcgctca ttgcgaatac ttaagccaac attcaaacat
16081 gcaagcgtta tgtcacgatt aatgagtatt gatccccatt tttctattta cataggcggt
16141 gctgcaggtg acagaggact ctcagatgcg gccaggttat ttttgagaac gtccatttca
16201 tcttttctta catttgtaaa ggaatggata attaatcgcg gaacaattgt ccctttatgg
16261 atagtatatc cattagaggg tcaaaatcca acacctgtta ataatttcct ccatcagatc
16321 gtagaactgc tggtgcatga ttcatcaaga caccaggctt ttaaaactac cataaatgat
16381 catgtacatc ctcacgacaa tcttgtttac acatgtaaga gtacagccag caatttcttc
16441 catgcgtcat tggcgtactg gaggagcagg cacagaaaca gcaaccgaaa agacttgaca
16501 agaaactctt caactggatc aagcacaaac aacagtgatg gtcatattaa gagaagtcaa
16561 gaacaaacca ccagagatcc acatgatggc actgaacgga gtctagtcct gcaaatgagc
16621 catgaaataa aaagaacgac aattccacaa gagaacacgc accagggtcc gtcgttccag
16681 tcatttctaa gtgactctgc ttgcggtaca gcaaacccaa aactaaattt cgatagatcg
16741 agacacaatg tgaaatctca ggatcataac tcagcatcca agagggaagg tcatcaaata
16801 atctcacatc gtctagtcct acctttcttt acattatctc aagggacacg ccaattaacg
16861 tcatccaatg agtcacaaac ccaagatgag atatcaaagt acttacggca attgagatcc
16921 gtcattgata ccacagttta ttgtaggttt accggtatag tctcgtccat gcattacaaa
16981 cttgatgagg tcctttggga aatagagaat tttaagtcgg ctgtgacgct ggcagaggga
17041 gaaggtgctg gtgccttact attgattcag aaataccaag ttaagacctt attcttcaac
17101 acgctagcta ctgagtccag tatagagtca gaaatagtat caggaatgac tactcctagg
17161 atgcttctac ctgttatgtc aaaattccat aatgaccaaa ttgagattat tcttaacaac
17221 tcagcaagcc aaataacaga cataacaaat cctacttggt ttaaagacca aagagcaagg
17281 ctacctaggc aagtcgaggt tataaccatg gatgcagaga cgacagagaa tataaacaga
17341 tcgaaattgt acgaagctgt acataaattg atcttacacc atgttgatcc cagcgtattg
17401 aaagcagtgg tccttaaagt ctttctaagt gataccgagg gtatgttatg gctaaatgat
17461 aatctagccc cgttttttgc cactgggtat ttaattaagc caataacgtc aagtgccagg
17521 tctagtgagt ggtatctttg tctgacgaac ttcttatcaa ctacacgtaa gatgccacac
17581 caaaaccatc tcagttgtaa gcaggtaata cttacggcat tgcaactgca aattcaacgg
17641 agcccatact ggctaagtca tttaactcag tatgctgact gcgatttaca tttaagctat
17701 atccgccttg gttttccatc attagagaaa gtactatacc acaggtataa ccttgtcgat
17761 tcaaaaagag gtccactagt ctctgtcact cagcacttag cacatcttag ggcagagatt
17821 cgagaattga ccaatgatta taatcaacag cgacaaagtc ggactcaaac atatcacttt
17881 attcgtactg caaaaggacg aatcacaaaa ctagtcaatg attatttaaa attctttctt
17941 attgtacaag cattaaaaca taatgggaca tggcaagctg agtttaagaa attaccagag
18001 ttgattagtg tgtgcaatag gttctatcat attagagatt gtaattgtga agaacgtttc
18061 ttagttcaaa ccttatattt acatagaatg caggattctg aagttaagct tatcgaaagg
18121 ctgacagggc ttctgagttt atttccagat ggtctctaca ggttcgattg aataaccgtg
18181 catagtattt tgatacttgt aaaggttggt tatcaacata cagattataa aaaactcata
18241 aattgctctc atacatcatc ttgatctgat ttcaataaat aactatttag ataacgaaag
18301 gagtccttac attatacact atatttggcc tctctccctg cgtgataatc aaaaaattca
18361 caatacagca tgtgtgacat attactgctg caatgagtct aacgcaacat aataaactcc
18421 gcactcttta taattaagct ttaacgatag gtctgggctc atattgttat tgatatagta
18481 atgttgtatc aatatcttgc cagatggaat agtgctttgg ttgataacac gacttcttaa
18541 aacaaaactg atctttaaga ttaagttttt tataattgtc attgctttaa tttgtcgatt
18601 taaaaatggt gatagcctta atctttgtgt aaaataagag attaggtgta ataactttaa
18661 catttttgtc tagtaagcta ctattccatt cagaatgata aaattaaaag aaaagacatg
18721 actgtaaaat cagaaatacc ttctttacaa tatagcagac tagataataa tcttcgtgtt
18781 aatgataatt aaggcattga ccacgctcat cagaaggctc actagaataa acgttgcaaa
18841 aaggatccct ggaaaaatgg tcgcacacaa aaa
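
For anyone who wants to sanity-check a pasted sequence block like the one above, here is a minimal Python sketch. It is not part of the GenBank record. It assumes each line starts with the 1-based position of its first base, followed by space-separated chunks of bases. The function name parse_origin_lines is made up for illustration.

```python
def parse_origin_lines(lines):
    """Join GenBank-style sequence lines into one string.

    Returns (first_position, sequence). Raises ValueError when a line's
    leading position does not follow on from the previous line, which
    usually means a line was dropped or duplicated while copying.
    """
    first_position = None
    expected = None
    chunks = []
    for raw in lines:
        parts = raw.split()
        if not parts:
            continue  # skip blank lines
        start = int(parts[0])
        bases = "".join(parts[1:]).lower()
        if first_position is None:
            first_position = start
        elif start != expected:
            raise ValueError(f"line starts at {start}, expected {expected}")
        chunks.append(bases)
        expected = start + len(bases)
    return first_position, "".join(chunks)


# The last two lines of the block above, copied as posted.
tail = [
    "18781 aatgataatt aaggcattga ccacgctcat cagaaggctc actagaataa acgttgcaaa",
    "18841 aaggatccct ggaaaaatgg tcgcacacaa aaa",
]
start, seq = parse_origin_lines(tail)
print(start, len(seq), start + len(seq) - 1)  # 18781 93 18873
```

Running the whole block through the same function and comparing the final position with the length on the LOCUS line is a quick way to spot a truncated paste.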

Posted: Tue Jul 29, 2014 10:12 am
LOCUS KM233037 18915 bp cRNA linear VRL 25-JUL-2014
DEFINITION Zaire ebolavirus strain EBOV_3, partial genome.
ACCESSION KM233037
VERSION KM233037.1 GI:667852510
KEYWORDS .
SOURCE Zaire ebolavirus (ZEBOV)
ORGANISM Zaire ebolavirus
Viruses; ssRNA negative-strand viruses; Mononegavirales;
Filoviridae; Ebolavirus.
REFERENCE 1 (bases 1 to 18915)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Deep sequencing analysis of Ebola virus transmission in Sierra
Leone
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 18915)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Direct Submission
JOURNAL Submitted (25-JUL-2014) Infectious Disease Initiative, Broad
Institute of MIT and Harvard, 75 Ames St., Cambridge, MA 02142, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Novoalign v. v.3
Sequencing Technology :: Illumina; Nextera
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..18915
/organism="Zaire ebolavirus"
/mol_type="viral cRNA"
/strain="EBOV_3"
/host="Homo sapiens"
/db_xref="taxon:186538"
/country="Sierra Leone"
/collection_date="03-Jun-2014"
gene 12..2982
/gene="NP"
mRNA 12..2982
/gene="NP"
/product="nucleoprotein"
misc_signal 12..23
/gene="NP"
/note="putative transcription start signal"
CDS 426..2645
/gene="NP"
/note="encapsidation of genomic RNA"
/codon_start=1
/product="nucleoprotein"
/protein_id="AIG95902.1"
/db_xref="GI:667852511"
/translation="MDSRPQKVWMTPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
GFRFEVKKCDGVKRLEELLPAVSSGRNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
DTTIPDVVVDPDDGGYGEYQSYSENGMSAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
KNSQKGQHTEGRQTQSTPTQNVTGPRRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
QQDQDHIQEARNQDSDNTQPEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
GKEYTYPDSLEEEYPPWLTEKEAMNDENRFVTLDGQQFYWPVMNHRNKFMAILQHHQ"
polyA_signal 2971..2982
/gene="NP"
gene 2988..4363
/gene="VP35"
mRNA 2988..4363
/gene="VP35"
/product="VP35 matrix protein"
misc_signal 2988..2999
/gene="VP35"
/note="putative transcription start signal"
CDS 3085..4107
/gene="VP35"
/note="polymerase complex protein"
/codon_start=1
/product="VP35 matrix protein"
/protein_id="AIG95903.1"
/db_xref="GI:667852512"
/translation="MTTRTKGRGHTVATTQNDRMPGPELSGWISEQLMTGRIPVNDIF
CDIENNPGLCYASQMQQTKPNPKMRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLDSTTSLTEENFGKPD
ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
TLGLKI"
gene 4346..5850
/gene="VP40"
mRNA 4346..5850
/gene="VP40"
/product="matrix protein"
misc_signal 4346..4357
/gene="VP35"
/note="transcription start signal"
polyA_signal 4353..4363
/gene="VP35"
CDS 4435..5415
/gene="VP40"
/codon_start=1
/product="matrix protein"
/protein_id="AIG95904.1"
/db_xref="GI:667852513"
/translation="MRRVILPTAPPEYMEAIYPARSNSTIARGGNSNTGFLTPESVNG
DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVVEK"
polyA_signal 5839..5850
/gene="VP40"
gene 5856..8261
/gene="GP"
mRNA 5856..8261
/gene="GP"
/product="ssGP"
/note="unedited mRNA"
misc_signal 5856..5867
/gene="GP"
/note="putative transcription start signal"
CDS join(5995..6879,6879..8024)
/gene="GP"
/ribosomal_slippage
/note="additional a residue inserted during transcription;
encodes two disulfide linked subunits GP1 and GP2;
receptor binding and fusion"
/codon_start=1
/product="virion spike glycoprotein precursor"
/protein_id="AIG95905.1"
/db_xref="GI:667852514"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNGPKNISGQSPARTSSDPETNT
TNEDHKIMASENSSAMVQVHSQGRKAAVSHLTTLATISTSPQPPTTKTGPDNSTHNTP
VYKLDISEATQVGQHHRRADNDSTASDTPPATTAAGPLKAENTNTSKSADSLDLATTT
SPQNYSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREVIVNAQ
PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETT
QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
CDS 5995..7089
/gene="GP"
/note="small non-structural secreted glycoprotein; sGP
secreted as an anti-parallel oriented homodimer"
/codon_start=1
/product="sGP"
/protein_id="AIG95906.1"
/db_xref="GI:667852515"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTDPKTSVVRVRRELLPTQRPTQ
QMKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
CDS join(5995..6879,6881..6889)
/gene="GP"
/ribosomal_slippage
/note="second non-structural secreted glycoprotein;
secreted in a monomeric form; one a residue is deleted or
two additional a residues are inserted at the editing site
during transcription of the GP gene"
/codon_start=1
/product="ssGP"
/protein_id="AIG95907.1"
/db_xref="GI:667852516"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKPH"
gene 8244..9696
/gene="VP30"
mRNA 8244..9696
/gene="VP30"
/product="VP30 minor nucleoprotein"
misc_signal 8244..8255
/gene="VP30"
/note="putative transcription start signal"
polyA_signal 8251..8261
/gene="VP30"
CDS 8465..9331
/gene="VP30"
/note="minor nucleoprotein; polymerase complex protein"
/codon_start=1
/product="VP30 minor nucleoprotein"
/protein_id="AIG95908.1"
/db_xref="GI:667852517"
/translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
PGTCSWSDEGTP"
polyA_signal 9686..9696
/gene="VP30"
/note="putative"
gene 9841..11474
/gene="VP24"
/note="putative"
mRNA 9841..11452
/gene="VP24"
/product="VP24 membrane-associated protein"
misc_signal 9841..9852
/gene="VP24"
/note="transcription start signal"
CDS 10301..11056
/gene="VP24"
/note="membrane-associated protein"
/codon_start=1
/product="VP24 membrane-associated protein"
/protein_id="AIG95909.1"
/db_xref="GI:667852518"
/translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRKKPGPAK
FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
polyA_signal 11441..11452
/gene="VP24"
/note="putative"
gene 11457..18238
/gene="L"
mRNA 11457..18238
/gene="L"
/product="polymerase"
misc_signal 11457..11468
/gene="VP24"
/note="transcription start signal"
polyA_signal 11464..11474
/gene="VP24"
/note="putative"
CDS 11537..18175
/gene="L"
/note="polymerase; synthesis of viral RNAs;
transcriptional RNA editing"
/codon_start=1
/product="polymerase"
/protein_id="AIG95910.1"
/db_xref="GI:667852519"
/translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPILLKALSGNGFCPVEPRCQQFLDE
IIKYTMQDALFLKYYLKNVGAQEDCVDDHFQEKILSSIQGNEFLHQMFFWYDLAILTR
RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISLLPLNTQGIPHAAMDWYQTSVF
KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEVEDPVCSDYPNFKIVSML
YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
TEIRALKPSQAHKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
PHNLTLENRNNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
GDNQCITVLSVFPLETDAGEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVVWLKP
YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
LIYDNNPLKGGLNCNISFDNPFFQGKQLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLHQIVELLVHDSSRHQAF
KTTINDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKDLTRNSSTGSSTNNS
DGHIKRSQEQTTRDPHDGTERSLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
ANPKLNFDRSRHNVKSQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIENFKSAVTLAEGEGAGAL
LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
ITDITNPTWFKDQRARLPRQVEVITMDAETTENINRSKLYEAVHKLILHHVDPSVLKA
VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCDLHLSYIRLGFPSLEKVLYHRYNL
VDSKRGPLVSVTQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
VKLIERLTGLLSLFPDGLYRFD"
polyA_signal 18228..18238
/gene="L"
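
The CDS coordinates in the feature table above can be checked with a few lines of Python. This is a minimal sketch, not anything from the submitters. It assumes locations are written either as start..end or as join(start..end,start..end), with inclusive 1-based coordinates. The helper names location_length and codon_count are hypothetical.

```python
import re


def location_length(location):
    """Number of bases covered by a GenBank location string.

    Each start..end segment is inclusive. A base listed in two segments
    of a join() is counted twice, as in the GP entry where 6879 appears
    in both segments.
    """
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    if not spans:
        raise ValueError(f"no start..end segment found in {location!r}")
    return sum(int(end) - int(start) + 1 for start, end in spans)


def codon_count(location):
    """Number of codons in a CDS, stop codon included."""
    length = location_length(location)
    if length % 3:
        raise ValueError(f"{location!r} covers {length} bases, not a multiple of 3")
    return length // 3


# NP coding region from the table above.
print(location_length("426..2645"), codon_count("426..2645"))  # 2220 740

# GP coding region; the join repeats base 6879.
gp = "join(5995..6879,6879..8024)"
print(location_length(gp), codon_count(gp))  # 2031 677
```

Since the codon count includes the stop codon, the /translation for a complete CDS should be one residue shorter than the number returned.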

Posted: Tue Jul 29, 2014 10:13 am
1 cgaataacta tgaggaagat taataatttt cctctcattg aaatttatat cggaatttaa
61 attgaaattg ttactgtaat catacctggt ttgtttcaga gccatatcac caagatagag
121 aacaacctag gtctccggag ggggcaaggg catcagtgtg ctcagttgaa aatcccttgt
181 caacatctag gccttatcac atcacaagtt ccgccttaaa ctctgcaggg tgatccaaca
241 accttaatag caacattatt gttaaaggac agcattagtt cacagtcaaa caagcaagat
301 tgagaattaa ctttgatttt gaacctgaac acccagagga ctggagactc aacaacccta
361 aagcctgggg taaaacatta gaaatagttt aaagacaaat tgctcggaat cacaaaattc
421 cgagtatgga ttctcgtcct cagaaagtct ggatgacgcc gagtctcact gaatctgaca
481 tggattacca caagatcttg acagcaggtc tgtccgttca acaggggatt gttcggcaaa
541 gagtcatccc agtgtatcaa gtaaacaatc ttgaggaaat ttgccaactt atcatacagg
601 cctttgaagc tggtgttgat tttcaagaga gtgcggacag tttccttctc atgctttgtc
661 ttcatcatgc gtaccaagga gattacaaac ttttcttgga aagtggcgca gtcaagtatt
721 tggaagggca cgggttccgt tttgaagtca agaagtgtga tggagtgaag cgccttgagg
781 aattgctgcc agcagtatct agtgggagaa acattaagag aacacttgct gccatgccgg
841 aagaggagac gactgaagct aatgccggtc agttcctctc ctttgcaagt ctattccttc
901 cgaaattggt agtaggagaa aaggcttgcc ttgagaaggt tcaaaggcaa attcaagtac
961 atgcagagca aggactgata caatatccaa cagcttggca atcagtagga cacatgatgg
1021 tgattttccg tttgatgcga acaaattttt tgatcaaatt tcttctaata caccaaggga
1081 tgcacatggt tgccggacat gatgccaacg atgctgtgat ttcaaattca gtggctcaag
1141 ctcgtttttc aggtctattg attgtcaaaa cagtacttga tcatatccta caaaagacag
1201 aacgaggagt tcgtctccat cctcttgcaa ggaccgccaa ggtaaaaaat gaggtgaact
1261 ccttcaaggc tgcactcagc tccctggcca agcatggaga gtatgctcct ttcgcccgac
1321 ttttgaacct ttctggagta aataatcttg agcatggtct tttccctcaa ctgtcggcaa
1381 ttgcactcgg agtcgccaca gcccacggga gcaccctcgc aggagtaaat gttggagaac
1441 agtatcaaca gctcagagag gcagccactg aggctgagaa gcaactccaa caatatgcgg
1501 agtctcgtga acttgaccat cttggacttg atgatcagga aaagaaaatt cttatgaact
1561 tccatcagaa aaagaacgaa atcagcttcc agcaaacaaa cgcgatggta actctaagaa
1621 aagagcgcct ggccaagctg acagaagcta tcactgctgc atcactgccc aaaacaagtg
1681 gacattacga tgatgatgac gacattccct ttccaggacc catcaatgat gacgacaatc
1741 ctggccatca agatgatgat ccgactgact cacaggatac gaccattccc gatgtggtag
1801 ttgaccccga tgatggaggc tacggcgaat accaaagtta ctcggaaaac ggcatgagtg
1861 caccagatga cttggtccta ttcgatctag acgaggacga cgaggacacc aagccagtgc
1921 ctaacagatc gaccaagggt ggacaacaga aaaacagtca aaagggccag catacagagg
1981 gcagacagac acaatccacg ccaactcaaa acgtcacagg ccctcgcaga acaatccacc
2041 atgccagtgc tccactcacg gacaatgaca gaagaaacga accctccggc tcaaccagcc
2101 ctcgcatgct gaccccaatc aacgaagagg cagacccact ggacgatgcc gacgacgaga
2161 cgtctagcct tccgccctta gagtcagatg atgaagaaca ggacagggac ggaacttcta
2221 accgcacacc cactgtcgcc ccaccggctc ccgtatacag agatcactcc gaaaagaaag
2281 aactcccgca agatgaacaa caagatcagg accacattca agaggccagg aaccaagaca
2341 gtgacaacac ccagccagaa cattcttttg aggagatgta tcgccacatt ctaagatcac
2401 aggggccatt tgatgccgtt ttgtattatc atatgatgaa ggatgagcct gtagttttca
2461 gtaccagtga tggtaaagag tacacgtatc cggactccct tgaagaggaa tatccaccat
2521 ggctcactga aaaagaggcc atgaatgatg agaatagatt tgttacactg gatggtcaac
2581 aattttattg gccagtaatg aatcacagga ataaattcat ggcaatcctg caacatcatc
2641 agtgaatgag catgtaataa tgggatgatt taatcgacaa atagctaaca ttaaatagtc
2701 aaggaacgca aacaggaaga atttttgatg tctaaggtgt gaattattat cacaataaaa
2761 gtgattctta gttttgaatt taaagctagc ttattattac tagccgtttt tcaaagttca
2821 atttgagtct taatgcaaat aagcgttaag ccacagttat agccataatg gtaactcaat
2881 atcttagcca gcgatttatc taaattaaat tacattatgc ttttataact tacctactag
2941 cctgcccaac atttacacga tcgttttata attaagaaaa aactaatgat gaagattaaa
3001 accttcatca tccttacgtc aattgaattc tctagcacta gaagcttatt gtcttcaatg
3061 taaaagaaaa gctggcctaa caagatgaca actagaacaa agggcagggg ccatactgtg
3121 gccacgactc aaaacgacag aatgccaggc cctgagcttt cgggctggat ctctgagcag
3181 ctaatgaccg gaaggattcc tgtaaacgac atcttctgtg atattgagaa caatccagga
3241 ttatgctacg catcccaaat gcaacaaacg aagccaaacc cgaagatgcg caacagtcaa
3301 acccaaacgg acccaatttg caatcatagt tttgaggagg tagtacaaac attggcttca
3361 ttggctactg ttgtgcaaca acaaaccatc gcatcagaat cattagaaca acgcattacg
3421 agtcttgaga atggtctaaa gccagtttat gatatggcaa aaacaatctc ctcattgaac
3481 agggtttgtg ctgagatggt tgcaaaatat gatcttctgg tgatgacaac cggtcgggca
3541 acagcaaccg ctgcggcaac tgaggcttat tgggctgaac atggtcaacc accacctgga
3601 ccatcacttt atgaagaaag tgcgattcgg ggtaagattg aatctagaga tgagactgtc
3661 cctcaaagtg ttagggaggc attcaacaat ctagacagta ccacttcact aactgaggaa
3721 aattttggga aacctgacat ttcggcaaag gatttgagaa acattatgta tgatcacttg
3781 cctggttttg gaactgcttt ccaccaatta gtacaagtga tttgtaaatt gggaaaagat
3841 agcaattcat tggacattat tcatgctgag ttccaggcca gcctggctga aggagactcc
3901 cctcaatgtg ccctaattca aattacaaaa agagttccaa tcttccaaga tgctgctcca
3961 cctgtcatcc acatccgctc tcgaggtgac attccccgag cttgccagaa gagcttgcgt
4021 ccagtcccac catcacccaa gattgatcga ggttgggtat gtgtttttca gcttcaagat
4081 ggtaaaacac ttggactcaa aatttgagcc aatctctttt ccctccgaaa gaggcaacta
4141 atagcagagg cttcaactgc tgaactatag ggtatgttac attaatgata cacttgtgag
4201 tatcagccct agataatata agtcaattaa acaaccaaga taaaattgtt catatcccgc
4261 tagcagcttt aaagataaat gtaataggag ctatacctct gacagtatta taattaattg
4321 ttattaagta acccaaacca aaaatgatga agattaagaa aaacctacct cgactgagag
4381 agtgtttttt cattaacctt catcttgtaa acgttgagca aaattgttaa aaatatgagg
4441 cgggttatat tgcctactgc tcctcctgaa tatatggagg ccatataccc tgccaggtca
4501 aattcaacaa ttgctagggg tggcaacagc aatacaggct tcctgacacc ggagtcagtc
4561 aatggagaca ctccatcgaa tccactcagg ccaattgctg atgacaccat cgaccatgcc
4621 agccacacac caggcagtgt gtcatcagca ttcatcctcg aagctatggt gaatgtcata
4681 tcgggcccca aagtgctaat gaagcaaatt ccaatttggc ttcctctagg tgtcgctgat
4741 caaaagacct acagctttga ctcaactacg gccgccatca tgcttgcttc atatactatc
4801 acccatttcg gcaaggcaac caatccgctt gtcagagtca atcggctggg tcctggaatc
4861 ccggatcacc ccctcaggct cctgcgaatt ggaaaccagg ctttcctcca ggagttcgtt
4921 cttccaccag tccaactacc ccagtatttc acctttgatt tgacagcact caaactgatc
4981 actcaaccac tgcctgctgc aacatggacc gatgacactc caactggatc aaatggagcg
5041 ttgcgtccag gaatttcatt tcatccaaaa cttcgcccca ttcttttacc caacaaaagt
5101 gggaagaagg ggaacagtgc cgatctaaca tctccggaga aaatccaagc aataatgact
5161 tcactccagg actttaagat cgttccaatt gatccaacca aaaatatcat gggtatcgaa
5221 gtgccagaaa ctctggtcca caagctgacc ggtaagaagg tgacttccaa aaatggacaa
5281 ccaatcatcc ctgttctttt gccaaagtac attgggttgg acccggtggc tccaggagac
5341 ctcaccatgg taatcacaca ggattgtgac acgtgtcatt ctcctgcaag tcttccagct
5401 gtggttgaga agtaattgca ataattgact cagatccagt tttacagaat cttctcaggg
5461 atagtgataa catcttttta ataatccgtc tactagaaga gatacttcta attgatcaat
5521 atactaaagg tgctttacac cattgtctct tttctctcct aaatgtagag cttaacaaaa
5581 gactcataat atacctgttt ttaaaagatt gattgatgaa agatcatgac taataacatt
5641 acaaacaatc ctactataat caatacggtg attcaaatgt caatctttct cattgcacat
5701 actctttgtc cttatcctca aattgcctac atgcttacat ctgaggacag ccagtgtgac
5761 ttggattgga gatgtggagg aaaaatcggg gcccatttct aagttgttca caatctaagt
5821 acagacattg ctcttctaat taagaaaaaa tcggcgatga agattaagcc gacagtgagc
5881 gtaatcttca tctctcttag attatttgtc ttccagagta ggggtcatca ggtccttttc
5941 aattggataa ccaaaataag cttcactaga aggatattgt gaggcgacaa cacaatgggt
6001 gttacaggaa tattgcagtt acctcgtgat cgattcaaga ggacatcatt ctttctttgg
6061 gtaattatcc ttttccaaag aacattttcc atcccgcttg gagttatcca caatagtaca
6121 ttacaggtta gtgatgtcga caaactagtt tgtcgtgaca aactgtcatc cacaaatcaa
6181 ttgagatcag ttggactgaa tctcgagggg aatggagtgg caactgacgt gccatctgtg
6241 actaaaagat ggggcttcag gtccggtgtc ccaccaaagg tggtcaatta tgaagctggt
6301 gaatgggctg aaaactgcta caatcttgaa atcaaaaaac ctgacgggag tgagtgtcta
6361 ccagcagcgc cagacgggat tcggggcttc ccccggtgcc ggtatgtgca caaagtatca
6421 ggaacgggac catgtgccgg agactttgcc ttccacaaag agggtgcttt cttcctgtat
6481 gatcgacttg cttccacagt tatctaccga ggaacgactt tcgctgaagg tgtcgttgca
6541 tttctgatac tgccccaagc taagaaggac ttcttcagct cacacccctt gagagagccg
6601 gtcaatgcaa cggaggaccc gtcgagtggc tattattcta ccacaattag atatcaggct
6661 accggttttg gaactaatga gacagagtac ttgttcgagg ttgacaattt gacctacgtc
6721 caacttgaat caagattcac accacagttt ctgctccagc tgaatgagac aatatatgca
6781 agtgggaaga ggagcaacac cacgggaaaa ctaatttgga aggtcaaccc cgaaattgat
6841 acaacaatcg gggagtgggc cttctgggaa actaaaaaaa cctcactaga aaaattcgca
6901 gtgaagagtt gtctttcaca gctgtatcaa acggacccaa aaacatcagt ggtcagagtc
6961 cggcgcgaac ttcttccgac ccagagacca acacaacaaa tgaagaccac aaaatcatgg
7021 cttcagaaaa ttcctctgca atggttcaag tgcacagtca aggaaggaaa gctgcagtgt
7081 cgcatctgac aacccttgcc acaatctcca cgagtcctca acctcccaca accaaaacag
7141 gtccggacaa cagcacccat aatacacccg tgtataaact tgacatctct gaggcaactc
7201 aagttggaca acatcaccgt agagcagaca acgacagcac agcctccgac actccccccg
7261 ccacgaccgc agccggaccc ttaaaagcag agaacaccaa cacgagtaag agcgctgact
7321 ccctggacct cgccaccacg acaagccccc aaaactacag cgagactgct ggcaacaaca
7381 acactcatca ccaagatacc ggagaagaga gtgccagcag cgggaagcta ggcttaatta
7441 ccaatactat tgctggagta gcaggactga tcacaggcgg gagaaggact cgaagagaag
7501 taattgtcaa tgctcaaccc aaatgcaacc ccaatttaca ttactggact actcaggatg
7561 aaggtgctgc aatcggattg gcctggatac catatttcgg gccagcagcc gaaggaattt
7621 acacagaggg gctaatgcac aaccaagatg gtttaatctg tgggttgagg cagctggcca
7681 acgaaacgac tcaagctctc caactgttcc tgagagccac aactgagctg cgaacctttt
7741 caatcctcaa ccgtaaggca attgacttcc tgctgcagcg atggggtggc acatgccaca
7801 ttttgggacc ggactgctgt atcgaaccac atgattggac caagaacata acagacaaaa
7861 ttgatcagat tattcatgat tttgttgata aaacccttcc ggaccagggg gacaatgaca
7921 attggtggac aggatggaga caatggatac cggcaggtat tggagttaca ggtgttataa
7981 ttgcagttat cgctttattc tgtatatgca aatttgtctt ttagtctttc ttcagattgt
8041 ttcacggcaa aactcaacct caaatcaatg aaactaggat ttaattatat gaatcacttg
8101 aatctaagat tacttgacaa atgataacat aatacactgg agcttcaaac atagccaatg
8161 tgattctaac tcctttaaac tcacagttaa tcataaacaa ggtttgacat caatctagct
8221 atatctttaa gaatgataaa cttgatgaag attaagaaaa aggtaatctt tcgattatct
8281 ttagtcttca tccttgattc tacaatcatg acagttgtct ttaatgaaaa aggaaaaaag
8341 cctttttatt aagttgtaat aatcagatct gcaaaccggt agaatttagt tgtaacctaa
8401 cacacacaaa gcattggtaa aaaagtcaat agaaatttaa acagtgagtg cagacaactc
8461 ttaaatggaa gcttcatatg agagaggacg cccccgagct gccagacagc attcaaggga
8521 tggacacgac caccatgttc gagcacgatc atcatccaga gagaattatc gaggtgagta
8581 ccgtcaatca aggagcgcct cacaagtgcg cgttcctact gtatttcata agaagagagt
8641 tgaaccatta acagttcctc cagcacctaa agacatatgt ccgaccttga aaaaaggatt
8701 tttgtgtgac agtagttttt gcaaaaaaga ccaccagtta gaaagtttaa ctgataggga
8761 attactccta ctaatcgccc gtaagacttg tggatcagta gaacaacaat taaatataac
8821 tgcacccaag gactcgcgct tagcaaatcc aacggctgat gatttccagc aagaggaagg
8881 tcccaaaatt accttgttga cactgatcaa gacggcagaa cactgggcga gacaagacat
8941 ccgaaccata gaggattcca aattaagggc attgttaact ctatgtgctg tgatgacgag
9001 gaaattctca aaatcccagc tgagtctttt gtgtgagaca cacctaaggc gcgaagggct
9061 tgggcaagat caggcagaac ccgttctcga agtatatcaa cgattacaca gtgataaagg
9121 aggcagtttt gaagctgcac tatggcaaca atgggaccga caatccctaa ttatgtttat
9181 cactgcattc ttgaatatcg ctctccagtt accgtgtgaa agttctgctg tcgttgtttc
9241 agggttaaga acattggttc ctcaatcaga taatgaggaa gcttcaacca acccggggac
9301 atgctcatgg tctgatgagg gtacccctta ataaggctga ctaaaacact atataacctt
9361 ctacttgatc acaatactcc gtatacctat catcatatat ttaatcaaga cgatatcctt
9421 taaaacttat tcagtactat aatcactctc atttcaaatt gataagatat gcataattgc
9481 cttaatatat aaagaggtat gatataaccc aaacattgac caaagaaaat cataatctcg
9541 tatcgctcgc aatataacct gccaagcata cctcttgcac aaagtgattc ttgtacacaa
9601 ataatgtttg actctacagg aggtagcaac gatccatctc atcaaaaaat aagtatttta
9661 tgatttacta atgatctctt aaaatattaa gaaaaactga cggaacataa attctttctg
9721 cttcaagttg tggaggaggt ctatggtatt cgctattgtt atattacaat caataacaag
9781 cttgtaaaaa tattgttctt gtttcaggag gtatattgtg accggaaaag ctaaactaat
9841 gatgaagatt aatgcggagg tctgatgaga ataaacctta ttattcagat taggccccaa
9901 gaggcattct tcatctcctt ttagcaaaat actatttcag gatagtccag ctagtgacac
9961 gtcttttagc tgtataccag ttgcccctga gatacgccac aaaagtgtct ctgagctaaa
10021 gtggtctgta cacatctcat acattgtatt aggggcaata atatctaatt gaacttagcc
10081 atttaaaatt tagtgcataa atctgggcta actccaccag gtcaactcca ttggctgaaa
10141 agaagcccac ctacaacgaa cattactttg agcaccctca caattaaaaa ataagagcgt
10201 cgttccaaca atcgagcgca aggttacaag gttgaactga gagtgtctag acaacaaaat
10261 atcgatactc cagacaccaa gcaagacctg agaaaaaacc atggccaaag ctacgggacg
10321 atacaatcta atatcgccca aaaaggacct ggagaaaggg gttgtcttaa gcgacctctg
10381 taacttctta gttagtcaaa ctattcaagg gtggaaagtt tattgggctg gtattgagtt
10441 tgatgtgact cacaaaggaa tggccctatt gcatagactg aaaactaatg actttgcccc
10501 tgcatggtca atgacaagga acctatttcc ccatttattt caaaatccga attccactat
10561 tgaatcaccg ctgtgggcac tgagagtcat ccttgcagca gggatacagg accagttaat
10621 tgaccagtct ttgattgaac ccttagcagg agcccttggt ctgatctctg attggctgct
10681 aacaaccaac actaaccatt tcaacatgcg aacacaacgt gtcaaggaac aattgagcct
10741 aaaaatgctg tcgttgattc gatccaatat tctcaagttt attaacaaat tggatgctct
10801 acatgtcgtg aactacaatg gattattgag cagtattgaa attggaactc aaaatcatac
10861 aatcatcata actcgaacta acatgggttt tctggtggag ctccaagaac ccgacaaatc
10921 ggcaatgaac cgcaagaagc ctgggccggc gaaattttcc ctccttcatg agtccacact
10981 gaaagcattt acacaagggt cctcgacacg aatgcaaagt ttaattcttg aattcaatag
11041 ctctcttgct atctaactaa gatggaatac ttcatattgg gctaactcat atatgctgac
11101 tcaatagtta acttgacatc tctgccttca taatcagata tataagcata ataaataaat
11161 actcatattt cttgataatt tgtttaacca cagataaatc ctcactgtaa gccagcttcc
11221 aagttgacac ccttacaaaa accaggactc agaatccctc aaataagaga ttccaagaca
11281 acatcataga attgctttat tatattaata agcattttat cactagaaat ccaatatacg
11341 aaatggttaa ttgtaactaa acccgcaggt catgtgtgtt aggtttcaca aattatatat
11401 attactaact ccatactcgt aactaacatt agataagtag gttaagaaaa aagcttgagg
11461 aagattaaga aaaactgctt attgggtctt tccgtgtttt agatgaagca gttgacattc
11521 ttcctcttga tattaaatgg ctacacaaca tacccaatac ccagacgcca ggttatcatc
11581 accaattgta ttggaccaat gtgaccttgt cactagagct tgcgggttgt attcatcata
11641 ctcccttaat ccgcaactac gcaactgtaa actcccgaaa catatatacc gtttaaaata
11701 tgatgtaact gttaccaagt tcttaagtga tgtaccagtg gcgacattgc ccatagattt
11761 catagtccca attcttctca aggcactatc aggcaatggg ttctgtcctg ttgagccgcg
11821 gtgccaacag ttcttagatg aaattattaa gtacacaatg caagatgctc tcttcctgaa
11881 atattatctc aaaaatgtgg gtgctcaaga agactgtgtt gatgaccact ttcaagaaaa
11941 aatcttatct tcaattcagg gcaatgaatt tttacatcaa atgtttttct ggtatgacct
12001 ggctatttta actcgaaggg gtagattaaa tcgaggaaac tctagatcaa cgtggtttgt
12061 tcatgatgat ttaatagaca tcttaggcta tggggactat gttttttgga agatcccaat
12121 ttcactgtta ccactgaaca cacaaggaat cccccatgct gctatggatt ggtatcagac
12181 atcagtattc aaagaagcgg ttcaagggca tacacacatt gtttctgttt ctactgccga
12241 tgtcttgata atgtgcaaag atttaattac atgtcgattc aacacaactc taatctcaaa
12301 aatagcagag gttgaggacc cagtttgctc tgattatccc aattttaaga ttgtgtctat
12361 gctttaccag agcggagatt acttactctc catattaggg tctgatgggt ataaaatcat
12421 taagtttctc gaaccattgt gcttggctaa aattcaattg tgctcaaagt acaccgagag
12481 gaagggccga ttcttaacac aaatgcattt agctgtaaat cacaccctgg aagaaattac
12541 agaaatacgt gcactaaagc cttcacaggc tcacaagatc cgtgaattcc atagaacatt
12601 gataaggctg gagatgacgc cacaacaact ttgtgagcta ttttccatac aaaaacactg
12661 ggggcatcct gtgctacata gtgaaacagc aatccaaaaa gttaaaaaac atgctacggt
12721 gctaaaagca ttacgcccta tcgtgatttt cgagacatat tgtgttttta aatatagcat
12781 tgcaaaacat tattttgata gtcaaggatc ttggtacagt gttacctcag atagaaatct
12841 aacaccaggt cttaattctt atatcaaaag aaatcaattc cctccgttgc caatgattaa
12901 agaactgcta tgggaatttt accaccttga ccatcctcca cttttctcaa ccaaaattat
12961 tagtgactta agtattttta taaaagacag agctactgca gtagaaagga catgctggga
13021 tgcagtattc gagcctaatg ttctgggata taatccacct cacaaattca gtaccaaacg
13081 tgtaccggaa caatttttag agcaagaaaa cttttctatt gagaatgttc tttcctacgc
13141 gcaaaaactc gagtatctac taccacaata tcggaatttt tctttctcat tgaaagagaa
13201 agagttgaat gtaggtagaa ctttcggaaa attgccttat ccgactcgca atgttcaaac
13261 actttgtgaa gctctgttag ctgatggtct tgctaaagca tttcctagca atatgatggt
13321 agttacggaa cgtgaacaaa aagaaagctt attgcatcaa gcatcatggc accacacaag
13381 tgatgatttc ggtgagcatg ccacagttag agggagtagc tttgtaactg atttagagaa
13441 atacaatctt gcatttaggt atgagtttac agcacctttt atagaatatt gcaaccgttg
13501 ctatggtgtt aagaatgttt ttaattggat gcattataca atcccacagt gttatatgca
13561 tgtcagtgat tattataatc caccgcataa cctcacactg gaaaatcgaa acaacccccc
13621 tgaagggcct agttcataca ggggtcatat gggagggatt gaaggactgc aacaaaaact
13681 ctggacaagt atttcatgtg ctcaaatttc tttagttgaa attaagactg gttttaagtt
13741 gcgctcagct gtgatgggtg acaatcagtg cattaccgtt ttatcagtct tccccttaga
13801 gactgatgca ggcgagcagg aacagagcgc cgaggacaat gcagcgaggg tggccgccag
13861 cctagcaaaa gttacaagtg cctgtggaat ctttttaaaa cctgatgaaa catttgtaca
13921 ttcaggtttt atctattttg gaaaaaaaca atatttgaat ggggtccaat tgcctcagtc
13981 ccttaaaacg gctacaagaa tggcaccatt gtctgatgca atttttgatg atcttcaagg
14041 gaccctggct agtataggta ctgcttttga gcgatccatc tctgagacac gacatatctt
14101 tccttgcaga ataaccgcag ctttccatac gttcttttcg gtgagaatct tgcaatatca
14161 tcacctcgga tttaataaag gttttgacct tggacagtta acactcggca aacctctgga
14221 tttcggaaca atatcattgg cactagcggt accgcaggtg cttggagggt tatccttctt
14281 gaatcctgag aaatgtttct accggaatct aggagatcca gttacctcag gtttattcca
14341 gttaaaaact tatctccgaa tgattgagat ggatgattta ttcttacctt taattgcgaa
14401 gaaccctggg aactgcactg ccattgactt tgtgctaaat cctagcggat taaatgttcc
14461 tgggtcgcaa gacttaactt catttctgcg ccagattgta cgtaggacta tcaccctaag
14521 tgcgaaaaac aaacttatta ataccttatt tcatgcatca gctgacttcg aagacgaaat
14581 ggtttgtaag tggctcttat catcaactcc tgttatgagt cgtttcgcag ccgatatatt
14641 ttcacgcacg ccgagcggga agcgattgca aattctagga tacttggaag gaacacgcac
14701 attattagcc tctaagatca tcaacaataa tacagagacg ccggttttgg acagactgag
14761 gaagataaca ttgcaaaggt ggagtctatg gtttagttat cttgatcatt gtgataatat
14821 cctggcggag gctttaaccc aaataacttg cacagttgat ttagcacaga tcctgaggga
14881 atattcatgg gcacatattt tagaggggag acctcttatt ggagccacac tcccatgtat
14941 gattgagcaa ttcaaagtgg tttggctgaa accctacgaa caatgtccgc agtgttcaaa
15001 tgccaagcaa cctggtggga aaccattcgt gtcagtagca gtcaagaaac atattgttag
15061 tgcatggcca aatgcatccc gaataagctg gactatcggg gatggaatcc catacattgg
15121 atcaaggaca gaagataaga tagggcaacc tgctattaaa ccaaaatgtc cttccgcagc
15181 cttaagagag gccattgaat tggcgtcccg tttaacatgg gtaactcaag gcagttcgaa
15241 cagtgacttg ctaataaaac catttttgga agcacgagta aatttaagtg ttcaagaaat
15301 acttcaaatg accccttcac attactcggg aaatattgtt cataggtaca acgatcaata
15361 cagtcctcat tctttcatgg ccaatcgtat gagtaactca gcaacgcgat tgattgtttc
15421 tacaaacact ttaggtgagt tttcaggagg tggccaatcg gcacgcgaca gcaatattat
15481 tttccagaat gttataaatt atgcagttgc actgttcgat attaaattta gaaacactga
15541 ggctacagat atccagtata atcgtgctca ccttcatcta actaagtgtt gcacccggga
15601 ggtaccagct cagtacttaa catacacatc tacattggat ttagatttaa caagataccg
15661 agaaaatgaa ttgatttatg acaataatcc tctaaaagga ggactcaatt gcaatatctc
15721 atttgataac ccatttttcc aaggcaaaca gctgaacatt atagaagatg accttattcg
15781 actgcctcac ttatctggat gggagctagc taagaccatc atgcaatcaa ttatttcaga
15841 tagcaataat tcgtctacag acccaattag cagtggagaa acaagatcat tcactaccca
15901 tttcttaact tatcccaaaa taggacttct gtacagtttt ggggcctttg taagttatta
15961 tcttggcaat acaattcttc ggactaagaa attaacactt gacaattttt tatattactt
16021 aactacccaa attcataatc taccacatcg ctcattgcga atacttaagc caacattcaa
16081 acatgcaagc gttatgtcac gattaatgag tattgatccc catttttcta tttacatagg
16141 cggtgctgca ggtgacagag gactctcaga tgcggccagg ttatttttga gaacgtccat
16201 ttcatctttt cttacatttg taaaggaatg gataattaat cgcggaacaa ttgtcccttt
16261 atggatagta tatccattag agggtcaaaa tccaacacct gttaataatt tcctccatca
16321 gatcgtagaa ctgctggtgc atgattcatc aagacaccag gcttttaaaa ctaccataaa
16381 tgatcatgta catcctcacg acaatcttgt ttacacatgt aagagtacag ccagcaattt
16441 cttccatgcg tcattggcgt actggaggag caggcacaga aacagcaacc gaaaagactt
16501 gacaagaaac tcttcaactg gatcaagcac aaacaacagt gatggtcata ttaagagaag
16561 tcaagaacaa accaccagag atccacatga tggcactgaa cggagtctag tcctgcaaat
16621 gagccatgaa ataaaaagaa cgacaattcc acaagagaac acgcaccagg gtccgtcgtt
16681 ccagtcattt ctaagtgact ctgcttgcgg tacagcaaac ccaaaactaa atttcgatag
16741 atcgagacac aatgtgaaat ctcaggatca taactcagca tccaagaggg aaggtcatca
16801 aataatctca catcgtctag tcctaccttt ctttacatta tctcaaggga cacgccaatt
16861 aacgtcatcc aatgagtcac aaacccaaga tgagatatca aagtacttac ggcaattgag
16921 atccgtcatt gataccacag tttattgtag gtttaccggt atagtctcgt ccatgcatta
16981 caaacttgat gaggtccttt gggaaataga gaattttaag tcggctgtga cgctggcaga
17041 gggagaaggt gctggtgcct tactattgat tcagaaatac caagttaaga ccttattctt
17101 caacacgcta gctactgagt ccagtataga gtcagaaata gtatcaggaa tgactactcc
17161 taggatgctt ctacctgtta tgtcaaaatt ccataatgac caaattgaga ttattcttaa
17221 caactcagca agccaaataa cagacataac aaatcctact tggtttaaag accaaagagc
17281 aaggctacct aggcaagtcg aggttataac catggatgca gagacgacag agaatataaa
17341 cagatcgaaa ttgtacgaag ctgtacataa attgatctta caccatgttg atcccagcgt
17401 attgaaagca gtggtcctta aagtctttct aagtgatacc gagggtatgt tatggctaaa
17461 tgataatcta gccccgtttt ttgccactgg gtatttaatt aagccaataa cgtcaagtgc
17521 caggtctagt gagtggtatc tttgtctgac gaacttctta tcaactacac gtaagatgcc
17581 acaccaaaac catctcagtt gtaagcaggt aatacttacg gcattgcaac tgcaaattca
17641 acggagccca tactggctaa gtcatttaac tcagtatgct gactgcgatt tacatttaag
17701 ctatatccgc cttggttttc catcattaga gaaagtacta taccacaggt ataaccttgt
17761 cgattcaaaa agaggtccac tagtctctgt cactcagcac ttagcacatc ttagggcaga
17821 gattcgagaa ttgaccaatg attataatca acagcgacaa agtcggactc aaacatatca
17881 ctttattcgt actgcaaaag gacgaatcac aaaactagtc aatgattatt taaaattctt
17941 tcttattgta caagcattaa aacataatgg gacatggcaa gctgagttta agaaattacc
18001 agagttgatt agtgtgtgca ataggttcta tcatattaga gattgtaatt gtgaagaacg
18061 tttcttagtt caaaccttat atttacatag aatgcaggat tctgaagtta agcttatcga
18121 aaggctgaca gggcttctga gtttatttcc agatggtctc tacaggttcg attgaataac
18181 cgtgcatagt attttgatac ttgtaaaggt tggttatcaa catacagatt ataaaaaact
18241 cataaattgc tctcatacat catcttgatc tgatttcaat aaataactat ttagataacg
18301 aaaggagtcc ttacattata cactatattt ggcctctctc cctgcgtgat aatcaaaaaa
18361 ttcacaatac agcatgtgtg acatattact gctgcaatga gtctaacgca acataataaa
18421 ctccgcactc tttataatta agctttaacg ataggtctgg gctcatattg ttattgatat
18481 agtaatgttg tatcaatatc ttgccagatg gaatagtgct ttggttgata acacgacttc
18541 ttaaaacaaa actgatcttt aagattaagt tttttataat tgtcattgct ttaatttgtc
18601 gatttaaaaa tggtgatagc cttaatcttt gtgtaaaata agagattagg tgtaataact
18661 ttaacatttt tgtctagtaa gctactattc cattcagaat gataaaatta aaagaaaaga
18721 catgactgta aaatcagaaa taccttcttt acaatatagc agactagata ataatcttcg
18781 tgttaatgat aattaaggca ttgaccacgc tcatcagaag gctcactaga ataaacgttg
18841 caaaaaggat ccctggaaaa atggtcgcac acaaaaattt aaaaataaat ctatttcttc
18901 ttttttgtgt gtcca
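
The numbered sequence lines above follow the GenBank flat-file layout: a 1-based start coordinate, then groups of up to ten bases. Below is a minimal Python sketch, assuming only that layout, for turning such lines back into one contiguous sequence. The helper names `parse_origin_line` and `join_origin_lines` are hypothetical and do not come from any library.

```python
from typing import Iterable, Tuple


def parse_origin_line(line: str) -> Tuple[int, str]:
    """Split one sequence line into (1-based start coordinate, bases)."""
    fields = line.split()
    start = int(fields[0])
    bases = "".join(fields[1:]).lower()
    return start, bases


def join_origin_lines(lines: Iterable[str]) -> str:
    """Concatenate sequence lines, checking that coordinates are contiguous."""
    chunks = []
    expected_start = None
    for line in lines:
        if not line.strip():
            continue  # skip blank lines between pasted blocks
        start, bases = parse_origin_line(line)
        if expected_start is not None and start != expected_start:
            raise ValueError(
                f"gap or overlap: expected {expected_start}, got {start}"
            )
        chunks.append(bases)
        expected_start = start + len(bases)
    return "".join(chunks)


if __name__ == "__main__":
    # The last two sequence lines posted above.
    tail = [
        "18841 caaaaaggat ccctggaaaa atggtcgcac acaaaaattt aaaaataaat ctatttcttc",
        "18901 ttttttgtgt gtcca",
    ]
    print(len(join_origin_lines(tail)))
```

The final line starts at 18901 and carries 15 bases, so the posted sequence ends at position 18915. The contiguity check is there to catch lines lost when a long record is split across forum posts.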

PostPosted: Tue Jul 29, 2014 10:22 am 
LOCUS KM233038 18914 bp cRNA linear VRL 25-JUL-2014
DEFINITION Zaire ebolavirus strain EBOV_4, partial genome.
ACCESSION KM233038
VERSION KM233038.1 GI:667852521
KEYWORDS .
SOURCE Zaire ebolavirus (ZEBOV)
ORGANISM Zaire ebolavirus
Viruses; ssRNA negative-strand viruses; Mononegavirales;
Filoviridae; Ebolavirus.
REFERENCE 1 (bases 1 to 18914)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Deep sequencing analysis of Ebola virus transmission in Sierra
Leone
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 18914)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Direct Submission
JOURNAL Submitted (25-JUL-2014) Infectious Disease Initiative, Broad
Institute of MIT and Harvard, 75 Ames St., Cambridge, MA 02142, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Novoalign v. v.3
Sequencing Technology :: Illumina; Nextera
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..18914
/organism="Zaire ebolavirus"
/mol_type="viral cRNA"
/strain="EBOV_4"
/host="Homo sapiens"
/db_xref="taxon:186538"
/country="Sierra Leone"
/collection_date="03-Jun-2014"
gene 12..2982
/gene="NP"
mRNA 12..2982
/gene="NP"
/product="nucleoprotein"
misc_signal 12..23
/gene="NP"
/note="putative transcription start signal"
CDS 426..2645
/gene="NP"
/note="encapsidation of genomic RNA"
/codon_start=1
/product="nucleoprotein"
/protein_id="AIG95911.1"
/db_xref="GI:667852522"
/translation="MDSRPQKVWMTPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
GFRFEVKKCDGVKRLEELLPAVSSGRNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
DTTIPDVVVDPDDGGYGEYQSYSENGMSAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
KNSQKGQHTEGRQTQSTPTQNVTGPRRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
QQDQDHIQEARNQDSDNTQPEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
GKEYTYPDSLEEEYPPWLTEKEAMNDENRFVTLDGQQFYWPVMNHRNKFMAILQHHQ"
polyA_signal 2971..2982
/gene="NP"
gene 2988..4363
/gene="VP35"
mRNA 2988..4363
/gene="VP35"
/product="VP35 matrix protein"
misc_signal 2988..2999
/gene="VP35"
/note="putative transcription start signal"
CDS 3085..4107
/gene="VP35"
/note="polymerase complex protein"
/codon_start=1
/product="VP35 matrix protein"
/protein_id="AIG95912.1"
/db_xref="GI:667852523"
/translation="MTTRTKGRGHTVATTQNDRMPGPELSGWISEQLMTGRIPVNDIF
CDIENNPGLCYASQMQQTKPNPKMRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLDSTTSLTEENFGKPD
ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
TLGLKI"
gene 4346..5850
/gene="VP40"
mRNA 4346..5850
/gene="VP40"
/product="matrix protein"
misc_signal 4346..4357
/gene="VP35"
/note="transcription start signal"
polyA_signal 4353..4363
/gene="VP35"
CDS 4435..5415
/gene="VP40"
/codon_start=1
/product="matrix protein"
/protein_id="AIG95913.1"
/db_xref="GI:667852524"
/translation="MRRVILPTAPPEYMEAIYPARSNSTIARGGNSNTGFLTPESVNG
DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVVEK"
polyA_signal 5839..5850
/gene="VP40"
gene 5856..8261
/gene="GP"
mRNA 5856..8261
/gene="GP"
/product="ssGP"
/note="unedited mRNA"
misc_signal 5856..5867
/gene="GP"
/note="putative transcription start signal"
CDS join(5995..6879,6879..8024)
/gene="GP"
/ribosomal_slippage
/note="additional a residue inserted during transcription;
encodes two disulfide linked subunits GP1 and GP2;
receptor binding and fusion"
/codon_start=1
/product="virion spike glycoprotein precursor"
/protein_id="AIG95914.1"
/db_xref="GI:667852525"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNGPKNISGQSPARTSSDPETNT
TNEDHKIMASENSSAMVQVHSQGRKAAVSHLTTLATISTSPQPPTTKTGPDNSTHNTP
VYKLDISEATQVGQHHRRADNDSTASDTPPATTAAGPLKAENTNTSKSADSLDLATTT
SPQNYSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREVIVNAQ
PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETT
QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
CDS 5995..7089
/gene="GP"
/note="small non-structural secreted glycoprotein; sGP
secreted as an anti-parallel oriented homodimer"
/codon_start=1
/product="sGP"
/protein_id="AIG95915.1"
/db_xref="GI:667852526"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTDPKTSVVRVRRELLPTQRPTQ
QMKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
CDS join(5995..6879,6881..6889)
/gene="GP"
/ribosomal_slippage
/note="second non-structural secreted glycoprotein;
secreted in a monomeric form; one a residue is deleted or
two additional a residues are inserted at the editing site
during transcription of the GP gene"
/codon_start=1
/product="ssGP"
/protein_id="AIG95916.1"
/db_xref="GI:667852527"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKPH"
gene 8244..9696
/gene="VP30"
mRNA 8244..9696
/gene="VP30"
/product="VP30 minor nucleoprotein"
misc_signal 8244..8255
/gene="VP30"
/note="putative transcription start signal"
polyA_signal 8251..8261
/gene="VP30"
CDS 8465..9331
/gene="VP30"
/note="minor nucleoprotein; polymerase complex protein"
/codon_start=1
/product="VP30 minor nucleoprotein"
/protein_id="AIG95917.1"
/db_xref="GI:667852528"
/translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
PGTCSWSDEGTP"
polyA_signal 9686..9696
/gene="VP30"
/note="putative"
gene 9841..11474
/gene="VP24"
/note="putative"
mRNA 9841..11452
/gene="VP24"
/product="VP24 membrane-associated protein"
misc_signal 9841..9852
/gene="VP24"
/note="transcription start signal"
CDS 10301..11056
/gene="VP24"
/note="membrane-associated protein"
/codon_start=1
/product="VP24 membrane-associated protein"
/protein_id="AIG95918.1"
/db_xref="GI:667852529"
/translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRKKPGPAK
FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
polyA_signal 11441..11452
/gene="VP24"
/note="putative"
gene 11457..18238
/gene="L"
mRNA 11457..18238
/gene="L"
/product="polymerase"
misc_signal 11457..11468
/gene="VP24"
/note="transcription start signal"
polyA_signal 11464..11474
/gene="VP24"
/note="putative"
CDS 11537..18175
/gene="L"
/note="polymerase; synthesis of viral RNAs;
transcriptional RNA editing"
/codon_start=1
/product="polymerase"
/protein_id="AIG95919.1"
/db_xref="GI:667852530"
/translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPILLKALSGNGFCPVEPRCQQFLDE
IIKYTMQDALFLKYYLKNVGAQEDCVDDHFQEKILSSIQGNEFLHQMFFWYDLAILTR
RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISLLPLNTQGIPHAAMDWYQTSVF
KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEVEDPVCSDYPNFKIVSML
YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
TEIRALKPSQAHKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
PHNLTLENRNNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
GDNQCITVLSVFPLETDAGEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVVWLKP
YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
LIYDNNPLKGGLNCNISFDNPFFQGKQLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLHQIVELLVHDSSRHQAF
KTTINDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKDLTRNSSTGSSTNNS
DGHIKRSQEQTTRDPHDGTERSLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
ANPKLNFDRSRHNVKSQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIENFKSAVTLAEGEGAGAL
LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
ITDITNPTWFKDQRARLPRQVEVITMDAETTENINRSKLYEAVHKLILHHVDPSVLKA
VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCDLHLSYIRLGFPSLEKVLYHRYNL
VDSKRGPLVSVTQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
VKLIERLTGLLSLFPDGLYRFD"
polyA_signal 18228..18238
/gene="L"
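
The feature table above gives CDS coordinates either as a simple range (`426..2645`) or as a `join(...)` of several ranges, as in the GP entries. Below is a minimal Python sketch that parses a location string and checks that each CDS length is a whole number of codons. It assumes only those two forms appear (no `complement(...)`, no `<`/`>` partial markers, no remote references). The function names are hypothetical.

```python
import re
from typing import List, Tuple

Span = Tuple[int, int]  # 1-based, inclusive (start, end)

_RANGE = re.compile(r"(\d+)\.\.(\d+)")


def parse_location(location: str) -> List[Span]:
    """Return the ranges in a simple or join(...) location, in order."""
    spans = [(int(a), int(b)) for a, b in _RANGE.findall(location)]
    if not spans:
        raise ValueError(f"no start..end range found in {location!r}")
    return spans


def cds_length(location: str) -> int:
    """Total number of bases covered, counting repeated positions each time."""
    return sum(end - start + 1 for start, end in parse_location(location))


if __name__ == "__main__":
    # CDS locations copied from the feature table above.
    cds_locations = {
        "NP": "426..2645",
        "GP": "join(5995..6879,6879..8024)",
        "sGP": "5995..7089",
        "ssGP": "join(5995..6879,6881..6889)",
        "L": "11537..18175",
    }
    for name, loc in cds_locations.items():
        n = cds_length(loc)
        codons, remainder = divmod(n, 3)
        print(name, n, codons, remainder)
```

In the GP entry, position 6879 appears in both halves of the join, so it is counted twice. That matches the record's note that an additional A residue is inserted during transcription, and it is why the joined length (2031) divides evenly by three.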

PostPosted: Tue Jul 29, 2014 10:22 am 
1 cgaataacta tgaggaagat taataatttt cctctcattg aaatttatat cggaatttaa
61 attgaaattg ttactgtaat catacctggt ttgtttcaga gccatatcac caagatagag
121 aacaacctag gtctccggag ggggcaaggg catcagtgtg ctcagttgaa aatcccttgt
181 caacatctag gccttatcac atcacaagtt ccgccttaaa ctctgcaggg tgatccaaca
241 accttaatag caacattatt gttaaaggac agcattagtt cacagtcaaa caagcaagat
301 tgagaattaa ctttgatttt gaacctgaac acccagagga ctggagactc aacaacccta
361 aagcctgggg taaaacatta gaaatagttt aaagacaaat tgctcggaat cacaaaattc
421 cgagtatgga ttctcgtcct cagaaagtct ggatgacgcc gagtctcact gaatctgaca
481 tggattacca caagatcttg acagcaggtc tgtccgttca acaggggatt gttcggcaaa
541 gagtcatccc agtgtatcaa gtaaacaatc ttgaggaaat ttgccaactt atcatacagg
601 cctttgaagc tggtgttgat tttcaagaga gtgcggacag tttccttctc atgctttgtc
661 ttcatcatgc gtaccaagga gattacaaac ttttcttgga aagtggcgca gtcaagtatt
721 tggaagggca cgggttccgt tttgaagtca agaagtgtga tggagtgaag cgccttgagg
781 aattgctgcc agcagtatct agtgggagaa acattaagag aacacttgct gccatgccgg
841 aagaggagac gactgaagct aatgccggtc agttcctctc ctttgcaagt ctattccttc
901 cgaaattggt agtaggagaa aaggcttgcc ttgagaaggt tcaaaggcaa attcaagtac
961 atgcagagca aggactgata caatatccaa cagcttggca atcagtagga cacatgatgg
1021 tgattttccg tttgatgcga acaaattttt tgatcaaatt tcttctaata caccaaggga
1081 tgcacatggt tgccggacat gatgccaacg atgctgtgat ttcaaattca gtggctcaag
1141 ctcgtttttc aggtctattg attgtcaaaa cagtacttga tcatatccta caaaagacag
1201 aacgaggagt tcgtctccat cctcttgcaa ggaccgccaa ggtaaaaaat gaggtgaact
1261 ccttcaaggc tgcactcagc tccctggcca agcatggaga gtatgctcct ttcgcccgac
1321 ttttgaacct ttctggagta aataatcttg agcatggtct tttccctcaa ctgtcggcaa
1381 ttgcactcgg agtcgccaca gcccacggga gcaccctcgc aggagtaaat gttggagaac
1441 agtatcaaca gctcagagag gcagccactg aggctgagaa gcaactccaa caatatgcgg
1501 agtctcgtga acttgaccat cttggacttg atgatcagga aaagaaaatt cttatgaact
1561 tccatcagaa aaagaacgaa atcagcttcc agcaaacaaa cgcgatggta actctaagaa
1621 aagagcgcct ggccaagctg acagaagcta tcactgctgc atcactgccc aaaacaagtg
1681 gacattacga tgatgatgac gacattccct ttccaggacc catcaatgat gacgacaatc
1741 ctggccatca agatgatgat ccgactgact cacaggatac gaccattccc gatgtggtag
1801 ttgaccccga tgatggaggc tacggcgaat accaaagtta ctcggaaaac ggcatgagtg
1861 caccagatga cttggtccta ttcgatctag acgaggacga cgaggacacc aagccagtgc
1921 ctaacagatc gaccaagggt ggacaacaga aaaacagtca aaagggccag catacagagg
1981 gcagacagac acaatccacg ccaactcaaa acgtcacagg ccctcgcaga acaatccacc
2041 atgccagtgc tccactcacg gacaatgaca gaagaaacga accctccggc tcaaccagcc
2101 ctcgcatgct gaccccaatc aacgaagagg cagacccact ggacgatgcc gacgacgaga
2161 cgtctagcct tccgccctta gagtcagatg atgaagaaca ggacagggac ggaacttcta
2221 accgcacacc cactgtcgcc ccaccggctc ccgtatacag agatcactcc gaaaagaaag
2281 aactcccgca agatgaacaa caagatcagg accacattca agaggccagg aaccaagaca
2341 gtgacaacac ccagccagaa cattcttttg aggagatgta tcgccacatt ctaagatcac
2401 aggggccatt tgatgccgtt ttgtattatc atatgatgaa ggatgagcct gtagttttca
2461 gtaccagtga tggtaaagag tacacgtatc cggactccct tgaagaggaa tatccaccat
2521 ggctcactga aaaagaggcc atgaatgatg agaatagatt tgttacactg gatggtcaac
2581 aattttattg gccagtaatg aatcacagga ataaattcat ggcaatcctg caacatcatc
2641 agtgaatgag catgtaataa tgggatgatt taatcgacaa atagctaaca ttaaatagtc
2701 aaggaacgca aacaggaaga atttttgatg tctaaggtgt gaattattat cacaataaaa
2761 gtgattctta gttttgaatt taaagctagc ttattattac tagccgtttt tcaaagttca
2821 atttgagtct taatgcaaat aagcgttaag ccacagttat agccataatg gtaactcaat
2881 atcttagcca gcgatttatc taaattaaat tacattatgc ttttataact tacctactag
2941 cctgcccaac atttacacga tcgttttata attaagaaaa aactaatgat gaagattaaa
3001 accttcatca tccttacgtc aattgaattc tctagcacta gaagcttatt gtcttcaatg
3061 taaaagaaaa gctggcctaa caagatgaca actagaacaa agggcagggg ccatactgtg
3121 gccacgactc aaaacgacag aatgccaggc cctgagcttt cgggctggat ctctgagcag
3181 ctaatgaccg gaaggattcc tgtaaacgac atcttctgtg atattgagaa caatccagga
3241 ttatgctacg catcccaaat gcaacaaacg aagccaaacc cgaagatgcg caacagtcaa
3301 acccaaacgg acccaatttg caatcatagt tttgaggagg tagtacaaac attggcttca
3361 ttggctactg ttgtgcaaca acaaaccatc gcatcagaat cattagaaca acgcattacg
3421 agtcttgaga atggtctaaa gccagtttat gatatggcaa aaacaatctc ctcattgaac
3481 agggtttgtg ctgagatggt tgcaaaatat gatcttctgg tgatgacaac cggtcgggca
3541 acagcaaccg ctgcggcaac tgaggcttat tgggctgaac atggtcaacc accacctgga
3601 ccatcacttt atgaagaaag tgcgattcgg ggtaagattg aatctagaga tgagactgtc
3661 cctcaaagtg ttagggaggc attcaacaat ctagacagta ccacttcact aactgaggaa
3721 aattttggga aacctgacat ttcggcaaag gatttgagaa acattatgta tgatcacttg
3781 cctggttttg gaactgcttt ccaccaatta gtacaagtga tttgtaaatt gggaaaagat
3841 agcaattcat tggacattat tcatgctgag ttccaggcca gcctggctga aggagactcc
3901 cctcaatgtg ccctaattca aattacaaaa agagttccaa tcttccaaga tgctgctcca
3961 cctgtcatcc acatccgctc tcgaggtgac attccccgag cttgccagaa gagcttgcgt
4021 ccagtcccac catcacccaa gattgatcga ggttgggtat gtgtttttca gcttcaagat
4081 ggtaaaacac ttggactcaa aatttgagcc aatctctttt ccctccgaaa gaggcaacta
4141 atagcagagg cttcaactgc tgaactatag ggtatgttac attaatgata cacttgtgag
4201 tatcagccct agataatata agtcaattaa acaaccaaga taaaattgtt catatcccgc
4261 tagcagcttt aaagataaat gtaataggag ctatacctct gacagtatta taattaattg
4321 ttattaagta acccaaacca aaaatgatga agattaagaa aaacctacct cgactgagag
4381 agtgtttttt cattaacctt catcttgtaa acgttgagca aaattgttaa aaatatgagg
4441 cgggttatat tgcctactgc tcctcctgaa tatatggagg ccatataccc tgccaggtca
4501 aattcaacaa ttgctagggg tggcaacagc aatacaggct tcctgacacc ggagtcagtc
4561 aatggagaca ctccatcgaa tccactcagg ccaattgctg atgacaccat cgaccatgcc
4621 agccacacac caggcagtgt gtcatcagca ttcatcctcg aagctatggt gaatgtcata
4681 tcgggcccca aagtgctaat gaagcaaatt ccaatttggc ttcctctagg tgtcgctgat
4741 caaaagacct acagctttga ctcaactacg gccgccatca tgcttgcttc atatactatc
4801 acccatttcg gcaaggcaac caatccgctt gtcagagtca atcggctggg tcctggaatc
4861 ccggatcacc ccctcaggct cctgcgaatt ggaaaccagg ctttcctcca ggagttcgtt
4921 cttccaccag tccaactacc ccagtatttc acctttgatt tgacagcact caaactgatc
4981 actcaaccac tgcctgctgc aacatggacc gatgacactc caactggatc aaatggagcg
5041 ttgcgtccag gaatttcatt tcatccaaaa cttcgcccca ttcttttacc caacaaaagt
5101 gggaagaagg ggaacagtgc cgatctaaca tctccggaga aaatccaagc aataatgact
5161 tcactccagg actttaagat cgttccaatt gatccaacca aaaatatcat gggtatcgaa
5221 gtgccagaaa ctctggtcca caagctgacc ggtaagaagg tgacttccaa aaatggacaa
5281 ccaatcatcc ctgttctttt gccaaagtac attgggttgg acccggtggc tccaggagac
5341 ctcaccatgg taatcacaca ggattgtgac acgtgtcatt ctcctgcaag tcttccagct
5401 gtggttgaga agtaattgca ataattgact cagatccagt tttacagaat cttctcaggg
5461 atagtgataa catcttttta ataatccgtc tactagaaga gatacttcta attgatcaat
5521 atactaaagg tgctttacac cattgtctct tttctctcct aaatgtagag cttaacaaaa
5581 gactcataat atacctgttt ttaaaagatt gattgatgaa agatcatgac taataacatt
5641 acaaacaatc ctactataat caatacggtg attcaaatgt caatctttct cattgcacat
5701 actctttgtc cttatcctca aattgcctac atgcttacat ctgaggacag ccagtgtgac
5761 ttggattgga gatgtggagg aaaaatcggg gcccatttct aagttgttca caatctaagt
5821 acagacattg ctcttctaat taagaaaaaa tcggcgatga agattaagcc gacagtgagc
5881 gtaatcttca tctctcttag attatttgtc ttccagagta ggggtcatca ggtccttttc
5941 aattggataa ccaaaataag cttcactaga aggatattgt gaggcgacaa cacaatgggt
6001 gttacaggaa tattgcagtt acctcgtgat cgattcaaga ggacatcatt ctttctttgg
6061 gtaattatcc ttttccaaag aacattttcc atcccgcttg gagttatcca caatagtaca
6121 ttacaggtta gtgatgtcga caaactagtt tgtcgtgaca aactgtcatc cacaaatcaa
6181 ttgagatcag ttggactgaa tctcgagggg aatggagtgg caactgacgt gccatctgtg
6241 actaaaagat ggggcttcag gtccggtgtc ccaccaaagg tggtcaatta tgaagctggt
6301 gaatgggctg aaaactgcta caatcttgaa atcaaaaaac ctgacgggag tgagtgtcta
6361 ccagcagcgc cagacgggat tcggggcttc ccccggtgcc ggtatgtgca caaagtatca
6421 ggaacgggac catgtgccgg agactttgcc ttccacaaag agggtgcttt cttcctgtat
6481 gatcgacttg cttccacagt tatctaccga ggaacgactt tcgctgaagg tgtcgttgca
6541 tttctgatac tgccccaagc taagaaggac ttcttcagct cacacccctt gagagagccg
6601 gtcaatgcaa cggaggaccc gtcgagtggc tattattcta ccacaattag atatcaggct
6661 accggttttg gaactaatga gacagagtac ttgttcgagg ttgacaattt gacctacgtc
6721 caacttgaat caagattcac accacagttt ctgctccagc tgaatgagac aatatatgca
6781 agtgggaaga ggagcaacac cacgggaaaa ctaatttgga aggtcaaccc cgaaattgat
6841 acaacaatcg gggagtgggc cttctgggaa actaaaaaaa cctcactaga aaaattcgca
6901 gtgaagagtt gtctttcaca gctgtatcaa acggacccaa aaacatcagt ggtcagagtc
6961 cggcgcgaac ttcttccgac ccagagacca acacaacaaa tgaagaccac aaaatcatgg
7021 cttcagaaaa ttcctctgca atggttcaag tgcacagtca aggaaggaaa gctgcagtgt
7081 cgcatctgac aacccttgcc acaatctcca cgagtcctca acctcccaca accaaaacag
7141 gtccggacaa cagcacccat aatacacccg tgtataaact tgacatctct gaggcaactc
7201 aagttggaca acatcaccgt agagcagaca acgacagcac agcctccgac actccccccg
7261 ccacgaccgc agccggaccc ttaaaagcag agaacaccaa cacgagtaag agcgctgact
7321 ccctggacct cgccaccacg acaagccccc aaaactacag cgagactgct ggcaacaaca
7381 acactcatca ccaagatacc ggagaagaga gtgccagcag cgggaagcta ggcttaatta
7441 ccaatactat tgctggagta gcaggactga tcacaggcgg gagaaggact cgaagagaag
7501 taattgtcaa tgctcaaccc aaatgcaacc ccaatttaca ttactggact actcaggatg
7561 aaggtgctgc aatcggattg gcctggatac catatttcgg gccagcagcc gaaggaattt
7621 acacagaggg gctaatgcac aaccaagatg gtttaatctg tgggttgagg cagctggcca
7681 acgaaacgac tcaagctctc caactgttcc tgagagccac aactgagctg cgaacctttt
7741 caatcctcaa ccgtaaggca attgacttcc tgctgcagcg atggggtggc acatgccaca
7801 ttttgggacc ggactgctgt atcgaaccac atgattggac caagaacata acagacaaaa
7861 ttgatcagat tattcatgat tttgttgata aaacccttcc ggaccagggg gacaatgaca
7921 attggtggac aggatggaga caatggatac cggcaggtat tggagttaca ggtgttataa
7981 ttgcagttat cgctttattc tgtatatgca aatttgtctt ttagtctttc ttcagattgt
8041 ttcacggcaa aactcaacct caaatcaatg aaactaggat ttaattatat gaatcacttg
8101 aatctaagat tacttgacaa atgataacat aatacactgg agcttcaaac atagccaatg
8161 tgattctaac tcctttaaac tcacagttaa tcataaacaa ggtttgacat caatctagct
8221 atatctttaa gaatgataaa cttgatgaag attaagaaaa aggtaatctt tcgattatct
8281 ttagtcttca tccttgattc tacaatcatg acagttgtct ttaatgaaaa aggaaaaaag
8341 cctttttatt aagttgtaat aatcagatct gcaaaccggt agaatttagt tgtaacctaa
8401 cacacacaaa gcattggtaa aaaagtcaat agaaatttaa acagtgagtg cagacaactc
8461 ttaaatggaa gcttcatatg agagaggacg cccccgagct gccagacagc attcaaggga
8521 tggacacgac caccatgttc gagcacgatc atcatccaga gagaattatc gaggtgagta
8581 ccgtcaatca aggagcgcct cacaagtgcg cgttcctact gtatttcata agaagagagt
8641 tgaaccatta acagttcctc cagcacctaa agacatatgt ccgaccttga aaaaaggatt
8701 tttgtgtgac agtagttttt gcaaaaaaga ccaccagtta gaaagtttaa ctgataggga
8761 attactccta ctaatcgccc gtaagacttg tggatcagta gaacaacaat taaatataac
8821 tgcacccaag gactcgcgct tagcaaatcc aacggctgat gatttccagc aagaggaagg
8881 tcccaaaatt accttgttga cactgatcaa gacggcagaa cactgggcga gacaagacat
8941 ccgaaccata gaggattcca aattaagggc attgttaact ctatgtgctg tgatgacgag
9001 gaaattctca aaatcccagc tgagtctttt gtgtgagaca cacctaaggc gcgaagggct
9061 tgggcaagat caggcagaac ccgttctcga agtatatcaa cgattacaca gtgataaagg
9121 aggcagtttt gaagctgcac tatggcaaca atgggaccga caatccctaa ttatgtttat
9181 cactgcattc ttgaatatcg ctctccagtt accgtgtgaa agttctgctg tcgttgtttc
9241 agggttaaga acattggttc ctcaatcaga taatgaggaa gcttcaacca acccggggac
9301 atgctcatgg tctgatgagg gtacccctta ataaggctga ctaaaacact atataacctt
9361 ctacttgatc acaatactcc gtatacctat catcatatat ttaatcaaga cgatatcctt
9421 taaaacttat tcagtactat aatcactctc atttcaaatt gataagatat gcataattgc
9481 cttaatatat aaagaggtat gatataaccc aaacattgac caaagaaaat cataatctcg
9541 tatcgctcgc aatataacct gccaagcata cctcttgcac aaagtgattc ttgtacacaa
9601 ataatgtttg actctacagg aggtagcaac gatccatctc atcaaaaaat aagtatttta
9661 tgatttacta atgatctctt aaaatattaa gaaaaactga cggaacataa attctttctg
9721 cttcaagttg tggaggaggt ctatggtatt cgctattgtt atattacaat caataacaag
9781 cttgtaaaaa tattgttctt gtttcaggag gtatattgtg accggaaaag ctaaactaat
9841 gatgaagatt aatgcggagg tctgatgaga ataaacctta ttattcagat taggccccaa
9901 gaggcattct tcatctcctt ttagcaaaat actatttcag gatagtccag ctagtgacac
9961 gtcttttagc tgtataccag ttgcccctga gatacgccac aaaagtgtct ctgagctaaa
10021 gtggtctgta cacatctcat acattgtatt aggggcaata atatctaatt gaacttagcc
10081 atttaaaatt tagtgcataa atctgggcta actccaccag gtcaactcca ttggctgaaa
10141 agaagcccac ctacaacgaa cattactttg agcaccctca caattaaaaa ataagagcgt
10201 cgttccaaca atcgagcgca aggttacaag gttgaactga gagtgtctag acaacaaaat
10261 atcgatactc cagacaccaa gcaagacctg agaaaaaacc atggccaaag ctacgggacg
10321 atacaatcta atatcgccca aaaaggacct ggagaaaggg gttgtcttaa gcgacctctg
10381 taacttctta gttagtcaaa ctattcaagg gtggaaagtt tattgggctg gtattgagtt
10441 tgatgtgact cacaaaggaa tggccctatt gcatagactg aaaactaatg actttgcccc
10501 tgcatggtca atgacaagga acctatttcc ccatttattt caaaatccga attccactat
10561 tgaatcaccg ctgtgggcac tgagagtcat ccttgcagca gggatacagg accagttaat
10621 tgaccagtct ttgattgaac ccttagcagg agcccttggt ctgatctctg attggctgct
10681 aacaaccaac actaaccatt tcaacatgcg aacacaacgt gtcaaggaac aattgagcct
10741 aaaaatgctg tcgttgattc gatccaatat tctcaagttt attaacaaat tggatgctct
10801 acatgtcgtg aactacaatg gattattgag cagtattgaa attggaactc aaaatcatac
10861 aatcatcata actcgaacta acatgggttt tctggtggag ctccaagaac ccgacaaatc
10921 ggcaatgaac cgcaagaagc ctgggccggc gaaattttcc ctccttcatg agtccacact
10981 gaaagcattt acacaagggt cctcgacacg aatgcaaagt ttaattcttg aattcaatag
11041 ctctcttgct atctaactaa gatggaatac ttcatattgg gctaactcat atatgctgac
11101 tcaatagtta acttgacatc tctgccttca taatcagata tataagcata ataaataaat
11161 actcatattt cttgataatt tgtttaacca cagataaatc ctcactgtaa gccagcttcc
11221 aagttgacac ccttacaaaa accaggactc agaatccctc aaataagaga ttccaagaca
11281 acatcataga attgctttat tatattaata agcattttat cactagaaat ccaatatacg
11341 aaatggttaa ttgtaactaa acccgcaggt catgtgtgtt aggtttcaca aattatatat
11401 attactaact ccatactcgt aactaacatt agataagtag gttaagaaaa aagcttgagg
11461 aagattaaga aaaactgctt attgggtctt tccgtgtttt agatgaagca gttgacattc
11521 ttcctcttga tattaaatgg ctacacaaca tacccaatac ccagacgcca ggttatcatc
11581 accaattgta ttggaccaat gtgaccttgt cactagagct tgcgggttgt attcatcata
11641 ctcccttaat ccgcaactac gcaactgtaa actcccgaaa catatatacc gtttaaaata
11701 tgatgtaact gttaccaagt tcttaagtga tgtaccagtg gcgacattgc ccatagattt
11761 catagtccca attcttctca aggcactatc aggcaatggg ttctgtcctg ttgagccgcg
11821 gtgccaacag ttcttagatg aaattattaa gtacacaatg caagatgctc tcttcctgaa
11881 atattatctc aaaaatgtgg gtgctcaaga agactgtgtt gatgaccact ttcaagaaaa
11941 aatcttatct tcaattcagg gcaatgaatt tttacatcaa atgtttttct ggtatgacct
12001 ggctatttta actcgaaggg gtagattaaa tcgaggaaac tctagatcaa cgtggtttgt
12061 tcatgatgat ttaatagaca tcttaggcta tggggactat gttttttgga agatcccaat
12121 ttcactgtta ccactgaaca cacaaggaat cccccatgct gctatggatt ggtatcagac
12181 atcagtattc aaagaagcgg ttcaagggca tacacacatt gtttctgttt ctactgccga
12241 tgtcttgata atgtgcaaag atttaattac atgtcgattc aacacaactc taatctcaaa
12301 aatagcagag gttgaggacc cagtttgctc tgattatccc aattttaaga ttgtgtctat
12361 gctttaccag agcggagatt acttactctc catattaggg tctgatgggt ataaaatcat
12421 taagtttctc gaaccattgt gcttggctaa aattcaattg tgctcaaagt acaccgagag
12481 gaagggccga ttcttaacac aaatgcattt agctgtaaat cacaccctgg aagaaattac
12541 agaaatacgt gcactaaagc cttcacaggc tcacaagatc cgtgaattcc atagaacatt
12601 gataaggctg gagatgacgc cacaacaact ttgtgagcta ttttccatac aaaaacactg
12661 ggggcatcct gtgctacata gtgaaacagc aatccaaaaa gttaaaaaac atgctacggt
12721 gctaaaagca ttacgcccta tcgtgatttt cgagacatat tgtgttttta aatatagcat
12781 tgcaaaacat tattttgata gtcaaggatc ttggtacagt gttacctcag atagaaatct
12841 aacaccaggt cttaattctt atatcaaaag aaatcaattc cctccgttgc caatgattaa
12901 agaactgcta tgggaatttt accaccttga ccatcctcca cttttctcaa ccaaaattat
12961 tagtgactta agtattttta taaaagacag agctactgca gtagaaagga catgctggga
13021 tgcagtattc gagcctaatg ttctgggata taatccacct cacaaattca gtaccaaacg
13081 tgtaccggaa caatttttag agcaagaaaa cttttctatt gagaatgttc tttcctacgc
13141 gcaaaaactc gagtatctac taccacaata tcggaatttt tctttctcat tgaaagagaa
13201 agagttgaat gtaggtagaa ctttcggaaa attgccttat ccgactcgca atgttcaaac
13261 actttgtgaa gctctgttag ctgatggtct tgctaaagca tttcctagca atatgatggt
13321 agttacggaa cgtgaacaaa aagaaagctt attgcatcaa gcatcatggc accacacaag
13381 tgatgatttc ggtgagcatg ccacagttag agggagtagc tttgtaactg atttagagaa
13441 atacaatctt gcatttaggt atgagtttac agcacctttt atagaatatt gcaaccgttg
13501 ctatggtgtt aagaatgttt ttaattggat gcattataca atcccacagt gttatatgca
13561 tgtcagtgat tattataatc caccgcataa cctcacactg gaaaatcgaa acaacccccc
13621 tgaagggcct agttcataca ggggtcatat gggagggatt gaaggactgc aacaaaaact
13681 ctggacaagt atttcatgtg ctcaaatttc tttagttgaa attaagactg gttttaagtt
13741 gcgctcagct gtgatgggtg acaatcagtg cattaccgtt ttatcagtct tccccttaga
13801 gactgatgca ggcgagcagg aacagagcgc cgaggacaat gcagcgaggg tggccgccag
13861 cctagcaaaa gttacaagtg cctgtggaat ctttttaaaa cctgatgaaa catttgtaca
13921 ttcaggtttt atctattttg gaaaaaaaca atatttgaat ggggtccaat tgcctcagtc
13981 ccttaaaacg gctacaagaa tggcaccatt gtctgatgca atttttgatg atcttcaagg
14041 gaccctggct agtataggta ctgcttttga gcgatccatc tctgagacac gacatatctt
14101 tccttgcaga ataaccgcag ctttccatac gttcttttcg gtgagaatct tgcaatatca
14161 tcacctcgga tttaataaag gttttgacct tggacagtta acactcggca aacctctgga
14221 tttcggaaca atatcattgg cactagcggt accgcaggtg cttggagggt tatccttctt
14281 gaatcctgag aaatgtttct accggaatct aggagatcca gttacctcag gtttattcca
14341 gttaaaaact tatctccgaa tgattgagat ggatgattta ttcttacctt taattgcgaa
14401 gaaccctggg aactgcactg ccattgactt tgtgctaaat cctagcggat taaatgttcc
14461 tgggtcgcaa gacttaactt catttctgcg ccagattgta cgtaggacta tcaccctaag
14521 tgcgaaaaac aaacttatta ataccttatt tcatgcatca gctgacttcg aagacgaaat
14581 ggtttgtaag tggctcttat catcaactcc tgttatgagt cgtttcgcag ccgatatatt
14641 ttcacgcacg ccgagcggga agcgattgca aattctagga tacttggaag gaacacgcac
14701 attattagcc tctaagatca tcaacaataa tacagagacg ccggttttgg acagactgag
14761 gaagataaca ttgcaaaggt ggagtctatg gtttagttat cttgatcatt gtgataatat
14821 cctggcggag gctttaaccc aaataacttg cacagttgat ttagcacaga tcctgaggga
14881 atattcatgg gcacatattt tagaggggag acctcttatt ggagccacac tcccatgtat
14941 gattgagcaa ttcaaagtgg tttggctgaa accctacgaa caatgtccgc agtgttcaaa
15001 tgccaagcaa cctggtggga aaccattcgt gtcagtagca gtcaagaaac atattgttag
15061 tgcatggcca aatgcatccc gaataagctg gactatcggg gatggaatcc catacattgg
15121 atcaaggaca gaagataaga tagggcaacc tgctattaaa ccaaaatgtc cttccgcagc
15181 cttaagagag gccattgaat tggcgtcccg tttaacatgg gtaactcaag gcagttcgaa
15241 cagtgacttg ctaataaaac catttttgga agcacgagta aatttaagtg ttcaagaaat
15301 acttcaaatg accccttcac attactcggg aaatattgtt cataggtaca acgatcaata
15361 cagtcctcat tctttcatgg ccaatcgtat gagtaactca gcaacgcgat tgattgtttc
15421 tacaaacact ttaggtgagt tttcaggagg tggccaatcg gcacgcgaca gcaatattat
15481 tttccagaat gttataaatt atgcagttgc actgttcgat attaaattta gaaacactga
15541 ggctacagat atccagtata atcgtgctca ccttcatcta actaagtgtt gcacccggga
15601 ggtaccagct cagtacttaa catacacatc tacattggat ttagatttaa caagataccg
15661 agaaaatgaa ttgatttatg acaataatcc tctaaaagga ggactcaatt gcaatatctc
15721 atttgataac ccatttttcc aaggcaaaca gctgaacatt atagaagatg accttattcg
15781 actgcctcac ttatctggat gggagctagc taagaccatc atgcaatcaa ttatttcaga
15841 tagcaataat tcgtctacag acccaattag cagtggagaa acaagatcat tcactaccca
15901 tttcttaact tatcccaaaa taggacttct gtacagtttt ggggcctttg taagttatta
15961 tcttggcaat acaattcttc ggactaagaa attaacactt gacaattttt tatattactt
16021 aactacccaa attcataatc taccacatcg ctcattgcga atacttaagc caacattcaa
16081 acatgcaagc gttatgtcac gattaatgag tattgatccc catttttcta tttacatagg
16141 cggtgctgca ggtgacagag gactctcaga tgcggccagg ttatttttga gaacgtccat
16201 ttcatctttt cttacatttg taaaggaatg gataattaat cgcggaacaa ttgtcccttt
16261 atggatagta tatccattag agggtcaaaa tccaacacct gttaataatt tcctccatca
16321 gatcgtagaa ctgctggtgc atgattcatc aagacaccag gcttttaaaa ctaccataaa
16381 tgatcatgta catcctcacg acaatcttgt ttacacatgt aagagtacag ccagcaattt
16441 cttccatgcg tcattggcgt actggaggag caggcacaga aacagcaacc gaaaagactt
16501 gacaagaaac tcttcaactg gatcaagcac aaacaacagt gatggtcata ttaagagaag
16561 tcaagaacaa accaccagag atccacatga tggcactgaa cggagtctag tcctgcaaat
16621 gagccatgaa ataaaaagaa cgacaattcc acaagagaac acgcaccagg gtccgtcgtt
16681 ccagtcattt ctaagtgact ctgcttgcgg tacagcaaac ccaaaactaa atttcgatag
16741 atcgagacac aatgtgaaat ctcaggatca taactcagca tccaagaggg aaggtcatca
16801 aataatctca catcgtctag tcctaccttt ctttacatta tctcaaggga cacgccaatt
16861 aacgtcatcc aatgagtcac aaacccaaga tgagatatca aagtacttac ggcaattgag
16921 atccgtcatt gataccacag tttattgtag gtttaccggt atagtctcgt ccatgcatta
16981 caaacttgat gaggtccttt gggaaataga gaattttaag tcggctgtga cgctggcaga
17041 gggagaaggt gctggtgcct tactattgat tcagaaatac caagttaaga ccttattctt
17101 caacacgcta gctactgagt ccagtataga gtcagaaata gtatcaggaa tgactactcc
17161 taggatgctt ctacctgtta tgtcaaaatt ccataatgac caaattgaga ttattcttaa
17221 caactcagca agccaaataa cagacataac aaatcctact tggtttaaag accaaagagc
17281 aaggctacct aggcaagtcg aggttataac catggatgca gagacgacag agaatataaa
17341 cagatcgaaa ttgtacgaag ctgtacataa attgatctta caccatgttg atcccagcgt
17401 attgaaagca gtggtcctta aagtctttct aagtgatacc gagggtatgt tatggctaaa
17461 tgataatcta gccccgtttt ttgccactgg gtatttaatt aagccaataa cgtcaagtgc
17521 caggtctagt gagtggtatc tttgtctgac gaacttctta tcaactacac gtaagatgcc
17581 acaccaaaac catctcagtt gtaagcaggt aatacttacg gcattgcaac tgcaaattca
17641 acggagccca tactggctaa gtcatttaac tcagtatgct gactgcgatt tacatttaag
17701 ctatatccgc cttggttttc catcattaga gaaagtacta taccacaggt ataaccttgt
17761 cgattcaaaa agaggtccac tagtctctgt cactcagcac ttagcacatc ttagggcaga
17821 gattcgagaa ttgaccaatg attataatca acagcgacaa agtcggactc aaacatatca
17881 ctttattcgt actgcaaaag gacgaatcac aaaactagtc aatgattatt taaaattctt
17941 tcttattgta caagcattaa aacataatgg gacatggcaa gctgagttta agaaattacc
18001 agagttgatt agtgtgtgca ataggttcta tcatattaga gattgtaatt gtgaagaacg
18061 tttcttagtt caaaccttat atttacatag aatgcaggat tctgaagtta agcttatcga
18121 aaggctgaca gggcttctga gtttatttcc agatggtctc tacaggttcg attgaataac
18181 cgtgcatagt attttgatac ttgtaaaggt tggttatcaa catacagatt ataaaaaact
18241 cataaattgc tctcatacat catcttgatc tgatttcaat aaataactat ttagataacg
18301 aaaggagtcc ttacattata cactatattt ggcctctctc cctgcgtgat aatcaaaaaa
18361 ttcacaatac agcatgtgtg acatattact gctgcaatga gtctaacgca acataataaa
18421 ctccgcactc tttataatta agctttaacg ataggtctgg gctcatattg ttattgatat
18481 agtaatgttg tatcaatatc ttgccagatg gaatagtgct ttggttgata acacgacttc
18541 ttaaaacaaa actgatcttt aagattaagt tttttataat tgtcattgct ttaatttgtc
18601 gatttaaaaa tggtgatagc cttaatcttt gtgtaaaata agagattagg tgtaataact
18661 ttaacatttt tgtctagtaa gctactattc cattcagaat gataaaatta aaagaaaaga
18721 catgactgta aaatcagaaa taccttcttt acaatatagc agactagata ataatcttcg
18781 tgttaatgat aattaaggca ttgaccacgc tcatcagaag gctcactaga ataaacgttg
18841 caaaaaggat ccctggaaaa atggtcgcac acaaaaattt aaaaataaat ctatttcttc
18901 ttttttgtgt gtcc

PostPosted: Tue Jul 29, 2014 10:25 am 
LOCUS KM233039 18953 bp cRNA linear VRL 25-JUL-2014
DEFINITION Zaire ebolavirus strain EBOV_5, partial genome.
ACCESSION KM233039
VERSION KM233039.1 GI:667852531
KEYWORDS .
SOURCE Zaire ebolavirus (ZEBOV)
ORGANISM Zaire ebolavirus
Viruses; ssRNA negative-strand viruses; Mononegavirales;
Filoviridae; Ebolavirus.
REFERENCE 1 (bases 1 to 18953)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Deep sequencing analysis of Ebola virus transmission in Sierra
Leone
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 18953)
AUTHORS Goba,A., Momoh,M., Fullah,M., Kanneh,L., Jalloh,S., Khan,H.,
Gevao,S., Gire,S., Andersen,K., Wohl,S., Park,D., Sealfon,R.,
Matranga,C., Malboeuf,C., Gladden,A., Qu,J., Yang,X., Winnicki,S.,
Chapman,S., Happi,C., Garry,R. and Sabeti,P.
CONSRTM Viral Hemorrhagic Fever Consortium
TITLE Direct Submission
JOURNAL Submitted (25-JUL-2014) Infectious Disease Initiative, Broad
Institute of MIT and Harvard, 75 Ames St., Cambridge, MA 02142, USA
COMMENT ##Assembly-Data-START##
Assembly Method :: Novoalign v. v.3
Sequencing Technology :: Illumina; Nextera
##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..18953
/organism="Zaire ebolavirus"
/mol_type="viral cRNA"
/strain="EBOV_5"
/host="Homo sapiens"
/db_xref="taxon:186538"
/country="Sierra Leone"
/collection_date="03-Jun-2014"
gene 50..3020
/gene="NP"
mRNA 50..3020
/gene="NP"
/product="nucleoprotein"
misc_signal 50..61
/gene="NP"
/note="putative transcription start signal"
CDS 464..2683
/gene="NP"
/note="encapsidation of genomic RNA"
/codon_start=1
/product="nucleoprotein"
/protein_id="AIG95920.1"
/db_xref="GI:667852532"
/translation="MDSRPQKVWMTPSLTESDMDYHKILTAGLSVQQGIVRQRVIPVY
QVNNLEEICQLIIQAFEAGVDFQESADSFLLMLCLHHAYQGDYKLFLESGAVKYLEGH
GFRFEVKKCDGVKRLEELLPAVSSGRNIKRTLAAMPEEETTEANAGQFLSFASLFLPK
LVVGEKACLEKVQRQIQVHAEQGLIQYPTAWQSVGHMMVIFRLMRTNFLIKFLLIHQG
MHMVAGHDANDAVISNSVAQARFSGLLIVKTVLDHILQKTERGVRLHPLARTAKVKNE
VNSFKAALSSLAKHGEYAPFARLLNLSGVNNLEHGLFPQLSAIALGVATAHGSTLAGV
NVGEQYQQLREAATEAEKQLQQYAESRELDHLGLDDQEKKILMNFHQKKNEISFQQTN
AMVTLRKERLAKLTEAITAASLPKTSGHYDDDDDIPFPGPINDDDNPGHQDDDPTDSQ
DTTIPDVVVDPDDGGYGEYQSYSENGMSAPDDLVLFDLDEDDEDTKPVPNRSTKGGQQ
KNSQKGQHTEGRQTQSTPTQNVTGPRRTIHHASAPLTDNDRRNEPSGSTSPRMLTPIN
EEADPLDDADDETSSLPPLESDDEEQDRDGTSNRTPTVAPPAPVYRDHSEKKELPQDE
QQDQDHIQEARNQDSDNTQPEHSFEEMYRHILRSQGPFDAVLYYHMMKDEPVVFSTSD
GKEYTYPDSLEEEYPPWLTEKEAMNDENRFVTLDGQQFYWPVMNHRNKFMAILQHHQ"
polyA_signal 3009..3020
/gene="NP"
gene 3026..4401
/gene="VP35"
mRNA 3026..4401
/gene="VP35"
/product="VP35 matrix protein"
misc_signal 3026..3037
/gene="VP35"
/note="putative transcription start signal"
CDS 3123..4145
/gene="VP35"
/note="polymerase complex protein"
/codon_start=1
/product="VP35 matrix protein"
/protein_id="AIG95921.1"
/db_xref="GI:667852533"
/translation="MTTRTKGRGHTVATTQNDRMPGPELSGWISEQLMTGRIPVNDIF
CDIENNPGLCYASQMQQTKPNPKMRNSQTQTDPICNHSFEEVVQTLASLATVVQQQTI
ASESLEQRITSLENGLKPVYDMAKTISSLNRVCAEMVAKYDLLVMTTGRATATAAATE
AYWAEHGQPPPGPSLYEESAIRGKIESRDETVPQSVREAFNNLDSTTSLTEENFGKPD
ISAKDLRNIMYDHLPGFGTAFHQLVQVICKLGKDSNSLDIIHAEFQASLAEGDSPQCA
LIQITKRVPIFQDAAPPVIHIRSRGDIPRACQKSLRPVPPSPKIDRGWVCVFQLQDGK
TLGLKI"
gene 4384..5888
/gene="VP40"
mRNA 4384..5888
/gene="VP40"
/product="matrix protein"
misc_signal 4384..4395
/gene="VP35"
/note="transcription start signal"
polyA_signal 4391..4401
/gene="VP35"
CDS 4473..5453
/gene="VP40"
/codon_start=1
/product="matrix protein"
/protein_id="AIG95922.1"
/db_xref="GI:667852534"
/translation="MRRVILPTAPPEYMEAIYPARSNSTIARGGNSNTGFLTPESVNG
DTPSNPLRPIADDTIDHASHTPGSVSSAFILEAMVNVISGPKVLMKQIPIWLPLGVAD
QKTYSFDSTTAAIMLASYTITHFGKATNPLVRVNRLGPGIPDHPLRLLRIGNQAFLQE
FVLPPVQLPQYFTFDLTALKLITQPLPAATWTDDTPTGSNGALRPGISFHPKLRPILL
PNKSGKKGNSADLTSPEKIQAIMTSLQDFKIVPIDPTKNIMGIEVPETLVHKLTGKKV
TSKNGQPIIPVLLPKYIGLDPVAPGDLTMVITQDCDTCHSPASLPAVVEK"
polyA_signal 5877..5888
/gene="VP40"
gene 5894..8299
/gene="GP"
mRNA 5894..8299
/gene="GP"
/product="ssGP"
/note="unedited mRNA"
misc_signal 5894..5905
/gene="GP"
/note="putative transcription start signal"
CDS join(6033..6917,6917..8062)
/gene="GP"
/ribosomal_slippage
/note="additional a residue inserted during transcription;
encodes two disulfide linked subunits GP1 and GP2;
receptor binding and fusion"
/codon_start=1
/product="virion spike glycoprotein precursor"
/protein_id="AIG95923.1"
/db_xref="GI:667852535"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKNLTRKIRSEELSFTAVSNGPKNISGQSPARTSSDPETNT
TNEDHKIMASENSSAMVQVHSQGRKAAVSHLTTLATISTSPQPPTTKTGPDNSTHNTP
VYKLDISEATQVGQHHRRADNDSTASDTPPATTAAGPLKAENTNTSKSADSLDLATTT
SPQNYSETAGNNNTHHQDTGEESASSGKLGLITNTIAGVAGLITGGRRTRREVIVNAQ
PKCNPNLHYWTTQDEGAAIGLAWIPYFGPAAEGIYTEGLMHNQDGLICGLRQLANETT
QALQLFLRATTELRTFSILNRKAIDFLLQRWGGTCHILGPDCCIEPHDWTKNITDKID
QIIHDFVDKTLPDQGDNDNWWTGWRQWIPAGIGVTGVIIAVIALFCICKFVF"
CDS 6033..7127
/gene="GP"
/note="small non-structural secreted glycoprotein; sGP
secreted as an anti-parallel oriented homodimer"
/codon_start=1
/product="sGP"
/protein_id="AIG95924.1"
/db_xref="GI:667852536"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKTSLEKFAVKSCLSQLYQTDPKTSVVRVRRELLPTQRPTQ
QMKTTKSWLQKIPLQWFKCTVKEGKLQCRI"
CDS join(6033..6917,6919..6927)
/gene="GP"
/ribosomal_slippage
/note="second non-structural secreted glycoprotein;
secreted in a monomeric form; one a residue is deleted or
two additional a residues are inserted at the editing site
during transcription of the GP gene"
/codon_start=1
/product="ssGP"
/protein_id="AIG95925.1"
/db_xref="GI:667852537"
/translation="MGVTGILQLPRDRFKRTSFFLWVIILFQRTFSIPLGVIHNSTLQ
VSDVDKLVCRDKLSSTNQLRSVGLNLEGNGVATDVPSVTKRWGFRSGVPPKVVNYEAG
EWAENCYNLEIKKPDGSECLPAAPDGIRGFPRCRYVHKVSGTGPCAGDFAFHKEGAFF
LYDRLASTVIYRGTTFAEGVVAFLILPQAKKDFFSSHPLREPVNATEDPSSGYYSTTI
RYQATGFGTNETEYLFEVDNLTYVQLESRFTPQFLLQLNETIYASGKRSNTTGKLIWK
VNPEIDTTIGEWAFWETKKPH"
gene 8282..9734
/gene="VP30"
mRNA 8282..9734
/gene="VP30"
/product="VP30 minor nucleoprotein"
misc_signal 8282..8293
/gene="VP30"
/note="putative transcription start signal"
polyA_signal 8289..8299
/gene="VP30"
CDS 8503..9369
/gene="VP30"
/note="minor nucleoprotein; polymerase complex protein"
/codon_start=1
/product="VP30 minor nucleoprotein"
/protein_id="AIG95926.1"
/db_xref="GI:667852538"
/translation="MEASYERGRPRAARQHSRDGHDHHVRARSSSRENYRGEYRQSRS
ASQVRVPTVFHKKRVEPLTVPPAPKDICPTLKKGFLCDSSFCKKDHQLESLTDRELLL
LIARKTCGSVEQQLNITAPKDSRLANPTADDFQQEEGPKITLLTLIKTAEHWARQDIR
TIEDSKLRALLTLCAVMTRKFSKSQLSLLCETHLRREGLGQDQAEPVLEVYQRLHSDK
GGSFEAALWQQWDRQSLIMFITAFLNIALQLPCESSAVVVSGLRTLVPQSDNEEASTN
PGTCSWSDEGTP"
polyA_signal 9724..9734
/gene="VP30"
/note="putative"
gene 9879..11512
/gene="VP24"
/note="putative"
mRNA 9879..11490
/gene="VP24"
/product="VP24 membrane-associated protein"
misc_signal 9879..9890
/gene="VP24"
/note="transcription start signal"
CDS 10339..11094
/gene="VP24"
/note="membrane-associated protein"
/codon_start=1
/product="VP24 membrane-associated protein"
/protein_id="AIG95927.1"
/db_xref="GI:667852539"
/translation="MAKATGRYNLISPKKDLEKGVVLSDLCNFLVSQTIQGWKVYWAG
IEFDVTHKGMALLHRLKTNDFAPAWSMTRNLFPHLFQNPNSTIESPLWALRVILAAGI
QDQLIDQSLIEPLAGALGLISDWLLTTNTNHFNMRTQRVKEQLSLKMLSLIRSNILKF
INKLDALHVVNYNGLLSSIEIGTQNHTIIITRTNMGFLVELQEPDKSAMNRKKPGPAK
FSLLHESTLKAFTQGSSTRMQSLILEFNSSLAI"
polyA_signal 11479..11490
/gene="VP24"
/note="putative"
gene 11495..18276
/gene="L"
mRNA 11495..18276
/gene="L"
/product="polymerase"
misc_signal 11495..11506
/gene="VP24"
/note="transcription start signal"
polyA_signal 11502..11512
/gene="VP24"
/note="putative"
CDS 11575..18213
/gene="L"
/note="polymerase; synthesis of viral RNAs;
transcriptional RNA editing"
/codon_start=1
/product="polymerase"
/protein_id="AIG95928.1"
/db_xref="GI:667852540"
/translation="MATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNC
KLPKHIYRLKYDVTVTKFLSDVPVATLPIDFIVPILLKALSGNGFCPVEPRCQQFLDE
IIKYTMQDALFLKYYLKNVGAQEDCVDDHFQEKILSSIQGNEFLHQMFFWYDLAILTR
RGRLNRGNSRSTWFVHDDLIDILGYGDYVFWKIPISLLPLNTQGIPHAAMDWYQTSVF
KEAVQGHTHIVSVSTADVLIMCKDLITCRFNTTLISKIAEVEDPVCSDYPNFKIVSML
YQSGDYLLSILGSDGYKIIKFLEPLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEI
TEIRALKPSQAHKIREFHRTLIRLEMTPQQLCELFSIQKHWGHPVLHSETAIQKVKKH
ATVLKALRPIVIFETYCVFKYSIAKHYFDSQGSWYSVTSDRNLTPGLNSYIKRNQFPP
LPMIKELLWEFYHLDHPPLFSTKIISDLSIFIKDRATAVERTCWDAVFEPNVLGYNPP
HKFSTKRVPEQFLEQENFSIENVLSYAQKLEYLLPQYRNFSFSLKEKELNVGRTFGKL
PYPTRNVQTLCEALLADGLAKAFPSNMMVVTEREQKESLLHQASWHHTSDDFGEHATV
RGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKNVFNWMHYTIPQCYMHVSDYYNP
PHNLTLENRNNPPEGPSSYRGHMGGIEGLQQKLWTSISCAQISLVEIKTGFKLRSAVM
GDNQCITVLSVFPLETDAGEQEQSAEDNAARVAASLAKVTSACGIFLKPDETFVHSGF
IYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASIGTAFERSISETRHIFP
CRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTISLALAVPQVLGGLSF
LNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNCTAIDFVLNPSGL
NVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWLLSSTPVMSRF
AADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQRWSLWFSY
LDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFKVVWLKP
YEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTEDKIGQ
PAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTPSH
YSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI
NYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENE
LIYDNNPLKGGLNCNISFDNPFFQGKQLNIIEDDLIRLPHLSGWELAKTIMQSIISDS
NNSSTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYY
LTTQIHNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLR
TSISSFLTFVKEWIINRGTIVPLWIVYPLEGQNPTPVNNFLHQIVELLVHDSSRHQAF
KTTINDHVHPHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKDLTRNSSTGSSTNNS
DGHIKRSQEQTTRDPHDGTERSLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGT
ANPKLNFDRSRHNVKSQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQ
DEISKYLRQLRSVIDTTVYCRFTGIVSSMHYKLDEVLWEIENFKSAVTLAEGEGAGAL
LLIQKYQVKTLFFNTLATESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQ
ITDITNPTWFKDQRARLPRQVEVITMDAETTENINRSKLYEAVHKLILHHVDPSVLKA
VVLKVFLSDTEGMLWLNDNLAPFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPH
QNHLSCKQVILTALQLQIQRSPYWLSHLTQYADCDLHLSYIRLGFPSLEKVLYHRYNL
VDSKRGPLVSVTQHLAHLRAEIRELTNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYL
KFFLIVQALKHNGTWQAEFKKLPELISVCNRFYHIRDCNCEERFLVQTLYLHRMQDSE
VKLIERLTGLLSLFPDGLYRFD"
polyA_signal 18266..18276
/gene="L"
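
For anyone checking the CDS coordinates in the record above by hand, here is a minimal Python sketch of how the location strings can be read. It assumes GenBank coordinates are 1-based and inclusive, handles only plain `start..end` spans and `join(...)` of such spans, and assumes each CDS ends in a stop codon. The function names (`parse_location`, `coding_length`, `expected_protein_length`) are made up for this illustration and are not part of any library. Note that the GP `join(6033..6917,6917..8062)` lists position 6917 in both segments, which appears to be how the record represents the extra "a" residue described in the /note.

```python
import re


def parse_location(location):
    """Return (start, end) pairs, 1-based and inclusive, for a simple
    location such as '464..2683' or 'join(6033..6917,6917..8062)'."""
    text = location.strip()
    if text.startswith("join(") and text.endswith(")"):
        text = text[len("join("):-1]
    spans = []
    for part in text.split(","):
        match = re.fullmatch(r"\s*(\d+)\.\.(\d+)\s*", part)
        if match is None:
            # Complement, fuzzy ends (<, >) etc. are outside this sketch.
            raise ValueError(f"unsupported location segment: {part!r}")
        start, end = int(match.group(1)), int(match.group(2))
        if end < start:
            raise ValueError(f"end before start in segment: {part!r}")
        spans.append((start, end))
    return spans


def coding_length(location):
    """Total nucleotides covered by the location, counting both ends."""
    return sum(end - start + 1 for start, end in parse_location(location))


def expected_protein_length(location):
    """Residue count implied by the location, assuming the final codon
    is a stop codon (an assumption, not something the location states)."""
    nucleotides = coding_length(location)
    if nucleotides % 3 != 0:
        raise ValueError(f"{location} covers {nucleotides} nt, not a multiple of 3")
    return nucleotides // 3 - 1


# CDS locations copied from the EBOV_5 (KM233039) feature table above.
cds_locations = {
    "NP": "464..2683",
    "GP": "join(6033..6917,6917..8062)",
    "sGP": "6033..7127",
    "ssGP": "join(6033..6917,6919..6927)",
    "L": "11575..18213",
}

for name, location in cds_locations.items():
    print(name, coding_length(location), "nt,",
          expected_protein_length(location), "aa expected")
```

The residue counts printed here can be compared against the /translation strings in the record; for sGP and ssGP they come out to 364 and 297, which matches the length of the translations as posted.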
