[Bioperl-l] problem parsing FASTA output - bug or my fault?
Aidan Budd
budd at embl-heidelberg.de
Thu Apr 26 10:18:11 UTC 2007
Hi Bioperlers,
I'm trying to parse a FASTA search output file (see the attached .out file)
using Bioperl 1.4. My Bioperl installation has otherwise been working
fine, but I get the following error when running a simple
script that tries to access results from this output file via Bioperl.
Is this a problem with the parser?
Or have I run FASTA incorrectly, creating output that the parser
doesn't cover?
Any suggestions on how to deal with this would be much appreciated.
Best wishes,
Aidan
Script:
#!/usr/bin/perl -w
$^W=1;
use strict;
use Bio::SearchIO;
my $fasta_report = new Bio::SearchIO ('-format' => 'fasta',
                                      '-file'   => $ARGV[0]);
my $result = $fasta_report->next_result();
Errors:
Use of uninitialized value in concatenation (.) or string at /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/GenericHSP.pm line 231, <GEN3> line 47.
------------- EXCEPTION -------------
MSG: Did not specify a Query End or Query Begin -verbose 0 -algorithm
FASTP -score 62.4 -hit_frame 0 -hsp_length 180 -hit_seq -hit_length 0
-query_length 128 -query_frame 0 -swscore 122 -rank 1 -query_seq
GTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASS--PALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQ-NKTLIEELKALKD-LYCHKSD
-homology_seq
MEMTDFELTSNSQ.NL.IPTNFK.TLP.RKRAKTK..KEQR.IE.ILR..R..HQS.E..RLHLQY..RKCSL...LL.SVNL.K.ADHE.A.T.SHDAFVASLDEYRDFQSTRGASLDTRASSHSSSDTFTPSPLNCTMEPATLSPKSMR
-hit_name YFL031W -bits 19.4 -query_name CREB1_MONKEY -evalue 1.1 (qs='
STACK Bio::Search::HSP::GenericHSP::new /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/GenericHSP.pm:231
STACK Bio::Search::HSP::FastaHSP::new /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Search/HSP/FastaHSP.pm:97
STACK Bio::Factory::ObjectFactory::create_object /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/Factory/ObjectFactory.pm:150
STACK Bio::SearchIO::SearchResultEventBuilder::end_hsp /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/SearchResultEventBuilder.pm:275
STACK Bio::SearchIO::fasta::end_element /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/fasta.pm:872
STACK Bio::SearchIO::fasta::next_result /Users/budd/perl_modules/bioperl_1_4/bioperl-1.4/Bio/SearchIO/fasta.pm:403
STACK toplevel /Users/budd/scripts/test_scripts/test_parsing_fasta_output.pl:22
--------------------------------------
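The exception complains about a missing query begin or end, yet the report itself lists query and hit coordinates on each hit's overlap line. The following is a minimal sketch, independent of Bioperl, that reads them with a regular expression. It assumes the overlap field has the form (query_start-query_end:hit_start-hit_end), as in the attached report; parse_overlap is a hypothetical helper written for illustration, not a Bioperl function, and this is not how the Bioperl parser itself works.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Bioperl): returns
# (query_start, query_end, hit_start, hit_end) taken from a FASTA
# "overlap (a-b:c-d)" field, or an empty list if the line has none.
sub parse_overlap {
    my ($line) = @_;
    if ($line =~ /overlap\s+\((\d+)-(\d+):(\d+)-(\d+)\)/) {
        return ($1, $2, $3, $4);
    }
    return;
}

# Overlap line copied from the first hit (YFL031W) in the attached report.
my $line = 'Smith-Waterman score: 122; 27.660% identity (63.830% similar) '
         . 'in 94 aa overlap (248-337:2-95)';

my ($q_start, $q_end, $h_start, $h_end) = parse_overlap($line);
print "query $q_start-$q_end, hit $h_start-$h_end\n";
```

Running the helper over every "Smith-Waterman score" line of the .out file is a quick way to check that the coordinates are present in the report for every hit.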
--
----------------------------------------------------------------------
Aidan Budd, PhD tel:+49 (0)6221 387 8530
EMBL - European Molecular Biology Laboratory fax:+49 (0)6221 387 8517
Meyerhofstr. 1, 69117 Heidelberg, Germany
URL: http://www-db.embl.de/jss/EmblGroupsHD/per_1807.html
-------------- next part --------------
# fasta34 -m 2 creb1_human.fasta yeast_bzips_from_ensembl.fasta
FASTA searches a protein or DNA sequence data bank
version 34.26 January 12, 2007
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query library creb1_human.fasta vs yeast_bzips_from_ensembl.fasta library
searching yeast_bzips_from_ensembl.fasta library
1>>>CREB1_MONKEY 341 aa - 341 aa
vs yeast_bzips_from_ensembl.fasta library
3683 residues in 10 sequences
MLE_cen statistics: Lambda= 0.0338; K=8.757e-05 (cen=0)
FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
join: 37, opt: 25, open/ext: -10/-2, width: 16
Scan time: 0.000
The best scores are:        opt  bits  E(10)
YFL031W            ( 238)   122  19.4    1.1
YEL009C            ( 281)   121  19.4    1.3
YIL036W            ( 587)   129  19.8    2
YIR017C            ( 187)    83  17.5    2.9
YVNL167C           ( 647)   119  19.3    2.9
YIR018W            ( 245)    67  16.7    5.3
YER045C            ( 489)    73  17.0    7.1
YDR259C            ( 383)    62  16.5    7.5
YOR028C            ( 296)    41  15.5    8.9
YHL009C            ( 330)    33  15.1    9.6
>>YFL031W (238 aa)
initn: 107 init1: 107 opt: 122 Z-score: 62.4 bits: 19.4 E(): 1.1
Smith-Waterman score: 122; 27.660% identity (63.830% similar) in 94 aa overlap (248-337:2-95)
220 230 240 250 260 270
CREB1_ GTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQIRTAPTSTIAPGVVMASS--PALP
YFL031 MEMTDFELTSNSQ.NL.IPTNFK.TLP.RKR
280 290 300 310 320 330
CREB1_ TQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQ-NKTLIEELKALKD
YFL031 AKTK..KEQR.IE.ILR..R..HQS.E..RLHLQY..RKCSL...LL.SVNL.K.ADHE.
340
CREB1_ -LYCHKSD
YFL031 A.T.SHDAFVASLDEYRDFQSTRGASLDTRASSHSSSDTFTPSPLNCTMEPATLSPKSMR
>>YEL009C (281 aa)
initn: 138 init1: 83 opt: 121 Z-score: 60.8 bits: 19.4 E(): 1.3
Smith-Waterman score: 121; 29.412% identity (55.462% similar) in 119 aa overlap (219-335:165-277)
190 200 210 220 230 240
CREB1_ GAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASGD
YEL009 VSLADKAIESTEEVSLVPSNLEVSTTSFLP.PV.ED.KL.QTRKVKK.NS--..KKSHHV
250 260 270 280 290 300
CREB1_ VQTYQIRTAPTSTIAPGVVMASSPALPTQP--AEEAARKREVRLMKNREAARECRRKKKE
YEL009 GKDDES.LDHLGVV.YNRKQR.I.LS.IV.ESSDP..L..----AR.T....RS.AR.LQ
310 320 330 340
CREB1_ YVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD
YEL009 RM.Q..DK.EE.LSK.YH.EN.VAR..K.VGER
>>YIL036W (587 aa)
initn: 132 init1: 70 opt: 129 Z-score: 57.2 bits: 19.8 E(): 2
Smith-Waterman score: 129; 18.750% identity (55.682% similar) in 352 aa overlap (2-335:137-477)
10 20
CREB1_ MTMESGAENQQSGDAAVTEAENQQM--TVQA
YIL036 RVVKPSANSNYQQAAYLRQQQQQDQRQQSPS.KTEE.S.LY..ILMNSGVV.D.HQNLAT
30 40 50 60 70 80
CREB1_ QPQIATLAQVSMPAAHATSSAPTVTLVQLPNGQTVQVHGVIQAAQPSVIQSPQVQTVQSS
YIL036 HTNLSQ.SSTRKS.PNDSTT...-NASNIA.--.AS.NKQMYFMNMNMNNN.HALNDP.I
90 100 110 120 130 140
CREB1_ CKDLKRLFS--GTQISTIAESEDS--QESVDSVTDSQKRREILSRRPSYRKILNDL----
YIL036 LET.SPF.QPF.VDVAHLPMTNPPIF.S.LPGCDEPIR..R.SISNGQISQLGE.IETLE
150 160 170 180 190
CREB1_ ---SSDAPGVPRIEEEKSEEET---SAPAITTVTVP-TPIYQTSSGQYIAITQGGAIQLA
YIL036 NLHNTQP.PM.NFHNYNGLSQ.RNV.NKPVFNQA..VSS.P.YNAKKV.NP.KDS.--.G
200 210 220 230 240 250
CREB1_ NNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASGDVQTYQI
YIL036 DQSVIYSKSQ.RNFVNAPSKNT.AES.----SDLE.MTTFA.TTGGENRGK.ALRESHSN
260 270 280 290 300 310
CREB1_ RT-APTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLEN
YIL036 PSFT.K.QGSHLNLA.NTQGN.I-.GT-T.W..ARL.ER..I..SK..QR..VAQLQ.QK
320 330 340
CREB1_ RVAVLENQNKTLIEELKALKDLYCHKSD
YIL036 EFNEIKDE.RI.LKK.NYYEK.ISKFKKFSKIHLREHEKLNKDSDNNVNGTNSSNKNESM
>>YIR017C (187 aa)
initn: 43 init1: 43 opt: 83 Z-score: 54.0 bits: 17.5 E(): 2.9
Smith-Waterman score: 84; 22.785% identity (56.962% similar) in 158 aa overlap (176-330:9-148)
150 160 170 180 190 200
CREB1_ PGVPRIEEEKSEEETSAPAITTVTVPTPIYQTSSGQYIAITQGGAIQLANNGTDGVQGLQ
YIR017 MSAKQGWEKK.TNID..SRK.MNV---..LSEHL.N.I
210 220 230 240 250 260
CREB1_ TLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVVQAASG-DVQTYQIRTAPTS--TI
YIR017 S------SDSEL.SRL.SLLLVSS.N-----AEELISMINN.Q..SQFKKLRE.RKGKVA
270 280 290 300 310 320
CREB1_ APGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVLENQN
YIR017 .TTA.VVKEEEA.VSTSN.LDKIKQE.RR..T..SQRF.IR..Q--.NF..-MNK.Q.L.
330 340
CREB1_ KTLIEELKALKDLYCHKSD
YIR017 -.Q.NK.RDRIEQLNKENEFWKAKLNDINEIKSLKLLNDIKRRNMGR
>>YVNL167C (647 aa)
initn: 142 init1: 119 opt: 119 Z-score: 53.8 bits: 19.3 E(): 2.9
Smith-Waterman score: 119; 39.623% identity (62.264% similar) in 53 aa overlap (280-332:426-478)
250 260 270 280 290 300
CREB1_ QTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVK
YVNL16 RKNSAVTTAPAQKDDVENNKISNNVTLDEN..QE...KEF.ER..V..SKF.KR....I.
310 320 330 340
CREB1_ CLENRVAVLENQNKTLIEELKALKDLYCHKSD
YVNL16 KI..DLQFY.SEYDD.TQVIGK.CGIIPSSSSNSQFNVNVSTPSSSSPPSTSLIALLESS
>>YIR018W (245 aa)
initn: 61 init1: 61 opt: 67 Z-score: 47.6 bits: 16.7 E(): 5.3
Smith-Waterman score: 67; 25.455% identity (61.818% similar) in 55 aa overlap (280-334:55-109)
250 260 270 280 290 300
CREB1_ QTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVK
YIR018 SKNWKLPPRLPHRAAQRRKRVHRLHEDYET..NDEELQKKKRQ..D.Q.AY.ER.NNKLQ
310 320 330 340
CREB1_ CLENRVAVLENQNKTLIEELKALKDLYCHKSD
YIR018 V..ETIES.SKVV.NYETK.NR.QNELQAKESENHALKQKLETLTLKQASVPAQDPILQN
>>YER045C (489 aa)
initn: 111 init1: 70 opt: 73 Z-score: 43.8 bits: 17.0 E(): 7.1
Smith-Waterman score: 97; 22.826% identity (67.391% similar) in 92 aa overlap (3-92:210-300)
10 20 30
CREB1_ MTMESGAENQQSGDAAVTEAE-NQQMTVQAQP
YER045 QTGSKNIYAAMTPYDSNIKLNIPAVAATCDIP.ATPSIP...STMNQ.YI.M.LRL...M
40 50 60 70 80 90
CREB1_ QIATLAQVSMPAAHATSSAPTVTLVQLPNGQTVQVHGV-IQAAQPSVIQSPQVQTVQSSC
YER045 .TKAWKNAQL-NV.PCTP.SNSSVSSSSSC.NIND.NIEN.SVHS.ISHGVNHH..NN..
100 110 120 130 140 150
CREB1_ KDLKRLFSGTQISTIAESEDSQESVDSVTDSQKRREILSRRPSYRKILNDLSSDAPGVPR
YER045 QNAELNISSSLPYESKCPDVNLTHANSKPQYKDATSALKNNINSEKDVHTAPFSSMHTTA
>>YDR259C (383 aa)
initn: 84 init1: 52 opt: 62 Z-score: 42.8 bits: 16.5 E(): 7.5
Smith-Waterman score: 81; 33.333% identity (64.583% similar) in 48 aa overlap (289-330:227-274)
260 270 280 290 300 310
CREB1_ TSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVL
YDR259 NDNNDNVTKPVPDKDTQLISSSGKTLRNTR.AAQ..T.QKAF.QR.EK.I.N..QKSKIF
320 330 340
CREB1_ -----ENQN-KTLIEELKALKDLYCHKSD
YDR259 DDLLA..N.F.S.NDS.RNDNNILIAQHEAIRNAITMLRSEYDVLCNENNMLKNENSIIK
>>YOR028C (296 aa)
initn: 35 init1: 35 opt: 41 Z-score: 39.3 bits: 15.5 E(): 8.9
Smith-Waterman score: 80; 33.962% identity (66.038% similar) in 53 aa overlap (289-334:243-295)
260 270 280 290 300 310
CREB1_ TSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRKKKEYVKCLENRVAVL
YOR028 LSEQVFNEGERYNNDGQLIGKTGKPLRNTK.AAQ..S.QKAF.QRREK.I.N..EKSKLF
320 330 340
CREB1_ -----ENQN-KTLIEELKA-LKDLYCHKSD
YOR028 DGLMK..SEL.KM..S..SK..E*
>>YHL009C (330 aa)
initn: 33 init1: 33 opt: 33 Z-score: 36.4 bits: 15.1 E(): 9.6
Smith-Waterman score: 91; 21.667% identity (57.500% similar) in 120 aa overlap (222-333:79-194)
200 210 220 230 240
CREB1_ QLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQI-LVP-----SNQVVVQAA
YHL009 EQTAPFPILEDQCPALNLDRSNNDLLLQNNISFPKGS.L.A.Q.T.ISGDY.TY.MADNN
250 260 270 280 290 300
CREB1_ SGDVQTYQIRT--APTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRRK
YHL009 NN.NDS.SNTNYFSKNNG.S.SSRSP.VAHNENV.DDSK.K.KA----Q..A.QKAF.ER
310 320 330 340
CREB1_ KKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD
YHL009 .EARM.E.QDKLLES.RNRQS.LK.IEE.RKANTEINAENRLLLRSGNENFSKDIEDDTN
341 residues in 1 query sequences
3683 residues in 10 library sequences
Scomplib [34.26]
start: Thu Apr 26 11:52:16 2007 done: Thu Apr 26 11:52:16 2007
Total Scan time: 0.000 Total Display time: 0.010
Function used was FASTA [version 34.26 January 12, 2007]
-------------- next part --------------
>CREB1_MONKEY
MTMESGAENQQSGDAAVTEAENQQMTVQAQPQIATLAQVSMPAAHATSSAPTVTLVQLPN
GQTVQVHGVIQAAQPSVIQSPQVQTVQSSCKDLKRLFSGTQISTIAESEDSQESVDSVTD
SQKRREILSRRPSYRKILNDLSSDAPGVPRIEEEKSEEETSAPAITTVTVPTPIYQTSSG
QYIAITQGGAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQV
VVQAASGDVQTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAAREC
RRKKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD
-------------- next part --------------
>YIL036W
MFTGQEYHSVDSNSNKQKDNNKRGIDDTSKILNNKIPHSVSDTSAAATTTSTMNNSALSR
SLDPTDINYSTNMAGVVDQIHDYTTSNRNSLTPQYSIAAGNVNSHDRVVKPSANSNYQQA
AYLRQQQQQDQRQQSPSMKTEEESQLYGDILMNSGVVQDMHQNLATHTNLSQLSSTRKSA
PNDSTTAPTNASNIANTASVNKQMYFMNMNMNNNPHALNDPSILETLSPFFQPFGVDVAH
LPMTNPPIFQSSLPGCDEPIRRRRISISNGQISQLGEDIETLENLHNTQPPPMPNFHNYN
GLSQTRNVSNKPVFNQAVPVSSIPQYNAKKVINPTKDSALGDQSVIYSKSQQRNFVNAPS
KNTPAESISDLEGMTTFAPTTGGENRGKSALRESHSNPSFTPKSQGSHLNLAANTQGNPI
PGTTAWKRARLLERNRIAASKCRQRKKVAQLQLQKEFNEIKDENRILLKKLNYYEKLISK
FKKFSKIHLREHEKLNKDSDNNVNGTNSSNKNESMTVDSLKIIEELLMIDSDVTEVDKDT
GKIIAIKHEPYSQRFGSDTDDDDIDLKPVEGGKDPDNQSLPNSEKIK
>YIR017C
MSAKQGWEKKSTNIDIASRKGMNVNNLSEHLQNLISSDSELGSRLLSLLLVSSGNAEELI
SMINNGQDVSQFKKLREPRKGKVAATTAVVVKEEEAPVSTSNELDKIKQERRRKNTEASQ
RFRIRKKQKNFENMNKLQNLNTQINKLRDRIEQLNKENEFWKAKLNDINEIKSLKLLNDI
KRRNMGR
>YVNL167C
MSSEERSRQPSTVSTFDLEPNPFEQSFASSKKALSLPGTISHPSLPKELSRNNSTSTITQ
HSQRSTHSLNSIPEENGNSTVTDNSNHNDVKKDSPSFLPGQQRPTIISPPILTPGGSKRL
PPLLLSPSILYQANSTTNPSQNSHSVSVSNSNPSAIGVSSTSGSLYPNSSSPSGTSLIRQ
PRNSNVTTSNSGNGFPTNDSQMPGFLLNLSKSGLTPNESNIRTGLTPGILTQSYNYPVLP
SINKNTITGSKNVNKSVTVNGSIENHPHVNIMHPTVNGTPLTPGLSSLLNLPSTGVLANP
VFKSTPTTNTTDGTVNNSISNSNFSPNTSTKAAVKMDNPAEFNAIEHSAHNHKENENLTT
QIENNDQFNNKTRKRKRRMSSTSSTSKASRKNSISRKNSAVTTAPAQKDDVENNKISNNV
TLDENEEQERKRKEFLERNRVAASKFRKRKKEYIKKIENDLQFYESEYDDLTQVIGKLCG
IIPSSSSNSQFNVNVSTPSSSSPPSTSLIALLESSISRSDYSSAMSVLSNMKQLICETNF
YRRGGKNPRDDMDGQEDSFNKDTNVVKSENAGYPSVNSRPIILDKKYSLNSGANISKSNT
TTNNVGNSAQNIINSCYSVTNPLVINANSDTHDTNKHDVLSTLPHNN
>YER045C
MDYKHNFATSPDSFLDGRQNPLLYTDFLSSNKELIYKQPSGPGLVDSAYNFHHQNSLHDR
SVQENLGPMFQPFGVDISHLPITNPPIFQSSLPAFDQPVYKRRISISNGQISQLGEDLET
VENLYNCQPPILSSKAQQNPNPQQVANPSAAIYPSFSSNELQNVPQPHEQATVIPEAAPQ
TGSKNIYAAMTPYDSNIKLNIPAVAATCDIPSATPSIPSGDSTMNQAYINMQLRLQAQMQ
TKAWKNAQLNVHPCTPASNSSVSSSSSCQNINDHNIENQSVHSSISHGVNHHTVNNSCQN
AELNISSSLPYESKCPDVNLTHANSKPQYKDATSALKNNINSEKDVHTAPFSSMHTTATF
QIKQEARPQKIENNTAGLKDGAKAWKRARLLERNRIAASKCRQRKKMSQLQLQREFDQIS
KENTMMKKKIENYEKLVQKMKKISRLHMQECTINGGNNSYQSLQNKDSDVNGFLKMIEEM
IRSSSLYDE
>YIR018W
MALPLIKPKESEESHLALLSKIHVSKNWKLPPRLPHRAAQRRKRVHRLHEDYETEENDEE
LQKKKRQNRDAQRAYRERKNNKLQVLEETIESLSKVVKNYETKLNRLQNELQAKESENHA
LKQKLETLTLKQASVPAQDPILQNLIENFKPMKAIPIKYNTAIKRHQHSTELPSSVKCGF
CNDNTTCVCKELETDHRKSDDGVATEQKDMSMPHAECNNKDNPNGLCSNCTNIDKSCIDI
RSIIH
>YHL009C
MTPSNMDDNTSGFMKFINPQCQEEDCCIRNSLFQEDSKCIKQQPDLLSEQTAPFPILEDQ
CPALNLDRSNNDLLLQNNISFPKGSDLQAIQLTPISGDYSTYVMADNNNNDNDSYSNTNY
FSKNNGISPSSRSPSVAHNENVPDDSKAKKKAQNRAAQKAFRERKEARMKELQDKLLESE
RNRQSLLKEIEELRKANTEINAENRLLLRSGNENFSKDIEDDTNYKYSFPTKDEFFTSMV
LESKLNHKGKYSLKDNEIMKRNTQYTDEAGRHVLTVPATWEYLYKLSEERDFDVTYVMSK
LQGQECCHTHGPAYPRSLIDFLVEEATLNE
>YOR028C
MLMQIKMDNHPFNFQPILASHSMTRDSTKPKKMTDTAFVPSPPVGFIKEENKADLHTISV
VASNVTLPQIQLPKIATLEEPGYESRTGSLTDLSGRRNSVNIGALCEDVPNTAGPHIARP
VTINNLIPPSLPRLNTYQLRPQLSDTHLNCHFNSNPYTTASHAPFESSYTTASTFTSQPA
ASYFPSNSTPATRKNSATTNLPSEERRRVSVSLSEQVFNEGERYNNDGQLIGKTGKPLRN
TKRAAQNRSAQKAFRQRREKYIKNLEEKSKLFDGLMKENSELKKMIESLKSKLKE*
>YEL009C
MSEYQPSLFALNPMGFSPLDGSKSTNENVSASTSTAKPMVGQLIFDKFIKTEEDPIIKQD
TPSNLDFDFALPQTATAPDAKTVLPIPELDDAVVESFFSSSTDSTPMFEYENLEDNSKEW
TSLFDNDIPVTTDDVSLADKAIESTEEVSLVPSNLEVSTTSFLPTPVLEDAKLTQTRKVK
KPNSVVKKSHHVGKDDESRLDHLGVVAYNRKQRSIPLSPIVPESSDPAALKRARNTEAAR
RSRARKLQRMKQLEDKVEELLSKNYHLENEVARLKKLVGER
>YDR259C
MQNPPLIRPDMYNQGSSSMATYNASEKNLNEHPSPQIAQPSTSQKLPYRINPTTTNGDTD
ISVNSNPIQPPLPNLMHLSGPSDYRSMHQSPIHPSYIIPPHSNERKQSASYNRPQNAHVS
IQPSVVFPPKSYSISYAPYQINPPLPNGLPNQSISLNKEYIAEEQLSTLPSRNTSVTTAP
PSFQNSADTAKNSADNNDNNDNVTKPVPDKDTQLISSSGKTLRNTRRAAQNRTAQKAFRQ
RKEKYIKNLEQKSKIFDDLLAENNNFKSLNDSLRNDNNILIAQHEAIRNAITMLRSEYDV
LCNENNMLKNENSIIKNEHNMSRNENENLKLENKRFHAEYIRMIEDIENTKRKEQEQRDE
IEQLKKKIRSLEEIVGRHSDSAT
>YFL031W
MEMTDFELTSNSQSNLAIPTNFKSTLPPRKRAKTKEEKEQRRIERILRNRRAAHQSREKK
RLHLQYLERKCSLLENLLNSVNLEKLADHEDALTCSHDAFVASLDEYRDFQSTRGASLDT
RASSHSSSDTFTPSPLNCTMEPATLSPKSMRDSASDQETSWELQMFKTENVPESTTLPAV
DNNNLFDAVASPLADPLCDDIAGNSLPFDNSIDLDNWRNPEAQSGLNSFELNDFFITS