[Biojava-dev] [Bug 2315] New: UniProtFormat fail on long protein sequence.

bugzilla-daemon at portal.open-bio.org bugzilla-daemon at portal.open-bio.org
Mon Jun 11 09:00:02 UTC 2007


http://bugzilla.open-bio.org/show_bug.cgi?id=2315

           Summary: UniProtFormat fail on long protein sequence.
           Product: BioJava
           Version: live (CVS source)
          Platform: PC
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: seq.io
        AssignedTo: biojava-dev at biojava.org
        ReportedBy: miyabe at port4.info


UniProtFormat fails while parsing a long protein sequence (accession Q8WXI7).
The error appears to occur when
org.biojava.bio.seq.io.ChunkedSymbolListFactory renews its symbol buffer.

Here is the stack trace.
--
Exception in thread "main" org.biojava.bio.BioException: Could not read sequence
        at org.biojavax.bio.seq.io.RichStreamReader.nextRichSequence(RichStreamReader.java:113)
        at jp.co.maze.intact.RestructMain.exec(RestructMain.java:90)
        at jp.co.maze.intact.RestructMain.main(RestructMain.java:42)
Caused by: org.biojava.bio.seq.io.ParseException: 

A Exception Has Occurred During Parsing. 
Please submit the details that follow to biojava-l at biojava.org or post a bug
report to http://bugzilla.open-bio.org/ 

Format_object=org.biojavax.bio.seq.io.UniProtFormat
Accession=Q8WXI7
Id=
Comments=
Parse_block=SQ        MLKPSGLPGS SSPTRSLMTG SRSTKATPEM DSGLTGATLS PKTSTGAIVV
TEHTLPFTSP     DKTLASPTSS VVGRTTQSLG VMSSALPEST SRGMTHSEQR TSPSLSPQVN
GTPSRNYPAT     SMVSGLSSPR TRTSSTEGNF TKEASTYTLT VETTSGPVTE KYTVPTETST
TEGDSTETPW     DTRYIPVKIT SPMKTFADST ASKENAPVSM TPAETTVTDS HTPGRTNPSF
GTLYSSFLDL     SPKGTPNSRG ETSLELILST TGYPFSSPEP GSAGHSRIST SAPLSSSASV
LDNKISETSI     FSGQSLTSPL SPGVPEARAS TMPNSAIPFS MTLSNAETSA ERVRSTISSL
GTPSISTKQT     AETILTFHAF AETMDIPSTH IAKTLASEWL GSPGTLGGTS TSALTTTSPS
TTLVSEETNT     HHSTSGKETE GTLNTSMTPL ETSAPGEESE MTATLVPTLG FTTLDSKIRS
PSQVSSSHPT     RELRTTGSTS GRQSSSTAAH GSSDILRATT SSTSKASSWT SESTAQQFSE
PQHTQWVETS     PSMKTERPPA STSVAAPITT SVPSVVSGFT TLKTSSTKGI WLEETSADTL
IGESTAGPTT     HQFAVPTGIS MTGGSSTRGS QGTTHLLTRA TASSETSADL TLATNGVPVS
VSPAVSKTAA     GSSPPGGTKP SYTMVSSVIP ETSSLQSSAF REGTSLGLTP LNTRHPFSSP
EPDSAGHTKI     STSIPLLSSA SVLEDKVSAT STFSHHKATS SITTGTPEIS TKTKPSSAVL
SSMTLSNAAT     SPERVRNATS PLTHPSPSGE ETAGSVLTLS TSAETTDSPN IHPTGTLTSE
SSESPSTLSL     PSVSGVKTTF SSSTPSTHLF TSGEETEETS NPSVSQPETS VSRVRTTLAS
TSVPTPVFPT     MDTWPTRSAQ FSSSHLVSEL RATSSTSVTN STGSALPKIS HLTGTATMSQ
TNRDTFNDSA     APQSTTWPET SPRFKTGLPS ATTTVSTSAT SLSATVMVSK FTSPATSSME
ATSIREPSTT     ILTTETTNGP GSMAVASTNI PIGKGYITEG RLDTSHLPIG TTASSETSMD
FTMAKESVSM     SVSPSQSMDA AGSSTPGRTS QFVDTFSDDV YHLTSREITI PRDGTSSALT
PQMTATHPPS     PDPGSARSTW LGILSSSPSS PTPKVTMSST FSTQRVTTSM IMDTVETSRW
NMPNLPSTTS     LTPSNIPTSG AIGKSTLVPL DTPSPATSLE ASEGGLPTLS TYPESTNTPS
IHLGAHASSE     SPSTIKLTMA SVVKPGSYTP LTFPSIETHI HVSTARMAYS SGSSPEMTAP
GETNTGSTWD     PTTYITTTDP KDTSSAQVST PHSVRTLRTT ENHPKTESAT PAAYSGSPKI
SSSPNLTSPA     TKAWTITDTT EHSTQLHYTK LAEKSSGFET QSAPGPVSVV IPTSPTIGSS
TLELTSDVPG     EPLVLAPSEQ TTITLPMATW LSTSLTEEMA STDLDISSPS SPMSTFAIFP
PMSTPSHELS     KSEADTSAIR NTDSTTLDQH LGIRSLGRTG DLTTVPITPL TTTWTSVIEH
STQAQDTLSA     TMSPTHVTQS LKDQTSIPAS ASPSHLTEVY PELGTQGRSS SEATTFWKPS
TDTLSREIET     GPTNIQSTPP MDNTTTGSSS SGVTLGIAHL PIGTSSPAET STNMALERRS
STATVSMAGT     MGLLVTSAPG RSISQSLGRV SSVLSESTTE GVTDSSKGSS PRLNTQGNTA
LSSSLEPSYA     EGSQMSTSIP LTSSPTTPDV EFIGGSTFWT KEVTTVMTSD ISKSSARTES
SSATLMSTAL     GSTENTGKEK LRTASMDLPS PTPSMEVTPW ISLTLSNAPN TTDSLDLSHG
VHTSSAGTLA     TDRSLNTGVT RASRLENGSD TSSKSLSMGN STHTSMTDTE KSEVSSSIHP
RPETSAPGAE     TTLTSTPGNR AISLTLPFSS IPVEEVISTG ITSGPDINSA PMTHSPITPP
TIVWTSTGTI     EQSTQPLHAV SSEKVSVQTQ STPYVNSVAV SASPTHENSV SSGSSTSSPY
SSASLESLDS     TISRRNAITS WLWDLTTSLP TTTWPSTSLS EALSSGHSGV SNPSSTTTEF
PLFSAASTSA     AKQRNPETET HGPQNTAAST LNTDASSVTG LSETPVGASI SSEVPLPMAI
TSRSDVSGLT     SESTANPSLG TASSAGTKLT RTISLPTSES LVSFRMNKDP WTVSIPLGSH
PTTNTETSIP     VNSAGPPGLS TVASDVIDTP SDGAESIPTV SFSPSPDTEV TTISHFPEKT
THSFRTISSL     THELTSRVTP IPGDWMSSAM STKPTGASPS ITLGERRTIT SAAPTTSPIV
LTASFTETST     VSLDNETTVK TSDILDARKT NELPSDSSSS SDLINTSIAS STMDVTKTAS
ISPTSISGMT     ASSSPSLFSS DRPQVPTSTT ETNTATSPSV SSNTYSLDGG SNVGGTPSTL
PPFTITHPVE     TSSALLAWSR PVRTFSTMVS TDTASGENPT SSNSVVTSVP APGTWASVGS
TTDLPAMGFL     KTSPAGEAHS LLASTIEPAT AFTPHLSAAV VTGSSATSEA SLLTTSESKA
IHSSPQTPTT     PTSGANWETS ATPESLLVVT ETSDTTLTSK ILVTDTILFS TVSTPPSKFP
STGTLSGASF     PTLLPDTPAI PLTATEPTSS LATSFDSTPL VTIASDSLGT VPETTLTMSE
TSNGDALVLK     TVSNPDRSIP GITIQGVTES PLHPSSTSPS KIVAPRNTTY EGSITVALST
LPAGTTGSLV     FSQSSENSET TALVDSSAGL ERASVMPLTT GSQGMASSGG IRSGSTHSTG
TKTFSSLPLT     MNPGEVTAMS EITTNRLTAT QSTAPKGIPV KPTSAESGLL TPVSASSSPS
KAFASLTTAP     PSTWGIPQST LTFEFSEVPS LDTKSASLPT PGQSLNTIPD SDASTASSSL
SKSPEKNPRA     RMMTSTKAIS ASSFQSTGFT ETPEGSASPS MAGHEPRVPT SGTGDPRYAS
ESMSYPDPSK     ASSAMTSTSL ASKLTTLFST GQAARSGSSS SPISLSTEKE TSFLSPTAST
SRKTSLFLGP     SMARQPNILV HLQTSALTLS PTSTLNMSQE EPPELTSSQT IAEEEGTTAE
TQTLTFTPSE     TPTSLLPVSS PTEPTARRKS SPETWASSIS VPAKTSLVET TDGTLVTTIK
MSSQAAQGNS     TWPAPAEETG TSPAGTSPGS PEVSTTLKIM SSKEPSISPE IRSTVRNSPW
KTPETTVPME     TTVEPVTLQS TALGSGSTSI SHLPTGTTSP TKSPTENMLA TERVSLSPSP
PEAWTNLYSG     TPGGTRQSLA TMSSVSLESP TARSITGTGQ QSSPELVSKT TGMEFSMWHG
STGGTTGDTH     VSLSTSSNIL EDPVTSPNSV SSLTDKSKHK TETWVSTTAI PSTVLNNKIM
AAEQQTSRSV     DEAYSSTSSW SDQTSGSDIT LGASPDVTNT LYITSTAQTT SLVSLPSGDQ
GITSLTNPSG     GKTSSASSVT SPSIGLETLR ANVSAVKSDI APTAGHLSQT SSPAEVSILD
VTTAPTPGIS     TTITTMGTNS ISTTTPNPEV GMSTMDSTPA TERRTTSTEH PSTWSSTAAS
DSWTVTDMTS     NLKVARSPGT ISTMHTTSFL ASSTELDSMS TPHGRITVIG TSLVTPSSDA
SAVKTETSTS     ERTLSPSDTT ASTPISTFSR VQRMSISVPD ILSTSWTPSS TEAEDVPVSM
VSTDHASTKT     DPNTPLSTFL FDSLSTLDWD TGRSLSSATA TTSAPQGATT PQELTLETMI
SPATSQLPFS     IGHITSAVTP AAMARSSGVT FSRPDPTSKK AEQTSTQLPT TTSAHPGQVP
RSAATTLDVI     PHTAKTPDAT FQRQGQTALT TEARATSDSW NEKEKSTPSA PWITEMMNSV
SEDTIKEVTS     SSSVLKDPEY AGHKLGIWDD FIPKFGKAAH MRELPLLSPP QDKEAIHPST
NTVETTGWVT     SSEHASHSTI PAHSASSKLT SPVVTTSTRE QAIVSMSTTT WPESTRARTE
PNSFLTIELR     DVSPYMDTSS TTQTSIISSP GSTAITKGPR TEITSSKRIS SSFLAQSMRS
SDSPSEAITR     LSNFPAMTES GGMILAMQTS PPGATSLSAP TLDTSATASW TGTPLATTQR
FTYSEKTTLF     SKGPEDTSQP SPPSVEETSS SSSLVPIHAT TSPSNILLTS QGHSPSSTPP
VTSVFLSETS     GLGKTTDMSR ISLEPGTSLP PNLSSTAGEA LSTYEASRDT KAIHHSADTA
VTNMEATSSE     YSPIPGHTKP SKATSPLVTS HIMGDITSST SVFGSSETTE IETVSSVNQG
LQERSTSQVA     SSATETSTVI THVSSGDATT HVTKTQATFS SGTSISSPHQ FITSTNTFTD
VSTNPSTSLI     MTESSGVTIT TQTGPTGAAT QGPYLLDTST MPYLTETPLA VTPDFMQSEK
TTLISKGPKD     VTWTSPPSVA ETSYPSSLTP FLVTTIPPAT STLQGQHTSS PVSATSVLTS
GLVKTTDMLN     TSMEPVTNSP QNLNNPSNEI LATLAATTDI ETIHPSINKA VTNMGTASSA
HVLHSTLPVS     SEPSTATSPM VPASSMGDAL ASISIPGSET TDIEGEPTSS LTAGRKENST
LQEMNSTTES     NIILSNVSVG AITEATKMEV PSFDATFIPT PAQSTKFPDI FSVASSRLSN
SPPMTISTHM     TTTQTGSSGA TSKIPLALDT STLETSAGTP SVVTEGFAHS KITTAMNNDV
KDVSQTNPPF     QDEASSPSSQ APVLVTTLPS SVAFTPQWHS TSSPVSMSSV LTSSLVKTAG
KVDTSLETVT     SSPQSMSNTL DDISVTSAAT TDIETTHPSI NTVVTNVGTT GSAFESHSTV
SAYPEPSKVT     SPNVTTSTME DTTISRSIPK SSKTTRTETE TTSSLTPKLR ETSISQEITS
STETSTVPYK     ELTGATTEVS RTDVTSSSST SFPGPDQSTV SLDISTETNT RLSTSPIMTE
SAEITITTQT     GPHGATSQDT FTMDPSNTTP QAGIHSAMTH GFSQLDVTTL MSRIPQDVSW
TSPPSVDKTS     SPSSFLSSPA MTTPSLISST LPEDKLSSPM TSLLTSGLVK ITDILRTRLE
PVTSSLPNFS     STSDKILATS KDSKDTKEIF PSINTEETNV KANNSGHESH SPALADSETP
KATTQMVITT     TVGDPAPSTS MPVHGSSETT NIKREPTYFL TPRLRETSTS QESSFPTDTS
FLLSKVPTGT     ITEVSSTGVN SSSKISTPDH DKSTVPPDTF TGEIPRVFTS SIKTKSAEMT
ITTQASPPES     ASHSTLPLDT STTLSQGGTH STVTQGFPYS EVTTLMGMGP GNVSWMTTPP
VEETSSVSSL     MSSPAMTSPS PVSSTSPQSI PSSPLPVTAL PTSVLVTTTD VLGTTSPESV
TSSPPNLSSI     THERPATYKD TAHTEAAMHH STNTAVTNVG TSGSGHKSQS SVLADSETSK
ATPLMSTTST     LGDTSVSTST PNISQTNQIQ TEPTASLSPR LRESSTSEKT SSTTETNTAF
SYVPTGAITQ     ASRTEISSSR TSISDLDRPT IAPDISTGMI TRLFTSPIMT KSAEMTVTTQ
TTTPGATSQG     ILPWDTSTTL FQGGTHSTVS QGFPHSEITT LRSRTPGDVS WMTTPPVEET
SSGFSLMSPS     MTSPSPVSST SPESIPSSPL PVTALLTSVL VTTTNVLGTT SPETVTSSPP
NLSSPTQERL     TTYKDTAHTE AMHASMHTNT AVANVGTSIS GHESQSSVPA DSHTSKATSP
MGITFAMGDT     SVSTSTPAFF ETRIQTESTS SLIPGLRDTR TSEEINTVTE TSTVLSEVPT
TTTTEVSRTE     VITSSRTTIS GPDHSKMSPY ISTETITRLS TFPFVTGSTE MAITNQTGPI
GTISQATLTL     DTSSTASWEG THSPVTQRFP HSEETTTMSR STKGVSWQSP PSVEETSSPS
SPVPLPAITS     HSSLYSAVSG SSPTSALPVT SLLTSGRRKT IDMLDTHSEL VTSSLPSASS
FSGEILTSEA     STNTETIHFS ENTAETNMGT TNSMHKLHSS VSIHSQPSGH TPPKVTGSMM
EDAIVSTSTP     GSPETKNVDR DSTSPLTPEL KEDSTALVMN STTESNTVFS SVSLDAATEV
SRAEVTYYDP     TFMPASAQST KSPDISPEAS SSHSNSPPLT ISTHKTIATQ TGPSGVTSLG
QLTLDTSTIA     TSAGTPSART QDFVDSETTS VMNNDLNDVL KTSPFSAEEA NSLSSQAPLL
VTTSPSPVTS     TLQEHSTSSL VSVTSVPTPT LAKITDMDTN LEPVTRSPQN LRNTLATSEA
TTDTHTMHPS     INTAMANVGT TSSPNEFYFT VSPDSDPYKA TSAVVITSTS GDSIVSTSMP
RSSAMKKIES     ETTFSLIFRL RETSTSQKIG SSSDTSTVFD KAFTAATTEV SRTELTSSSR
TSIQGTEKPT     MSPDTSTRSV TMLSTFAGLT KSEERTIATQ TGPHRATSQG TLTWDTSITT
SQAGTHSAMT     HGFSQLDLST LTSRVPEYIS GTSPPSVEKT SSSSSLLSLP AITSPSPVPT
TLPESRPSSP     VHLTSLPTSG LVKTTDMLAS VASLPPNLGS TSHKIPTTSE DIKDTEKMYP
STNIAVTNVG     TTTSEKESYS SVPAYSEPPK VTSPMVTSFN IRDTIVSTSM PGSSEITRIE
MESTFSVAHG     LKGTSTSQDP IVSTEKSAVL HKLTTGATET SRTEVASSRR TSIPGPDHST
ESPDISTEVI     PSLPISLGIT ESSNMTIITR TGPPLGSTSQ GTFTLDTPTT SSRAGTHSMA
TQEFPHSEMT     TVMNKDPEIL SWTIPPSIEK TSFSSSLMPS PAMTSPPVSS TLPKTIHTTP
SPMTSLLTPS     LVMTTDTLGT SPEPTTSSPP NLSSTSHVIL TTDEDTTAIE AMHPSTSTAA
TNVETTCSGH     GSQSSVLTDS EKTKATAPMD TTSTMGHTTV STSMSVSSET TKIKRESTYS
LTPGLRETSI     SQNASFSTDT SIVLSEVPTG TTAEVSRTEV TSSGRTSIPG PSQSTVLPEI
STRTMTRLFA     SPTMTESAEM TIPTQTGPSG STSQDTLTLD TSTTKSQAKT HSTLTQRFPH
SEMTTLMSRG     PGDMSWQSSP SLENPSSLPS LLSLPATTSP PPISSTLPVT ISSSPLPVTS
LLTSSPVTTT     DMLHTSPELV TSSPPKLSHT SDERLTTGKD TTNTEAVHPS TNTAASNVEI
PSFGHESPSS     ALADSETSKA TSPMFITSTQ EDTTVAISTP HFLETSRIQK ESISSLSPKL
RETGSSVETS     SAIETSAVLS EVSIGATTEI SRTEVTSSSR TSISGSAEST MLPEISTTRK
IIKFPTSPIL     AESSEMTIKT QTSPPGSTSE STFTLDTSTT PSLVITHSTM TQRLPHSEIT
TLVSRGAGDV     PRPSSLPVEE TSPPSSQLSL SAMISPSPVS STLPASSHSS SASVTSPLTP
GQVKTTEVLD     ASAEPETSSP PSLSSTSVEI LATSEVTTDT EKIHPFPNTA VTKVGTSSSG
HESPSSVLPD     SETTKATSAM GTISIMGDTS VSTLTPALSN TRKIQSEPAS SLTTRLRETS
TSEETSLATE     ANTVLSKVST GATTEVSRTE AISFSRTSMS GPEQSTMSQD ISIGTIPRIS
ASSVLTESAK     MTITTQTGPS ESTLESTLNL NTATTPSWVE THSIVIQGFP HPEMTTSMGR
GPGGVSWPSP     PFVKETSPPS SPLSLPAVTS PHPVSTTFLA HIPPSPLPVT SLLTSGPATT
TDILGTSTEP     GTSSSSSLST TSHERLTTYK DTAHTEAVHP STNTGGTNVA TTSSGYKSQS
SVLADSSPMC     TTSTMGDTSV LTSTPAFLET RRIQTELASS LTPGLRESSG SEGTSSGTKM
STVLSKVPTG     ATTEISKEDV TSIPGPAQST ISPDISTRTV SWFSTSPVMT ESAEITMNTH
TSPLGATTQG     TSTLATSSTT SLTMTHSTIS QGFSHSQMST LMRRGPEDVS WMSPPLLEKT
RPSFSLMSSP     ATTSPSPVSS TLPESISSSP LPVTSLLTSG LAKTTDMLHK SSEPVTNSPA
NLSSTSVEIL     ATSEVTTDTE KTHPSSNRTV TDVGTSSSGH ESTSFVLADS QTSKVTSPMV
ITSTMEDTSV     STSTPGFFET SRIQTEPTSS LTLGLRKTSS SEGTSLATEM STVLSGVPTG
ATAEVSRTEV     TSSSRTSISG FAQLTVSPET STETITRLPT SSIMTESAEM MIKTQTDPPG
STPESTHTVD     ISTTPNWVET HSTVTQRFSH SEMTTLVSRS PGDMLWPSQS SVEETSSASS
LLSLPATTSP     SPVSSTLVED FPSASLPVTS LLTPGLVITT DRMGISREPG TSSTSNLSST
SHERLTTLED     TVDTEDMQPS THTAVTNVRT SISGHESQSS VLSDSETPKA TSPMGTTYTM
GETSVSISTS     DFFETSRIQI EPTSSLTSGL RETSSSERIS SATEGSTVLS EVPSGATTEV
SRTEVISSRG     TSMSGPDQFT ISPDISTEAI TRLSTSPIMT ESAESAITIE TGSPGATSEG
TLTLDTSTTT     FWSGTHSTAS PGFSHSEMTT LMSRTPGDVP WPSLPSVEEA SSVSSSLSSP
AMTSTSFFSA     LPESISSSPH PVTALLTLGP VKTTDMLRTS SEPETSSPPN LSSTSAEILA
TSEVTKDREK     IHPSSNTPVV NVGTVIYKHL SPSSVLADLV TTKPTSPMAT TSTLGNTSVS
TSTPAFPETM     MTQPTSSLTS GLREISTSQE TSSATERSAS LSGMPTGATT KVSRTEALSL
GRTSTPGPAQ     STISPEISTE TITRISTPLT TTGSAEMTIT PKTGHSGASS QGTFTLDTSS
RASWPGTHSA     ATHRSPHSGM TTPMSRGPED VSWPSRPSVE KTSPPSSLVS LSAVTSPSPL
YSTPSESSHS     SPLRVTSLFT PVMMKTTDML DTSLEPVTTS PPSMNITSDE SLATSKATME
TEAIQLSENT     AVTQMGTISA RQEFYSSYPG LPEPSKVTSP VVTSSTIKDI VSTTIPASSE
ITRIEMESTS     TLTPTPRETS TSQEIHSATK PSTVPYKALT SATIEDSMTQ VMSSSRGPSP
DQSTMSQDIS     SEVITRLSTS PIKAESTEMT ITTQTGSPGA TSRGTLTLDT STTFMSGTHS
TASQGFSHSQ     MTALMSRTPG DVPWLSHPSV EEASSASFSL SSPVMTSSSP VSSTLPDSIH
SSSLPVTSLL     TSGLVKTTEL LGTSSEPETS SPPNLSSTSA EILATTEVTT DTEKLEMTNV
VTSGYTHESP     SSVLADSVTT KATSSMGITY PTGDTNVLTS TPAFSDTSRI QTKSKLSLTP
GLMETSISEE     TSSATEKSTV LSSVPTGATT EVSRTEAISS SRTSIPGPAQ STMSSDTSME
TITRISTPLT     RKESTDMAIT PKTGPSGATS QGTFTLDSSS TASWPGTHSA TTQRFPQSVV
TTPMSRGPED     VSWPSPLSVE KNSPPSSLVS SSSVTSPSPL YSTPSGSSHS SPVPVTSLFT
SIMMKATDML     DASLEPETTS APNMNITSDE SLATSKATTE TEAIHVFENT AASHVETTSA
TEELYSSSPG     FSEPTKVISP VVTSSSIRDN MVSTTMPGSS GITRIEIESM SSLTPGLRET
RTSQDITSST     ETSTVLYKMS SGATPEVSRT EVMPSSRTSI PGPAQSTMSL DISDEVVTRL
STSPIMTESA     EITITTQTGY SLATSQVTLP LGTSMTFLSG THSTMSQGLS HSEMTNLMSR
GPESLSWTSP     RFVETTRSSS SLTSLPLTTS LSPVSSTLLD SSPSSPLPVT SLILPGLVKT
TEVLDTSSEP     KTSSSPNLSS TSVEIPATSE IMTDTEKIHP SSNTAVAKVR TSSSVHESHS
SVLADSETTI     TIPSMGITSA VDDTTVFTSN PAFSETRRIP TEPTFSLTPG FRETSTSEET
TSITETSAVL     YGVPTSATTE VSMTEIMSSN RTHIPDSDQS TMSPDIITEV ITRLSSSSMM
SESTQMTITT     QKSSPGATAQ STLTLATTTA PLARTHSTVP PRFLHSEMTT LMSRSPENPS
WKSSPFVEKT     SSSSSLLSLP VTTSPSVSST LPQSIPSSSF SVTSLLTPGM VKTTDTSTEP
GTSLSPNLSG     TSVEILAASE VTTDTEKIHP SSSMAVTNVG TTSSGHELYS SVSIHSEPSK
ATYPVGTPSS     MAETSISTSM PANFETTGFE AEPFSHLTSG FRKTNMSLDT SSVTPTNTPS
SPGSTHLLQS     SKTDFTSSAK TSSPDWPPAS QYTEIPVDII TPFNASPSIT ESTGITSFPE
SRFTMSVTES     THHLSTDLLP SAETISTGTV MPSLSEAMTS FATTGVPRAI SGSGSPFSRT
ESGPGDATLS     TIAESLPSST PVPFSSSTFT TTDSSTIPAL HEITSSSATP YRVDTSLGTE
SSTTEGRLVM     VSTLDTSSQP GRTSSTPILD TRMTESVELG TVTSAYQVPS LSTRLTRTDG
IMEHITKIPN     EAAHRGTIRP VKGPQTSTSP ASPKGLHTGG TKRMETTTTA LKTTTTALKT
TSRATLTTSV     YTPTLGTLTP LNASRQMAST ILTEMMITTP YVFPDVPETT SSLATSLGAE
TSTALPRTTP     SVLNRESETT ASLVSRSGAE RSPVIQTLDV SSSEPDTTAS WVIHPAETIP
TVSKTTPNFF     HSELDTVSST ATSHGADVSS AIPTNISPSE LDALTPLVTI SGTDTSTTFP
TLTKSPHETE     TRTTWLTHPA ETSSTIPRTI PNFSHHESDA TPSIATSPGA ETSSAIPIMT
VSPGAEDLVT     SQVTSSGTDR NMTIPTLTLS PGEPKTIASL VTHPEAQTSS AIPTSTISPA
VSRLVTSMVT     SLAAKTSTTN RALTNSPGEP ATTVSLVTHP AQTSPTVPWT TSIFFHSKSD
TTPSMTTSHG     AESSSAVPTP TVSTEVPGVV TPLVTSSRAV ISTTIPILTL SPGEPETTPS
MATSHGEEAS     SAIPTPTVSP GVPGVVTSLV TSSRAVTSTT IPILTFSLGE PETTPSMATS
HGTEAGSAVP     TVLPEVPGMV TSLVASSRAV TSTTLPTLTL SPGEPETTPS MATSHGAEAS
STVPTVSPEV     PGVVTSLVTS SSGVNSTSIP TLILSPGELE TTPSMATSHG AEASSAVPTP
TVSPGVSGVV     TPLVTSSRAV TSTTIPILTL SSSEPETTPS MATSHGVEAS SAVLTVSPEV
PGMVTSLVTS     SRAVTSTTIP TLTISSDEPE TTTSLVTHSE AKMISAIPTL AVSPTVQGLV
TSLVTSSGSE     TSAFSNLTVA SSQPETIDSW VAHPGTEASS VVPTLTVSTG EPFTNISLVT
HPAESSSTLP     RTTSRFSHSE LDTMPSTVTS PEAESSSAIS TTISPGIPGV LTSLVTSSGR
DISATFPTVP     ESPHESEATA SWVTHPAVTS TTVPRTTPNY SHSEPDTTPS IATSPGAEAT
SDFPTITVSP     DVPDMVTSQV TSSGTDTSIT IPTLTLSSGE PETTTSFITY SETHTSSAIP
TLPVSPGASK     MLTSLVISSG TDSTTTFPTL TETPYEPETT AIQLIHPAET NTMVPKTTPK
FSHSKSDTTL     PVAITSPGPE ASSAVSTTTI SPDMSDLVTS LVPSSGTDTS TTFPTLSETP
YEPETTVTWL     THPAETSTTV SGTIPNFSHR GSDTAPSMVT SPGVDTRSGV PTTTIPPSIP
GVVTSQVTSS     ATDTSTAIPT LTPSPGEPET TASSATHPGT QTGFTVPIRT VPSSEPDTMA
SWVTHPPQTS     TPVSRTTSSF SHSSPDATPV MATSPRTEAS SAVLTTISPG APEMVTSQIT
SSGAATSTTV     PTLTHSPGMP ETTALLSTHP RTGTSKTFPA STVFPQVSET TASLTIRPGA
ETSTALPTQT     TSSLFTLLVT GTSRVDLSPT ASPGVSAKTA PLSTHPGTET STMIPTSTLS
LGLLETTGLL     ATSSSAETST STLTLTVSPA VSGLSSASIT TDKPQTVTSW NTETSPSVTS
VGPPEFSRTV     TGTTMTLIPS EMPTPPKTSH GEGVSPTTIL RTTMVEATNL ATTGSSPTVA
KTTTTFNTLA     GSLFTPLTTP GMSTLASESV TSRTSYNHRS WISTTSSYNR RYWTPATSTP
VTSTFSPGIS     TSSIPSSTAA TVPFMVPFTL NFTITNLQYE EDMRHPGSRK FNATERELQG
LLKPLFRNSS     LEYLYSGCRL ASLRPEKDSS AMAVDAICTH RPDPEDLGLD RERLYWELSN
LTNGIQELGP     YTLDRNSLYV NGFTHRSSMP TTSTPGTSTV DVGTSGTPSS SPSPTAAGPL
LMPFTLNFTI     TNLQYEEDMR RTGSRKFNTM ESVLQGLLKP LFKNTSVGPL YSGCRLTLLR
PEKDGAATGV     DAICTHRLDP KSPGLNREQL YWELSKLTND IEELGPYTLD RNSLYVNGFT
HQSSVSTTST     PGTSTVDLRT SGTPSSLSSP TIMAAGPLLV PFTLNFTITN LQYGEDMGHP
GSRKFNTTER     VLQGLLGPIF KNTSVGPLYS GCRLTSLRSE KDGAATGVDA ICIHHLDPKS
PGLNRERLYW     ELSQLTNGIK ELGPYTLDRN SLYVNGFTHR TSVPTTSTPG TSTVDLGTSG
TPFSLPSPAT     AGPLLVLFTL NFTITNLKYE EDMHRPGSRK FNTTERVLQT LLGPMFKNTS
VGLLYSGCRL     TLLRSEKDGA ATGVDAICTH RLDPKSPGLD REQLYWELSQ LTNGIKELGP
YTLDRNSLYV     NGFTHWIPVP TSSTPGTSTV DLGSGTPSSL PSPTAAGPLL VPFTLNFTIT
NLQYEEDMHH     PGSRKFNTTE RVLQGLLGPM FKNTSVGLLY SGCRLTLLRS EKDGAATGVD
AICTHRLDPK     SPGVDREQLY WELSQLTNGI KELGPYTLDR NSLYVNGFTH QTSAPNTSTP
GTSTVDLGTS     GTPSSLPSPT SAGPLLVPFT LNFTITNLQY EEDMRHPGSR KFNTTERVLQ
GLLKPLFKST     SVGPLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGV DREQLYWELS
QLTNGIKELG     PYTLDRNSLY VNGFTHQTSA PNTSTPGTST VDLGTSGTPS SLPSPTSAGP
LLVPFTLNFT     ITNLQYEEDM HHPGSRKFNT TERVLQGLLG PMFKNTSVGL LYSGCRLTLL
RPEKNGAATG     MDAICSHRLD PKSPGLNREQ LYWELSQLTH GIKELGPYTL DRNSLYVNGF
THRSSVAPTS     TPGTSTVDLG TSGTPSSLPS PTTAVPLLVP FTLNFTITNL QYGEDMRHPG
SRKFNTTERV     LQGLLGPLFK NSSVGPLYSG CRLISLRSEK DGAATGVDAI CTHHLNPQSP
GLDREQLYWQ     LSQMTNGIKE LGPYTLDRNS LYVNGFTHRS SGLTTSTPWT STVDLGTSGT
PSPVPSPTTA     GPLLVPFTLN FTITNLQYEE DMHRPGSRKF NTTERVLQGL LSPIFKNSSV
GPLYSGCRLT     SLRPEKDGAA TGMDAVCLYH PNPKRPGLDR EQLYWELSQL THNITELGPY
SLDRDSLYVN     GFTHQNSVPT TSTPGTSTVY WATTGTPSSF PGHTEPGPLL IPFTFNFTIT
NLHYEENMQH     PGSRKFNTTE RVLQGLLKPL FKNTSVGPLY SGCRLTSLRP EKDGAATGMD
AVCLYHPNPK     RPGLDREQLY WELSQLTHNI TELGPYSLDR DSLYVNGFTH QNSVPTTSTP
GTSTVYWATT     GTPSSFPGHT EPGPLLIPFT FNFTITNLHY EENMQHPGSR KFNTTERVLQ
GLLKPLFKNT     SVGPLYSGCR LTLLRPEKHE AATGVDTICT HRVDPIGPGL DRERLYWELS
QLTNSITELG     PYTLDRDSLY VNGFNPRSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP
LLIPFTLNFT     ITNLHYEENM QHPGSRKFNT TERVLQGLLK PLFKNTSVGP LYSGCRLTLL
RPEKHEAATG     VDTICTHRVD PIGPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF
THXXSXPTTS     TPGTSTVXXG TSGTPSSXPX XTSAGPLLVP FTLNFTITNL QYEEDMHHPG
SRKFNTTERV     LQGLLGPMFK NTSVGLLYSG CRLTLLRPEK NGAATGMDAI CSHRLDPKSP
GLDREQLYWE     LSQLTHGIKE LGPYTLDRNS LYVNGFTHRS SVAPTSTPGT STVDLGTSGT
PSSLPSPTTA     VPLLVPFTLN FTITNLQYGE DMRHPGSRKF NTTERVLQGL LGPLFKNSSV
GPLYSGCRLI     SLRSEKDGAA TGVDAICTHH LNPQSPGLDR EQLYWQLSQM TNGIKELGPY
TLDRNSLYVN     GFTHRSSGLT TSTPWTSTVD LGTSGTPSPV PSPTTAGPLL VPFTLNFTIT
NLQYEEDMHR     PGSRKFNATE RVLQGLLSPI FKNSSVGPLY SGCRLTSLRP EKDGAATGMD
AVCLYHPNPK     RPGLDREQLY WELSQLTHNI TELGPYSLDR DSLYVNGFTH QSSMTTTRTP
DTSTMHLATS     RTPASLSGPT TASPLLVLFT INCTITNLQY EEDMRRTGSR KFNTMESVLQ
GLLKPLFKNT     SVGPLYSGCR LTLLRPKKDG AATGVDAICT HRLDPKSPGL NREQLYWELS
KLTNDIEELG     PYTLDRNSLY VNGFTHQSSV STTSTPGTST VDLRTSGTPS SLSSPTIMXX
XPLLXPFTXN     XTITNLXXXX XMXXPGSRKF NTTERVLQGL LRPLFKNTSV SSLYSGCRLT
LLRPEKDGAA     TRVDAACTYR PDPKSPGLDR EQLYWELSQL THSITELGPY TLDRVSLYVN
GFNPRSSVPT     TSTPGTSTVH LATSGTPSSL PGHTXXXPLL XPFTXNXTIT NLXXXXXMXX
PGSRKFNTTE     RVLQGLLKPL FRNSSLEYLY SGCRLASLRP EKDSSAMAVD AICTHRPDPE
DLGLDRERLY     WELSNLTNGI QELGPYTLDR NSLYVNGFTH RSSGLTTSTP WTSTVDLGTS
GTPSPVPSPT     TAGPLLVPFT LNFTITNLQY EEDMHRPGSR RFNTTERVLQ GLLTPLFKNT
SVGPLYSGCR     LTLLRPEKQE AATGVDTICT HRVDPIGPGL DRERLYWELS QLTNSITELG
PYTLDRDSLY     VNGFNPWSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP LLIPFTLNFT
ITDLHYEENM     QHPGSRKFNT TERVLQGLLK PLFKSTSVGP LYSGCRLTLL RPEKHGAATG
VDAICTLRLD     PTGPGLDRER LYWELSQLTN SVTELGPYTL DRDSLYVNGF THRSSVPTTS
IPGTSAVHLE     TSGTPASLPG HTAPGPLLVP FTLNFTITNL QYEEDMRHPG SRKFSTTERV
LQGLLKPLFK     NTSVSSLYSG CRLTLLRPEK DGAATRVDAV CTHRPDPKSP GLDRERLYWK
LSQLTHGITE     LGPYTLDRHS LYVNGFTHQS SMTTTRTPDT STMHLATSRT PASLSGPTTA
SPLLVLFTIN     FTITNLRYEE NMHHPGSRKF NTTERVLQGL LRPVFKNTSV GPLYSGCRLT
TLRPKKDGAA     TKVDAICTYR PDPKSPGLDR EQLYWELSQL THSITELGPY TQDRDSLYVN
GFTHRSSVPT     TSIPGTSAVH LETSGTPASL PGHTAPGPLL VPFTLNFTIT NLQYEEDMRH
PGSRKFNTTE     RVLQGLLKPL FKSTSVGPLY SGCRLTLLRP EKRGAATGVD TICTHRLDPL
NPGLDREQLY     WELSKLTRGI IELGPYLLDR GSLYVNGFTH RTSVPTTSTP GTSTVDLGTS
GTPFSLPSPA     XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTERVLQ TLLGPMFKNT
SVGLLYSGCR     LTLLRSEKDG AATGVDAICT HRLDPKSPGV DREQLYWELS QLTNGIKELG
PYTLDRNSLY     VNGFTHWIPV PTSSTPGTST VDLGSGTPSS LPSPTTAGPL LVPFTLNFTI
TNLKYEEDMH     CPGSRKFNTT ERVLQSLLGP MFKNTSVGPL YSGCRLTLLR SEKDGAATGV
DAICTHRLDP     KSPGVDREQL YWELSQLTNG IKELGPYTLD RNSLYVNGFT HQTSAPNTST
PGTSTVDLGT     SGTPSSLPSP TXXXPLLXPF TXNXTITNLX XXXXMXXPGS RKFNTTEXVL
QGLLXPXFKN     XSVGXLYSGC RLTXLRXEKX GAATGXDAIC XHXXXPKXPG LXXEXLYWEL
SXLTXXIXEL     GPYTLDRXSL YVNGFTHWIP VPTSSTPGTS TVDLGSGTPS SLPSPTTAGP
LLVPFTLNFT     ITNLKYEEDM HCPGSRKFNT TERVLQSLLG PMFKNTSVGP LYSGCRLTSL
RSEKDGAATG     VDAICTHRVD PKSPGVDREQ LYWELSQLTN GIKELGPYTL DRNSLYVNGF
THQTSAPNTS     TPGTSTVXXG TSGTPSSXPX XTSAGPLLVP FTLNFTITNL QYEEDMHHPG
SRKFNTTERV     LQGLLGPMFK NTSVGLLYSG CRLTLLRPEK NGATTGMDAI CTHRLDPKSP
GLXXEXLYWE     LSXLTXXIXE LGPYTLDRXS LYVNGFTHXX SXPTTSTPGT STVXXGTSGT
PSSXPXXTXX     XPLLXPFTXN XTITNLXXXX XMXXPGSRKF NTTERVLQGL LKPLFRNSSL
EYLYSGCRLA     SLRPEKDSSA MAVDAICTHR PDPEDLGLDR ERLYWELSNL TNGIQELGPY
TLDRNSLYVN     GFTHRSSMPT TSTPGTSTVD VGTSGTPSSS PSPTTAGPLL IPFTLNFTIT
NLQYGEDMGH     PGSRKFNTTE RVLQGLLGPI FKNTSVGPLY SGCRLTSLRS EKDGAATGVD
AICIHHLDPK     SPGLNRERLY WELSQLTNGI KELGPYTLDR NSLYVNGFTH RTSVPTTSTP
GTSTVDLGTS     GTPFSLPSPA TAGPLLVLFT LNFTITNLKY EEDMHRPGSR KFNTTERVLQ
TLLGPMFKNT     SVGLLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGL XXEXLYWELS
XLTXXIXELG     PYTLDRXSLY VNGFTHXXSX PTTSTPGTST VXXGTSGTPS SXPXXTXXXP
LLXPFTXNXT     ITNLXXXXXM XXPGSRKFNT TERVLQGLLR PVFKNTSVGP LYSGCRLTLL
RPKKDGAATK     VDAICTYRPD PKSPGLDREQ LYWELSQLTH SITELGPYTQ DRDSLYVNGF
THRSSVPTTS     IPGTSAVHLE TTGTPSSFPG HTEPGPLLIP FTFNFTITNL RYEENMQHPG
SRKFNTTERV     LQGLLTPLFK NTSVGPLYSG CRLTLLRPEK QEAATGVDTI CTHRVDPIGP
GLDRERLYWE     LSQLTNSITE LGPYTLDRDS LYVDGFNPWS SVPTTSTPGT STVHLATSGT
PSPLPGHTAP     VPLLIPFTLN FTITDLHYEE NMQHPGSRKF NTTERVLQGL LKPLFKSTSV
GPLYSGCRLT     LLRPEKHGAA TGVDAICTLR LDPTGPGLDR ERLYWELSQL TNSITELGPY
TLDRDSLYVN     GFNPWSSVPT TSTPGTSTVH LATSGTPSSL PGHTTAGPLL VPFTLNFTIT
NLKYEEDMHC     PGSRKFNTTE RVLQSLHGPM FKNTSVGPLY SGCRLTLLRS EKDGAATGVD
AICTHRLDPK     SPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH XXSXPTTSTP
GTSTVXXGTS     GTPSSXPXXT XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTEXVLQ
GLLXPXFKNX     SVGXLYSGCR LTXLRXEKXG AATGXDAICX HXXXPKXPGL XXEXLYWELS
XLTNSITELG     PYTLDRDSLY VNGFTHRSSM PTTSIPGTSA VHLETSGTPA SLPGHTAPGP
LLVPFTLNFT     ITNLQYEEDM RHPGSRKFNT TERVLQGLLK PLFKSTSVGP LYSGCRLTLL
RPEKRGAATG     VDTICTHRLD PLNPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF
THXXSXPTTS     TPGTSTVXXG TSGTPSSXPX XTXXXPLLXP FTXNXTITNL XXXXXMXXPG
SRKFNTTEXV     LQGLLXPXFK NXSVGXLYSG CRLTXLRXEK XGAATGXDAI CXHXXXPKXP
GLXXEXLYWE     LSXLTXXIXE LGPYTLDRXS LYVNGFHPRS SVPTTSTPGT STVHLATSGT
PSSLPGHTAP     VPLLIPFTLN FTITNLHYEE NMQHPGSRKF NTTERVLQGL LGPMFKNTSV
GLLYSGCRLT     LLRPEKNGAA TGMDAICSHR LDPKSPGLXX EXLYWELSXL TXXIXELGPY
TLDRXSLYVN     GFTHXXSXPT TSTPGTSTVX XGTSGTPSSX PXXTXXXPLL XPFTXNXTIT
NLXXXXXMXX     PGSRKFNTTE XVLQGLLXPX FKNXSVGXLY SGCRLTXLRX EKXGAATGXD
AICXHXXXPK     XPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH QNSVPTTSTP
GTSTVYWATT     GTPSSFPGHT EPGPLLIPFT FNFTITNLHY EENMQHPGSR KFNTTERVLQ
GLLTPLFKNT     SVGPLYSGCR LTLLRPEKQE AATGVDTICT HRVDPIGPGL XXEXLYWELS
XLTXXIXELG     PYTLDRXSLY VNGFTHXXSX PTTSTPGTST VXXGTSGTPS SXPXXTXXXP
LLXPFTXNXT     ITNLXXXXXM XXPGSRKFNT TEXVLQGLLX PXFKNXSVGX LYSGCRLTXL
RXEKXGAATG     XDAICXHXXX PKXPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF
THRSSVPTTS     SPGTSTVHLA TSGTPSSLPG HTAPVPLLIP FTLNFTITNL HYEENMQHPG
SRKFNTTERV     LQGLLKPLFK STSVGPLYSG CRLTLLRPEK HGAATGVDAI CTLRLDPTGP
GLXXEXLYWE     LSXLTXXIXE LGPYTLDRXS LYVNGFTHXX SXPTTSTPGT STVXXGTSGT
PSSXPXXTXX     XPLLXPFTXN XTITNLXXXX XMXXPGSRKF NTTEXVLQGL LXPXFKNXSV
GXLYSGCRLT     XLRXEKXGAA TGXDAICXHX XXPKXPGLXX EXLYWELSXL TXXIXELGPY
TLDRXSLYVN     GFTHRTSVPT TSTPGTSTVH LATSGTPSSL PGHTAPVPLL IPFTLNFTIT
NLQYEEDMHR     PGSRKFNTTE RVLQGLLSPI FKNSSVGPLY SGCRLTSLRP EKDGAATGMD
AVCLYHPNPK     RPGLDREQLY CELSQLTHNI TELGPYSLDR DSLYVNGFTH QNSVPTTSTP
GTSTVYWATT     GTPSSFPGHT XXXPLLXPFT XNXTITNLXX XXXMXXPGSR KFNTTEXVLQ
GLLXPXFKNX     SVGXLYSGCR LTXLRXEKXG AATGXDAICX HXXXPKXPGL XXEXLYWELS
XLTXXIXELG     PYTLDRXSLY VNGFTHWSSG LTTSTPWTST VDLGTSGTPS PVPSPTTAGP
LLVPFTLNFT     ITNLQYEEDM HRPGSRKFNA TERVLQGLLS PIFKNTSVGP LYSGCRLTLL
RPEKQEAATG     VDTICTHRVD PIGPGLXXEX LYWELSXLTX XIXELGPYTL DRXSLYVNGF
THXXSXPTTS     TPGTSTVXXG TSGTPSSXPX XTXXXPLLXP FTXNXTITNL XXXXXMXXPG
SRKFNTTEXV     LQGLLXPXFK NXSVGXLYSG CRLTXLRXEK XGAATGXDAI CXHXXXPKXP
GLXXEXLYWE     LSXLTXXIXE LGPYTLDRXS LYVNGFTHRS FGLTTSTPWT STVDLGTSGT
PSPVPSPTTA     GPLLVPFTLN FTITNLQYEE DMHRPGSRKF NTTERVLQGL LTPLFRNTSV
SSLYSGCRLT     LLRPEKDGAA TRVDAVCTHR PDPKSPGLXX EXLYWELSXL TXXIXELGPY
TLDRXSLYVN     GFTHXXSXPT TSTPGTSTVX XGTSGTPSSX PXXTXXXPLL XPFTXNXTIT
NLXXXXXMXX     PGSRKFNTTE XVLQGLLXPX FKNXSVGXLY SGCRLTXLRX EKXGAATGXD
AICXHXXXPK     XPGLXXEXLY WELSXLTXXI XELGPYTLDR XSLYVNGFTH WIPVPTSSTP
GTSTVDLGSG     TPSSLPSPTT AGPLLVPFTL NFTITNLQYG EDMGHPGSRK FNTTERVLQG
LLGPIFKNTS     VGPLYSGCRL TSLRSEKDGA ATGVDAICIH HLDPKSPGLX XEXLYWELSX
LTXXIXELGP     YTLDRXSLYV NGFTHXXSXP TTSTPGTSTV XXGTSGTPSS XPXXTXXXPL
LXPFTXNXTI     TNLXXXXXMX XPGSRKFNTT EXVLQGLLXP XFKNXSVGXL YSGCRLTXLR
XEKXGAATGX     DAICXHXXXP KXPGLXXEXL YWELSXLTXX IXELGPYTLD RXSLYVNGFT
HQTFAPNTST     PGTSTVDLGT SGTPSSLPSP TSAGPLLVPF TLNFTITNLQ YEEDMHHPGS
RKFNTTERVL     QGLLGPMFKN TSVGLLYSGC RLTLLRPEKN GAATRVDAVC THRPDPKSPG
LXXEXLYWEL     SXLTXXIXEL GPYTLDRXSL YVNGFTHXXS XPTTSTPGTS TVXXGTSGTP
SSXPXXTAPV     PLLIPFTLNF TITNLHYEEN MQHPGSRKFN TTERVLQGLL KPLFKSTSVG
PLYSGCRLTL     LRPEKHGAAT GVDAICTLRL DPTGPGLDRE RLYWELSQLT NSVTELGPYT
LDRDSLYVNG     FTQRSSVPTT SIPGTSAVHL ETSGTPASLP GHTAPGPLLV PFTLNFTITN
LQYEVDMRHP     GSRKFNTTER VLQGLLKPLF KSTSVGPLYS GCRLTLLRPE KRGAATGVDT
ICTHRLDPLN     PGLDREQLYW ELSKLTRGII ELGPYLLDRG SLYVNGFTHR NFVPITSTPG
TSTVHLGTSE     TPSSLPRPIV PGPLLVPFTL NFTITNLQYE EAMRHPGSRK FNTTERVLQG
LLRPLFKNTS     IGPLYSSCRL TLLRPEKDKA ATRVDAICTH HPDPQSPGLN REQLYWELSQ
LTHGITELGP     YTLDRDSLYV DGFTHWSPIP TTSTPGTSIV NLGTSGIPPS LPETTXXXPL
LXPFTXNXTI     TNLXXXXXMX XPGSRKFNTT ERVLQGLLKP LFKSTSVGPL YSGCRLTLLR
PEKDGVATRV     DAICTHRPDP KIPGLDRQQL YWELSQLTHS ITELGPYTLD RDSLYVNGFT
QRSSVPTTST     PGTFTVQPET SETPSSLPGP TATGPVLLPF TLNFTITNLQ YEEDMHRPGS
RKFNTTERVL     QGLLMPLFKN TSVSSLYSGC RLTLLRPEKD GAATRVDAVC THRPDPKSPG
LDRERLYWKL     SQLTHGITEL GPYTLDRHSL YVNGFTHQSS MTTTRTPDTS TMHLATSRTP
ASLSGPTTAS     PLLVLFTINF TITNLRYEEN MHHPGSRKFN TTERVLQGLL RPVFKNTSVG
PLYSGCRLTL     LRPKKDGAAT KVDAICTYRP DPKSPGLDRE QLYWELSQLT HSITELGPYT
LDRDSLYVNG     FTQRSSVPTT SIPGTPTVDL GTSGTPVSKP GPSAASPLLV LFTLNFTITN
LRYEENMQHP     GSRKFNTTER VLQGLLRSLF KSTSVGPLYS GCRLTLLRPE KDGTATGVDA
ICTHHPDPKS     PRLDREQLYW ELSQLTHNIT ELGHYALDND SLFVNGFTHR SSVSTTSTPG
TPTVYLGASK     TPASIFGPSA ASHLLILFTL NFTITNLRYE ENMWPGSRKF NTTERVLQGL
LRPLFKNTSV     GPLYSGSRLT LLRPEKDGEA TGVDAICTHR PDPTGPGLDR EQLYLELSQL
THSITELGPY     TLDRDSLYVN GFTHRSSVPT TSTGVVSEEP FTLNFTINNL RYMADMGQPG
SLKFNITDNV     MKHLLSPLFQ RSSLGARYTG CRVIALRSVK NGAETRVDLL CTYLQPLSGP
GLPIKQVFHE     LSQQTHGITR LGPYSLDKDS LYLNGYNEPG LDEPPTTPKP ATTFLPPLSE
ATTAMGYHLK     TLTLNFTISN LQYSPDMGKG SATFNSTEGV LQHLLRPLFQ KSSMGPFYLG
CQLISLRPEK     DGAATGVDTT CTYHPDPVGP GLDIQQLYWE LSQLTHGVTQ LGFYVLDRDS
LFINGYAPQN     LSIRGEYQIN FHIVNWNLSN PDPTSSEYIT LLRDIQDKVT TLYKGSQLHD
TFRFCLVTNL     TMDSVLVTVK ALFSSNLDPS LVEQVFLDKT LNASFHWLGS TYQLVDIHVT
EMESSVYQPT     SSSSTQHFYL NFTITNLPYS QDKAQPGTTN YQRNKRNIED ALNQLFRNSS
IKSYFSDCQV     STFRSVPNRH HTGVDSLCNF SPLARRVDRV AIYEEFLRMT RNGTQLQNFT
LDRSSVLVDG     YSPNRNEPLT GNSDLPFWAV ILIGLAGLLG LITCLICGVL VTTRRRKKEG
EYNVQQQCPG     YYQSHLDLED LQ
Stack trace follows ....


        at org.biojavax.bio.seq.io.UniProtFormat.readRichSequence(UniProtFormat.java:609)
        at org.biojavax.bio.seq.io.RichStreamReader.nextRichSequence(RichStreamReader.java:110)
        ... 2 more
Caused by: org.biojava.bio.symbol.IllegalAlphabetException: Can't handle this in the general case: PROTEIN-TERM
        at org.biojava.bio.symbol.PackingFactory.getPacking(PackingFactory.java:75)
        at org.biojava.bio.symbol.PackedSymbolListFactory.makeSymbolList(PackedSymbolListFactory.java:88)
        at org.biojava.bio.seq.io.ChunkedSymbolListFactory.addSymbols(ChunkedSymbolListFactory.java:220)
        at org.biojavax.bio.seq.io.SimpleRichSequenceBuilder.addSymbols(SimpleRichSequenceBuilder.java:256)
        at org.biojavax.bio.seq.io.UniProtFormat.readRichSequence(UniProtFormat.java:604)
        ... 3 more
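
The trace suggests the packing step is reached from inside addSymbols, which may explain why only long sequences fail. The following is a toy model of that failure mode, not BioJava code: the class ChunkedBufferSketch, its chunk size, and its "packable" and "fallback" flags are all hypothetical names invented for illustration. It assumes that a chunk is only handed to the packing step once the buffer fills, so short inputs never reach the alphabet check.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of a chunked symbol buffer. Hypothetical; NOT BioJava's implementation. */
class ChunkedBufferSketch {
    private final int chunkSize;      // symbols held before the buffer is renewed
    private final boolean packable;   // assumption: whether the alphabet has a packing
    private final boolean fallback;   // assumption: store chunks unpacked if packing is unsupported
    private final List<String> chunks = new ArrayList<>();
    private final StringBuilder buffer = new StringBuilder();

    ChunkedBufferSketch(int chunkSize, boolean packable, boolean fallback) {
        this.chunkSize = chunkSize;
        this.packable = packable;
        this.fallback = fallback;
    }

    /** Appends symbols; renews the buffer each time it fills up. */
    void addSymbols(String symbols) {
        for (int i = 0; i < symbols.length(); i++) {
            buffer.append(symbols.charAt(i));
            if (buffer.length() == chunkSize) {
                renewBuffer();
            }
        }
    }

    /** Moves the full buffer into the chunk list; this is where the alphabet check happens. */
    private void renewBuffer() {
        if (!packable && !fallback) {
            throw new IllegalStateException("Can't pack this alphabet");
        }
        // The toy keeps the chunk as a plain string; a real packing would compress it.
        chunks.add(buffer.toString());
        buffer.setLength(0);
    }

    /** Returns all stored symbols: completed chunks followed by the partial buffer. */
    String toSequence() {
        StringBuilder all = new StringBuilder();
        for (String chunk : chunks) {
            all.append(chunk);
        }
        return all.append(buffer).toString();
    }

    public static void main(String[] args) {
        ChunkedBufferSketch strict = new ChunkedBufferSketch(8, false, false);
        strict.addSymbols("MLKPSGL");   // 7 symbols: below the chunk size, no check yet
        try {
            strict.addSymbols("PGS");   // 8th symbol fills the buffer and triggers the check
        } catch (IllegalStateException e) {
            System.out.println("Failed on renewal: " + e.getMessage());
        }

        ChunkedBufferSketch lenient = new ChunkedBufferSketch(8, false, true);
        lenient.addSymbols("MLKPSGLPGSSSPTRSLMTG");
        System.out.println(lenient.toSequence());
    }
}
```

In this sketch the lenient variant simply keeps unpackable chunks as they are. Whether a fallback of that kind is the right fix for ChunkedSymbolListFactory is an open question for the maintainers.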

