[Biopython-dev] Working with the new SearchIO API
    Kai Blin 
    kai.blin at biotech.uni-tuebingen.de
       
    Tue Oct 30 15:54:50 UTC 2012
    
    
  
On 2012-10-30 08:35, Kai Blin wrote:
Hi Bow,
> I'm mainly wondering why at this position, I can't just create the
> Hit object already, and then later set the HSPs. You could do this
> via a setter function that validates the IDs are identical if you
> want to make sure you're not shooting yourself in the foot there.
I've just stumbled over a case where not being able to pre-create Hit
objects really bites me.
See the attached hmmpfam output. You'll notice that the domain table
is not in the order of the hit table. As I'd like to preserve the
order of the hit table, the current setup of the API forces me to
either repeatedly parse the domain annotations until I find the
correct domain annotations for my hit, or to create the hits in the
order of the domain annotation table and then reshuffle them to make
sure they're in the order of the hit table.
If I could just create "empty" Hit objects when parsing the hit table,
I could easily preserve the order of the hits but still add the HSPs
as I parse them.
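To make the idea concrete, here is a minimal sketch in plain Python.
It does not use the real SearchIO classes: `Hit`, `Domain` and
`add_domain` are hypothetical stand-ins, and the rows are a few values
copied by hand from the attached output rather than parsed from it.
It only illustrates the two-pass approach I have in mind.

```python
from collections import OrderedDict
from dataclasses import dataclass, field


@dataclass
class Domain:
    """Hypothetical stand-in for an HSP: one row of the domain table."""
    model: str
    seq_from: int
    seq_to: int
    score: float
    evalue: float


@dataclass
class Hit:
    """Hypothetical stand-in for a Hit that may start out without HSPs."""
    model: str
    score: float
    evalue: float
    domains: list = field(default_factory=list)

    def add_domain(self, domain):
        # Validate the ID so a domain cannot end up on the wrong hit.
        if domain.model != self.model:
            raise ValueError("expected %s, got %s" % (self.model, domain.model))
        self.domains.append(domain)


# A few rows copied by hand from the attached output (not parsed here).
hit_rows = [
    ("Glu_synthase", 858.6, 3.6e-255),
    ("GATase_2", 731.8, 3.9e-226),
    ("Glu_syn_central", 649.1, 7.9e-213),
]
domain_rows = [
    ("GATase_2", 34, 404, 731.8, 3.9e-226),
    ("Glu_syn_central", 478, 773, 649.1, 7.9e-213),
    ("Glu_synthase", 650, 676, 1.3, 3.0),
    ("Glu_synthase", 830, 1216, 857.3, 9e-255),
]

# Pass 1: the hit table fixes the order; hits start out empty.
hits = OrderedDict((row[0], Hit(*row)) for row in hit_rows)

# Pass 2: domains arrive in sequence order and are attached by model name.
for row in domain_rows:
    hits[row[0]].add_domain(Domain(*row))

for hit in hits.values():
    print(hit.model, [(d.seq_from, d.seq_to) for d in hit.domains])
```

The hits come out in hit-table order, each domain table row is read
only once, and no reshuffling is needed afterwards.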
Cheers,
Kai
-- 
Dipl.-Inform. Kai Blin         kai.blin at biotech.uni-tuebingen.de
Institute for Microbiology and Infection Medicine
Division of Microbiology/Biotechnology
Eberhard-Karls-Universität Tübingen
Auf der Morgenstelle 28                 Phone : ++49 7071 29-78841
D-72076 Tübingen                        Fax :   ++49 7071 29-5979
Germany
Homepage: http://www.mikrobio.uni-tuebingen.de/ag_wohlleben
-------------- next part --------------
hmmpfam - search one or more sequences against HMM database
HMMER 2.3.2 (Oct 2003)
Copyright (C) 1992-2003 HHMI/Washington University School of Medicine
Freely distributed under the GNU General Public License (GPL)
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
HMM file:                 ../Shared/Pfam_fs
Sequence file:            single_porphyra_AA.fa
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Query sequence: gi|90819130|dbj|BAE92499.1|
Accession:      [none]
Description:    glutamate synthase [Porphyra yezoensis]
Scores for sequence family classification (score includes all domains):
Model           Description                             Score    E-value  N 
--------        -----------                             -----    ------- ---
Glu_synthase    Conserved region in glutamate synthas   858.6   3.6e-255   2
GATase_2        Glutamine amidotransferases class-II    731.8   3.9e-226   1
Glu_syn_central Glutamate synthase central domain       649.1   7.9e-213   1
GXGXG           GXGXG motif                             367.3   2.7e-107   1
HdeA            hns-dependent expression protein A (H     9.6      0.015   1
GDC-P           Glycine cleavage system P-protein         7.1      0.086   1
Cache_1         Cache domain                              7.0       0.14   1
IBN_N           Importin-beta N-terminal domain           8.2       0.17   1
DUF1200         Protein of unknown function (DUF1200)     6.7       0.42   1
cobW            CobW/HypB/UreG, nucleotide-binding do     5.1       0.45   1
PUF             Pumilio-family RNA binding repeat         6.5       0.47   1
Arch_flagellin  Archaebacterial flagellin                 4.1       0.66   1
FMN_dh          FMN-dependent dehydrogenase               3.2       0.89   1
RNA_pol_Rpb2_4  RNA polymerase Rpb2, domain 4             4.6        1.4   1
DUF477          Domain of unknown function (DUF477)       3.8        1.7   1
FRG1            FRG1-like family                          0.2        1.7   1
DUF1393         Protein of unknown function (DUF1393)     3.1          2   1
tRNA_anti       OB-fold nucleic acid binding domain       4.9          2   1
SelT            Selenoprotein T                           3.1        2.2   1
RNase_PH_C      3' exoribonuclease family, domain 2       4.2        2.3   1
Pencillinase_R  Penicillinase repressor                   3.9        2.5   1
Hormone_4       Neurohypophysial hormones, N-terminal     4.4        2.5   1
DSRB            Dextransucrase DSRB                       2.7        2.7   1
FtsK_SpoIIIE    FtsK/SpoIIIE family                       2.6        3.1   1
UBA             UBA/TS-N domain                           4.2        3.1   1
DUF1981         Domain of unknown function (DUF1981)      3.6        3.3   1
Gla             Vitamin K-dependent carboxylation/gam     4.0        3.5   1
Scm3            Centromere protein Scm3                   2.2        3.5   1
Ribosomal_S6    Ribosomal protein S6                      3.3        3.7   1
Cystatin        Cystatin domain                           2.4        3.9   1
Phage_prot_Gp6  Phage portal protein, SPP1 Gp6-like       1.0          4   1
DUF1976         Domain of unknown function (DUF1976)     -1.5        4.3   1
DUF37           Domain of unknown function DUF37          3.0        4.5   1
Flavodoxin_NdrI NrdI Flavodoxin like                      2.1        4.6   1
Bac_rhodopsin   Bacteriorhodopsin                         0.9        4.9   1
Nitro_FeMo-Co   Dinitrogenase iron-molybdenum cofacto     2.1        5.3   1
MoCF_biosynth   Probable molybdopterin binding domain     1.3        5.6   1
PaaA_PaaC       Phenylacetic acid catabolic protein       0.4        5.6   1
Albicidin_res   Albicidin resistance domain               1.7        5.7   1
DUF1514         Protein of unknown function (DUF1514)     3.5        5.7   1
T5orf172        T5orf172 domain                           2.0        6.1   1
Nup133_N        Nup133 N terminal like                   -0.6        6.5   1
BicD            Microtubule-associated protein Bicaud    -1.6        6.8   1
Sel1            Sel1 repeat                               2.5          7   1
CAP_C           DE   Adenylate cyclase associated (CA     1.3        7.4   1
Colicin         Colicin pore forming domain               1.4        7.5   1
MADF_DNA_bdg    Alcohol dehydrogenase transcription f     1.8        8.2   1
DUF258          Protein of unknown function, DUF258       0.3        8.3   1
PspB            Phage shock protein B                     0.4        8.4   1
GspM            General secretion pathway, M protein      1.0        8.6   1
Coq4            Coenzyme Q (ubiquinone) biosynthesis     -0.3        9.1   1
P22_AR_N        P22_AR N-terminal domain                 -0.2        9.5   1
C1_2            C1 domain                                 1.1        9.6   1
Phage_Mu_P      Bacteriophage Mu P protein               -0.4         10   1
Parsed for domains:
Model           Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
--------        ------- ----- -----    ----- -----      -----  -------
GATase_2          1/1      34   404 ..     1   385 []   731.8 3.9e-226
FRG1              1/1      88   107 ..   151   173 ..     0.2      1.7
C1_2              1/1     191   210 ..     9    27 ..     1.1      9.6
MADF_DNA_bdg      1/1     235   261 ..    57    95 .]     1.8      8.2
PaaA_PaaC         1/1     258   269 ..     1    13 [.     0.4      5.6
Albicidin_res     1/1     274   289 ..    50    65 ..     1.7      5.7
UBA               1/1     311   331 ..    18    38 .]     4.2      3.1
Gla               1/1     342   357 ..    27    42 .]     4.0      3.5
RNA_pol_Rpb2_4    1/1     369   381 ..     1    13 [.     4.6      1.4
MoCF_biosynth     1/1     371   396 ..    23    49 ..     1.3      5.6
DUF1200           1/1     389   401 ..     1    13 [.     6.7     0.42
Nup133_N          1/1     397   419 ..   475   498 .]    -0.6      6.5
DUF1976           1/1     428   448 ..  1296  1319 .]    -1.5      4.3
Bac_rhodopsin     1/1     445   472 ..   219   250 .]     0.9      4.9
Coq4              1/1     459   481 ..    60    82 ..    -0.3      9.1
Glu_syn_central   1/1     478   773 ..     1   301 []   649.1 7.9e-213
Flavodoxin_NdrI   1/1     488   497 ..   122   131 .]     2.1      4.6
P22_AR_N          1/1     524   541 ..   110   126 .]    -0.2      9.5
Cache_1           1/1     537   557 ..     1    23 [.     7.0     0.14
Glu_synthase      1/2     650   676 ..   297   323 ..     1.3        3
HdeA              1/1     727   749 ..    58    79 .]     9.6    0.015
Sel1              1/1     729   745 ..    32    49 .]     2.5        7
DUF1981           1/1     765   787 ..    62    88 .]     3.6      3.3
tRNA_anti         1/1     818   839 ..    54    85 .]     4.9        2
Cystatin          1/1     826   859 ..     1    38 [.     2.4      3.9
RNase_PH_C        1/1     827   846 ..    64    84 .]     4.2      2.3
Glu_synthase      2/2     830  1216 ..     1   412 []   857.3   9e-255
DUF258            1/1     839   860 ..   282   305 .]     0.3      8.3
Pencillinase_R    1/1     856   894 ..    84   118 .]     3.9      2.5
SelT              1/1     872   885 ..    96   111 .]     3.1      2.2
Nitro_FeMo-Co     1/1     879   897 ..    87   105 .]     2.1      5.3
DUF37             1/1     927   934 ..    61    68 .]     3.0      4.5
Scm3              1/1     953   963 ..   103   113 .]     2.2      3.5
cobW              1/1    1038  1058 ..   202   222 .]     5.1     0.45
Arch_flagellin    1/1    1050  1072 ..   197   219 .]     4.1     0.66
DUF1393           1/1    1055  1068 ..     1    14 [.     3.1        2
FtsK_SpoIIIE      1/1    1107  1143 ..   163   198 ..     2.6      3.1
FMN_dh            1/1    1109  1148 ..   291   330 ..     3.2     0.89
DSRB              1/1    1120  1134 ..     1    16 [.     2.7      2.7
Phage_Mu_P        1/1    1122  1131 ..     1    10 [.    -0.4       10
Hormone_4         1/1    1168  1176 ..     1     9 []     4.4      2.5
GDC-P             1/1    1205  1225 ..    10    30 ..     7.1    0.086
PspB              1/1    1268  1276 ..     1     9 [.     0.4      8.4
T5orf172          1/1    1271  1293 ..    35    58 ..     2.0      6.1
CAP_C             1/1    1283  1292 ..   161   170 .]     1.3      7.4
GXGXG             1/1    1290  1485 ..     1   228 []   367.3 2.7e-107
DUF1514           1/1    1453  1469 ..    50    66 .]     3.5      5.7
Colicin           1/1    1456  1467 ..   192   203 .]     1.4      7.5
Ribosomal_S6      1/1    1461  1481 ..    16    36 ..     3.3      3.7
BicD              1/1    1465  1481 ..     1    17 [.    -1.6      6.8
PUF               1/1    1470  1486 ..    19    35 .]     6.5     0.47
DUF477            1/1    1472  1495 ..     1    24 [.     3.8      1.7
Phage_prot_Gp6    1/1    1479  1492 ..     1    14 [.     1.0        4
IBN_N             1/1    1498  1516 ..     1    20 [.     8.2     0.17
GspM              1/1    1506  1520 ..     1    15 [.     1.0      8.6
Alignments of top-scoring domains:
GATase_2: domain 1 of 1, from 34 to 404: score 731.8, E = 3.9e-226
                CS    EEEEEEEEETSSHSBHHHHHHHHHHHHHGGGGSSCSTTSSCECEEEE
                   *->CGvlGfiAhikgkpshkivedaleaLerLeHRGavgADgktGDGAGI
                      CGv GfiA+ ++ ++hkiv +aleaL+++eHRGa++AD ++GDGAGI
  gi|9081913    34    CGV-GFIADVNNVANHKIVVQALEALTCMEHRGACSADRDSGDGAGI 79   
                CS EEECTCCCHHHHHHHCT----S GC-EEEEEEE-SSHHHHHHHHHHHHHH
                   ltqiPdgFFrevakelGieLpe.gqYAVGmvFLPqdelaraearkifEki
                    t+iP+++F++  ++++i++ ++   +VGm+FLP   l+    + i+E +
  gi|9081913    80 TTAIPWNLFQKSLQNQNIKFEQnDSVGVGMLFLPAHKLKES--KLIIETV 127  
                CS HHHTT-EEEEEEE--B-GGGS-HHHHHC--EEEEEEEE-TT--HHHHHHC
                   aeeeGLeVLGWReVPvnnsvLGetAlatePvIeQvFvgapsgdgedfErr
                   ++ee+Le++GWR VP+  +vLG++A  + P++eQvF+ +++ +++ +E++
  gi|9081913   128 LKEENLEIIGWRLVPTVQEVLGKQAYLNKPHVEQVFCKSSNLSKDRLEQQ 177  
                CS EEEEECHSCHHHHTHHH.    BEEEEEESSEEEEEECC-GGGHHHHBHG
                   LyviRkrieksivaenvn....fYiCSLSsrTIVYKGMLtseQLgqFYpD
                   L+++Rk+iek+i+  + +  ++fYiCSLS++TIVYKGM++s++LgqFY+D
  gi|9081913   178 LFLVRKKIEKYIGINGKDwaheFYICSLSCYTIVYKGMMRSAVLGQFYQD 227  
                CS GGSTTEEBSEEEEEECESSSSSCTGGGSSCEEECCCTTCEEEEEEEEETT
                   LqderfeSalAivHsRFSTNTfPsWplAQPfRVnslwgggivlAHNGEIN
                   L++++++S++Ai+H+RFSTNT+P+WplAQP+R         ++ HNGEIN
  gi|9081913   228 LYHSEYTSSFAIYHRRFSTNTMPKWPLAQPMR---------FVSHNGEIN 268  
                CS THHHHHHHHHHTSCCCSSTTCGHHHHCC-SSS-TTSCHHHHHHHHHHHHH
                   TlrgNrnwMraRegvlksplFgddldkLkPIvneggSDSaalDnvlEllv
                   Tl gN nwM++Re +l+s++++d++++LkPI n+++SDSa+lD ++Ell+
  gi|9081913   269 TLLGNLNWMQSREPLLQSKVWKDRIHELKPITNKDNSDSANLDAAVELLI 318  
                CS HTT--HHHHHHHHS----TT-GGGTST-HHHHHHHHHHHHHHCCHCCEEE
                   raGRslpeAlMMlIPEAWqnnpdmdkdrpekraFYeylsglmEPWDGPAa
                   ++GRs++eAlM+l+PEA+qn+pd   +++e+ +FYey+sgl+EPWDGPA+
  gi|9081913   319 ASGRSPEEALMILVPEAFQNQPDFA-NNTEISDFYEYYSGLQEPWDGPAL 367  
                CS EEEETSSEEEEEEETTTSCESEEEEEEEEEE.TTEEEEEESSC   
                   lvftDGryavgAtLDRNGLTRPaRygiTrdldkDglvvvaSEa<-*
                   +vft+G++ +gAtLDRNGL RPaRy+iT    kD+lv+v+SE+   
  gi|9081913   368 VVFTNGKV-IGATLDRNGL-RPARYVIT----KDNLVIVSSES    404  
FRG1: domain 1 of 1, from 88 to 107: score 0.2, E = 1.7
                   *->FQkfKvDLqdrklrinekDkkel<-*
                      FQk+   Lq+  +  +++D+ ++   
  gi|9081913    88    FQKS---LQNQNIKFEQNDSVGV    107  
C1_2: domain 1 of 1, from 191 to 210: score 1.1, E = 9.6
                   *->idgfyg...fYsCkkccddftl<-*
                      i+g+++ ++fY C+  c  +t+   
  gi|9081913   191    INGKDWaheFYICSLSC--YTI    210  
MADF_DNA_bdg: domain 1 of 1, from 235 to 261: score 1.8, E = 8.2
                   *->drYrrelrkirqgnsegsstgsgesykskWryyeelsFL<-*
                      +++  ++r+               ++ +kW+++  ++F    
  gi|9081913   235    SSFAIYHRRFS------------TNTMPKWPLAQPMRFV    261  
PaaA_PaaC: domain 1 of 1, from 258 to 269: score 0.4, E = 5.6
                CS    X............   
                   *->MYnFvEHGGvint<-*
                      M  Fv H G int   
  gi|9081913   258    M-RFVSHNGEINT    269  
Albicidin_res: domain 1 of 1, from 274 to 289: score 1.7, E = 5.7
                   *->LrlmharEPsLrkgtG<-*
                      L+ m+ rEP L+ +++   
  gi|9081913   274    LNWMQSREPLLQSKVW    289  
UBA: domain 1 of 1, from 311 to 331: score 4.2, E = 3.1
                CS    HHHHHHHHHTTT-HHHHHHHH   
                   *->eeakkALeatngnverAvewL<-*
                      ++a++ L a++ ++e+A+++L   
  gi|9081913   311    DAAVELLIASGRSPEEALMIL    331  
Gla: domain 1 of 1, from 342 to 357: score 4.0, E = 3.5
                CS    CSSHHHHHHHHHHCTC   
                   *->fednegtkefwrkYfg<-*
                      f++n+++  f++ Y g   
  gi|9081913   342    FANNTEISDFYEYYSG    357  
RNA_pol_Rpb2_4: domain 1 of 1, from 369 to 381: score 4.6, E = 1.4
                CS    EEETTEEEEEESS   
                   *->VYvNGklvGthrn<-*
                      V+ NGk++G + +   
  gi|9081913   369    VFTNGKVIGATLD    381  
MoCF_biosynth: domain 1 of 1, from 371 to 396: score 1.3, E = 5.6
                CS    CHHHHHHHHHHHTTTCEEEEEEEE-SS   
                   *->tNgpmLaalLresaGaevirygiVpDd<-*
                      tNg+ + a L +  G  ++ry+i +D+   
  gi|9081913   371    TNGKVIGATLDR-NGLRPARYVITKDN    396  
DUF1200: domain 1 of 1, from 389 to 401: score 6.7, E = 0.42
                   *->kYvltedtLlIks<-*
                      +Yv+t+d L+I+s   
  gi|9081913   389    RYVITKDNLVIVS    401  
Nup133_N: domain 1 of 1, from 397 to 419: score -0.6, E = 6.5
                   *->lylltrnsGvvrIeHaleedstne<-*
                      l++ + +sGvv++e +  + s  +   
  gi|9081913   397    LVIVSSESGVVQVE-PGNVKSKGR    419  
DUF1976: domain 1 of 1, from 428 to 448: score -1.5, E = 4.3
                   *->VsvYiyFkevtdnksLsEysVtyk<-*
                      V++++   ++++nk ++  sVt k   
  gi|9081913   428    VDIFS--HKILNNKEIK-TSVTTK    448  
Bac_rhodopsin: domain 1 of 1, from 445 to 472: score 0.9, E = 4.9
                CS    HHHHHHHHHHHHHHHHHCHHHTC---------   
                   *->vvAKVgFgfilLrsravlertvavgsalaage<-*
                      v++K+++g +l ++r++le  +   + l+++    
  gi|9081913   445    VTTKIPYGELLTDARQILE--HK--PFLSDQQ    472  
Coq4: domain 1 of 1, from 459 to 481: score -0.3, E = 9.1
                   *->rrILkEkPRissetldlkkLrkL<-*
                      r+IL  kP  s  ++d kkL +L   
  gi|9081913   459    RQILEHKPFLSDQQVDIKKLMQL    481  
Glu_syn_central: domain 1 of 1, from 478 to 773: score 649.1, E = 7.9e-213
                CS    HHHHHHCTT--HHHHHCTCHHHHHHSS--EE-S---S--CCC-SS--
                   *->llrrQkAFGYTyEdvelvllPMAetGkEalGSMGdDtPLAVLSekpr
                      l+++Q+AFGYT+Edvelv+++MA+++kE++++MGdD+PL +LSek++
  gi|9081913   478    LMQLQTAFGYTNEDVELVIEHMASQAKEPTFCMGDDIPLSILSEKSH 524  
                CS -GGGCEEE----SSS----TTTTGGG-B--EEES--S-TTS-SGGGC-CE
                   lLYdYFKQlFAQVTNPPIDPIREelVMSLetylGpegNlLeptpeqarrl
                   +LYdYFKQ+FAQVTNP+IDP+RE+lVMSL+ ++G+++NlL+  p+ a+++
  gi|9081913   525 ILYDYFKQRFAQVTNPAIDPLRESLVMSLAIQIGHKSNLLDDQPTLAKHI 574  
                CS EESSSB--HHHHHH.HHHH....CCCCEEEEESEEESTTSTTCHHHHHHH
                   kLesPILsnselekmlknidairegfkaatIditFdveeGvdgLeaaLdr
                   kLesP+++++el++ + +     +++++  I+++F  e+G++ ++  + +
  gi|9081913   575 KLESPVINEGELNA-IFE-----SKLSCIRINTLFQLEDGPKNFKQQIQQ 618  
                CS HHHHHHHHHHCT-SEEEEESTCG--CTTEEE--HHHHHHHHHHHHHCTT-
                   lceeAeeAirsGaniivLSDRndildeervaIPaLLAvGAVHhHLIrkgL
                   lce A++Ai +G ni+vLSD+n+ ld+e+v+IP+LLAvGAVHhHLI kgL
  gi|9081913   619 LCENASQAILDGNNILVLSDKNNSLDSEKVSIPPLLAVGAVHHHLINKGL 668  
                CS CCC-EEEEEESS--SHHHHHHHHCTT-SEEEEHCCHHHHHHHHCCCCCCC
                   RtkvslvVETGEaREvHHFAvLiGYGAsAInPYLAyETirdWWlirrGll
                   R+ +s+ VET++++++HHFA+LiGYGAsAI+PYLA+ET r+WW + ++++
  gi|9081913   669 RQEASILVETAQCWSTHHFACLIGYGASAICPYLAFETARHWWSNPKTKM 718  
                CS CHTTTS- T--HHHHHHHHHHHHHHHHHHHHHCTT--BHHHHCCS--EEE
                   lmskGkl.elsleeavkNYrkAiekGlLKIMSKMGISTlqSYrGAQIFEA
                   lmskG+l++++++ea++NY+kA+e+GlLKI+SKMGIS+l+SY+GAQIFE+
  gi|9081913   719 LMSKGRLpACNIQEAQANYKKAVEAGLLKILSKMGISLLSSYHGAQIFEI 768  
                CS SSB-H   
                   vGLsk<-*
                   +GL++   
  gi|9081913   769 LGLGS    773  
Flavodoxin_NdrI: domain 1 of 1, from 488 to 497: score 2.1, E = 4.6
                CS    -HHHHHHHHH   
                   *->TneDVerVrk<-*
                      TneDVe V +   
  gi|9081913   488    TNEDVELVIE    497  
P22_AR_N: domain 1 of 1, from 524 to 541: score -0.2, E = 9.5
                   *->dVLydYWtrkGkAv..NPR<-*
                      ++LydY+  + +A  +NP+   
  gi|9081913   524    HILYDYFK-QRFAQvtNPA    541  
Cache_1: domain 1 of 1, from 537 to 557: score 7.0, E = 0.14
                   *->wTePYvdaalktgdlViTiaqPv<-*
                      +T+P++d +  +++lV ++a+++   
  gi|9081913   537    VTNPAIDPL--RESLVMSLAIQI    557  
Glu_synthase: domain 1 of 2, from 650 to 676: score 1.3, E = 3
                CS    --HHHHHHHHHHHHHCTT-CCCSEEEE   
                   *->lPwelgLaevhqtLvengLRdrVsLia<-*
                      +P  l++ +vh  L++ gLR + s+ +   
  gi|9081913   650    IPPLLAVGAVHHHLINKGLRQEASILV    676  
HdeA: domain 1 of 1, from 727 to 749: score 9.6, E = 0.015
                   *->ACk.QdkkAsFkdKvkaEldKvk<-*
                      AC  Q+ +A++k+ v+a l K+    
  gi|9081913   727    ACNiQEAQANYKKAVEAGLLKIL    749  
Sel1: domain 1 of 1, from 729 to 745: score 2.5, E = 7
                CS    .HHH.HHHHHHHHHHTT-   
                   *->DyekeAlkwyekAAeqGn<-*
                      ++++ A + y+kA e+G    
  gi|9081913   729    NIQE-AQANYKKAVEAGL    745  
DUF1981: domain 1 of 1, from 765 to 787: score 3.6, E = 3.3
                   *->iFgvltlaakeesesivklAfqiid.qi<-*
                      iF++l+l++       v+lAf+ +++qi   
  gi|9081913   765    IFEILGLGSEV-----VNLAFKGTTsQI    787  
tRNA_anti: domain 1 of 1, from 818 to 839: score 4.9, E = 2
                CS    EEEEEEETTSSTSTCTCTT..EEEEEEEEEEE   
                   *->tGkvkkrpggeqNnlkTGeKAlelvveeievl<-*
                      +G v+ rpgge          ++++ +e+      
  gi|9081913   818    YGFVQYRPGGE----------YHINNPEMSKA    839  
Cystatin: domain 1 of 1, from 826 to 859: score 2.4, E = 3.9
                CS    ECEEEEET.STSHHHHHHHHHHHHHHHHHSSSSEEEEE   
                   *->GglspvdpNendpevqealdfAlakyNeksndnylfel<-*
                      Gg   +++    pe  +al+ A+  yN +  +ny++ l   
  gi|9081913   826    GGEYHINN----PEMSKALHQAVRGYNPEYYNNYQSLL    859  
RNase_PH_C: domain 1 of 1, from 827 to 846: score 4.2, E = 2.3
                CS    SSSS.B.HHHHHHHHHHHHHH   
                   *->GkgnglteelleealelAkeg<-*
                      G +++++ +++ +al++A+ g   
  gi|9081913   827    G-EYHINNPEMSKALHQAVRG    846  
Glu_synthase: domain 2 of 2, from 830 to 1216: score 857.3, E = 9e-255
                CS    -SS-HHHHHHHHHHHHC--T-HHHHHHHHHHHHTS.-S-SGGGGEEE
                   *->hrnepeviktlqkavqvpveskpsydkYreplnertpigalrdlLef
                      h n+pe++k l++av+    +   y +Y+ +l +r p++alrdlL++
  gi|9081913   830    HINNPEMSKALHQAVRG--YNPEYYNNYQSLLQNR-PPTALRDLLKL 873  
                CS --SS--......--GGGS--HHHHHTTEEEEEB-CTTC-HHHHHHHHHHH
                   kyaeepldtdkiipieevepaleikkrfctgaMSyGALSeeAheALAiAm
                    ++++p      i+i+eve+++ i + fctg+MS+GALS+e+he+LAiAm
  gi|9081913   874 QSNRAP------ISIDEVESIEDILQKFCTGGMSLGALSRETHETLAIAM 917  
                CS HHCT-EEEETTT---GGGCSB-TTS-T S BTTSTT--S--TT-B---SE
                   nriGtksNtGEGGedperlkpaadlds.G.SpTlpHLkGLqnednarSAI
                   nriG+ksN+GEGGedp r+k + d++s+G+Sp lpHLkGL+n+d+a+SAI
  gi|9081913   918 NRIGGKSNSGEGGEDPVRFKILNDVNSsGtSPLLPHLKGLKNGDTASSAI 967  
                CS EEE-TT-TT--............HHHHCC-SEEEEE---TTSTTT--EE-
                   kQvASGRFGVtkRnGefWeefkRseYLvnAdalEIKiAQGAKPGeGGhLP
                   kQ+ASGRFGVt            +eYL+nA++lEIKiAQGAKPGeGG+LP
  gi|9081913   968 KQIASGRFGVT------------PEYLMNAKQLEIKIAQGAKPGEGGQLP 1005 
                CS GGG--HHHHHHHTS-TT--EE--SS-TT-SSHHHHHHHHHHHHHH-.TTS
                   GeKVspeIAriRnstPGvgliSPpPHHDIysiEDLaqLIydLkeindpkA
                   G+K+sp+IA +R ++PGv liSPpPHHDIysiEDL+qLI+dL++in pkA
  gi|9081913  1006 GKKISPYIATLRKCKPGVPLISPPPHHDIYSIEDLSQLIFDLHQIN-PKA 1054 
                CS EEEEEEE-STTHHHHHHH...HHHTT-SEEEEE-TT---SSEECCHHHHC
                   pisVKLVsehgvgtiaaGhmqvakAnADiIlIdGhdGGTGASpktsikha
                   +isVKLVse g+gtiaaG   vak+nADiI+I+GhdGGTGASp++sikha
  gi|9081913  1055 KISVKLVSEIGIGTIAAG---VAKGNADIIQISGHDGGTGASPLSSIKHA 1101 
                CS ---HHHHHHHHHHHHHCTT-CCCSEEEEESS--SHHHHHHHHHCT-SEEE
                   GlPwelgLaevhqtLvengLRdrVsLiadGGLrTGaDVakAaaLGAdavg
                   G PwelgL+evhq+L en+LRdrV+L++dGGLrTG D+++Aa++GA+++g
  gi|9081913  1102 GSPWELGLSEVHQLLAENQLRDRVTLRVDGGLRTGSDIVLAAIMGAEEFG 1151 
                CS -SHHHHHHCT--S---CCCT--TTSSS---CCHH..CT----HHHHHHHH
                   iGTaaLiAlGCimaRvCHtntCPvGvATQDPeLrKrlkfegaperVvNyf
                   +GT+a+iA+GCimaR+CHtn+CPvGvATQ++eLr   +f g+pe +vN+f
  gi|9081913  1152 FGTVAMIATGCIMARICHTNKCPVGVATQREELR--ARFSGVPEALVNFF 1199 
                CS HHHHHHHHHHHHHHT-S   
                   iflaeEvrellaqlGfr<-*
                   +f+  Evre+la+lG++   
  gi|9081913  1200 LFIGNEVREILASLGYK    1216 
DUF258: domain 1 of 1, from 839 to 860: score 0.3, E = 8.3
                CS    HHHHHHHCTSS-HHHHHHHHHHHH   
                   *->AVkaAveeGeIseeRYesYlklle<-*
                      A+ +Av    +++e Y++Y+ ll+   
  gi|9081913   839    ALHQAVR--GYNPEYYNNYQSLLQ    860  
Pencillinase_R: domain 1 of 1, from 856 to 894: score 3.9, E = 2.5
                CS    XXXXXXXXXXXXXXXXXXX    XXXXXXXXXXXXXXXX   
                   *->drlfggsvgalvanfleee....klSeddieeLrelLde<-*
                      + l++++++ ++ ++l+ ++++ ++S d++e ++++L++   
  gi|9081913   856    QSLLQNRPPTALRDLLKLQsnraPISIDEVESIEDILQK    894  
SelT: domain 1 of 1, from 872 to 885: score 3.1, E = 2.2
                   *->KLqtGrvYAPPtpqEL<-*
                      KLq++r   P++++E+   
  gi|9081913   872    KLQSNRA--PISIDEV    885  
Nitro_FeMo-Co: domain 1 of 1, from 879 to 897: score 2.1, E = 5.3
                CS    EEE-TTSSBHHHHHHHHHC   
                   *->pikagegetieeaiealqe<-*
                      pi   e e+ie+ + ++ +   
  gi|9081913   879    PISIDEVESIEDILQKFCT    897  
DUF37: domain 1 of 1, from 927 to 934: score 3.0, E = 4.5
                   *->hpGGyDPV<-*
                      ++GG DPV   
  gi|9081913   927    GEGGEDPV    934  
Scm3: domain 1 of 1, from 953 to 963: score 2.2, E = 3.5
                   *->HLraLeteddi<-*
                      HL++L+++d++   
  gi|9081913   953    HLKGLKNGDTA    963  
cobW: domain 1 of 1, from 1038 to 1058: score 5.1, E = 0.45
                CS    ...HHHHHHHHHH-SSS-EEE   
                   *->adlekleadlrrlnpeapiip<-*
                      +dl++l+ dl+++np+a+i     
  gi|9081913  1038    EDLSQLIFDLHQINPKAKISV    1058 
Arch_flagellin: domain 1 of 1, from 1050 to 1072: score 4.1, E = 0.66
                   *->inpstkvrgeVvpenGapgtief<-*
                      inp  k+++++v+e+G+ ++      
  gi|9081913  1050    INPKAKISVKLVSEIGIGTIAAG    1072 
DUF1393: domain 1 of 1, from 1055 to 1068: score 3.1, E = 2
                   *->klSvKtVVAiGIGA<-*
                      k+SvK V  iGIG+   
  gi|9081913  1055    KISVKLVSEIGIGT    1068 
FtsK_SpoIIIE: domain 1 of 1, from 1107 to 1143: score 2.6, E = 3.1
                   *->lviDnydeLaeenlL.ervtsLknqGlsygvhvmata<-*
                      l++ + ++L +en+L++rvt+ + +Gl +g +++++a   
  gi|9081913  1107    LGLSEVHQLLAENQLrDRVTLRVDGGLRTGSDIVLAA    1143 
FMN_dh: domain 1 of 1, from 1109 to 1148: score 3.2, E = 0.89
                CS    HHHHHHHHHCHHTTTSSEEEEESS-SSHHHHHHHHHHTSS   
                   *->LpeVvPIlkeaAvkgdieVllDgGvRRGtDVlKALALGAr<-*
                      L eV  +l e  + +++   +DgG R+G+D++ A  +GA+   
  gi|9081913  1109    LSEVHQLLAENQLRDRVTLRVDGGLRTGSDIVLAAIMGAE    1148 
DSRB: domain 1 of 1, from 1120 to 1134: score 2.7, E = 2.7
                   *->mKvndrvtvKtDGgpR<-*
                       ++ drvt + DGg R   
  gi|9081913  1120    -QLRDRVTLRVDGGLR    1134 
Phage_Mu_P: domain 1 of 1, from 1122 to 1131: score -0.4, E = 10
                   *->sntVtLrvgG<-*
                       ++VtLrv+G   
  gi|9081913  1122    RDRVTLRVDG    1131 
Hormone_4: domain 1 of 1, from 1168 to 1176: score 4.4, E = 2.5
                CS    X-TT--TT-   
                   *->CyirnCPrG<-*
                      C  + CP+G   
  gi|9081913  1168    CHTNKCPVG    1176 
GDC-P: domain 1 of 1, from 1205 to 1225: score 7.1, E = 0.086
                   *->eqqeMLstiGlssLddLidat<-*
                      e++e+L+++G++sLdd ++++   
  gi|9081913  1205    EVREILASLGYKSLDDITGQN    1225 
PspB: domain 1 of 1, from 1268 to 1276: score 0.4, E = 8.4
                   *->MsaffLagP<-*
                      M+ ++La+P   
  gi|9081913  1268    MDDDILAIP    1276 
T5orf172: domain 1 of 1, from 1271 to 1293: score 2.0, E = 6.1
                   *->dvvalievedaraklEklLHkrFk<-*
                      d+ a+ ev++a  klE+++ k+Fk   
  gi|9081913  1271    DILAIPEVSNAI-KLETEITKHFK    1293 
CAP_C: domain 1 of 1, from 1283 to 1292: score 1.3, E = 7.4
                CS    EEEEEE----   
                   *->KLvTevveha<-*
                      KL+Te++ h    
  gi|9081913  1283    KLETEITKHF    1292 
GXGXG: domain 1 of 1, from 1290 to 1485: score 367.3, E = 2.7e-107
                CS    EEEEE-TT--STTHHHHHHHHHHCTTTS.S-TTCEEEEEEEEE-TTT
                   *->keeaiiNtdrlvgtrlsgeiakkygeegalpkdtgkivfnGsAGqsf
                      k+++i Nt+r+vgtrlsg iak yg+ g + k+ +k++f+GsAGqsf
  gi|9081913  1290    KHFKIANTNRTVGTRLSGIIAKNYGNTG-F-KGLIKLNFYGSAGQSF 1334 
                CS TTT-BTTEEEEEEEEE-S.TTTTT-ECCEEEEE--TT-.......SS-GG
                   GafmagGvtLeleGdAnddyvGkgmsGGeIvikgnagdpvGnnMdageyv
                   Gaf+a+G++L l+G+And yvGkgm+GG+Ivi+++ag         +e +
  gi|9081913  1335 GAFLASGINLKLMGEAND-YVGKGMNGGSIVIVPPAGT-------IYEDN 1376 
                CS GSEEC-SSTTTT--CEEEEESSEE-TTTTTT-.....CCEEEEESEB.-S
                   gnviaGNtclyGatGGkifiaGdAGerfgvrnkayKdsgatiVveGvaGd
                   ++vi+GNtclyGatGG++f++G+AGerf+vrn     s a+ VveGv Gd
  gi|9081913  1377 NQVIIGNTCLYGATGGYLFAQGQAGERFAVRN-----SLAESVVEGV-GD 1420 
                CS STTTT-EEEEEEESS-B-SSBTTT--CCEEEEE-TTS.......THHHHB
                   hggEYMtGGtivVlGdaGrnvGagMtGGiaYvlgeiedfsyMiatlpgkv
                   h++EYMtGG+ivVlG+aGrnvGagMtGG+aY+l+e+e        + ++v
  gi|9081913  1421 HACEYMTGGVIVVLGKAGRNVGAGMTGGLAYFLDEDE-------NFIDRV 1463 
                CS -CCCEEEE...ES-S......CCHHHHHHHH   
                   nleiVeledlkrievkrkklLpegekqlkel<-*
                   n+eiV+ +   r+ +      ++ge+qlk+l   
  gi|9081913  1464 NSEIVKIQ---RVIT------KAGEEQLKNL    1485 
DUF1514: domain 1 of 1, from 1453 to 1469: score 3.5, E = 5.7
                   *->LeeyrieveRikkevkk<-*
                      L e+++ ++R++ e+ k   
  gi|9081913  1453    LDEDENFIDRVNSEIVK    1469 
Colicin: domain 1 of 1, from 1456 to 1467: score 1.4, E = 7.5
                CS    SHHHHHHHHHCH   
                   *->DdkfveklNkli<-*
                      D++f++ +N +i   
  gi|9081913  1456    DENFIDRVNSEI    1467 
Ribosomal_S6: domain 1 of 1, from 1461 to 1481: score 3.3, E = 3.7
                CS    CCHHHHHHHHHHHHHCTT-EE   
                   *->EqvkqeiekYqkvLtnngAei<-*
                      ++v++ei k+q+v+t++g+e+   
  gi|9081913  1461    DRVNSEIVKIQRVITKAGEEQ    1481 
BicD: domain 1 of 1, from 1465 to 1481: score -1.6, E = 6.8
                   *->gqaysnqrkvAkdGeer<-*
                       + +++qr+ +k Gee+   
  gi|9081913  1465    SEIVKIQRVITKAGEEQ    1481 
PUF: domain 1 of 1, from 1470 to 1486: score 6.5, E = 0.47
                   *->lQkllevateeqkqlil<-*
                      +Q+++++a+eeq ++++   
  gi|9081913  1470    IQRVITKAGEEQLKNLI    1486 
DUF477: domain 1 of 1, from 1472 to 1495: score 3.8, E = 1.7
                   *->gtLspserarLeqalaalEqktga<-*
                      ++++++  ++L   ++  ++ktg+   
  gi|9081913  1472    RVITKAGEEQLKNLIENHAAKTGS    1495 
Phage_prot_Gp6: domain 1 of 1, from 1479 to 1492: score 1.0, E = 4
                   *->eEmikkFidkHklr<-*
                      eE +k++i+ H+++   
  gi|9081913  1479    EEQLKNLIENHAAK    1492 
IBN_N: domain 1 of 1, from 1498 to 1516: score 8.2, E = 0.17
                CS    HHHHHHHHHCCTHHCHHHHH   
                   *->AEkqLeqlekqklPgfllaL<-*
                      A++ Le+++++ lP+f++ +   
  gi|9081913  1498    AHTILEKWNSY-LPQFWQVV    1516 
GspM: domain 1 of 1, from 1506 to 1520: score 1.0, E = 8.6
                CS    XXXXXXXXXXXXXXX   
                   *->mneLqawWqgrspRE<-*
                      ++ L ++Wq ++p+E   
  gi|9081913  1506    NSYLPQFWQVVPPSE    1520 
//
    
    