[EMBOSS] Database indexing logfiles

Peter Rice pmr at ebi.ac.uk
Thu Jul 28 16:14:06 UTC 2005


After comments on this list, I have updated the dbiflat logfile.

It now includes:

Field names are the short names used by the USA (the index file names still 
work on the dbiflat commandline). These are the same names as SRS uses in its 
commandline queries.

Numbers of tokens for each index field in each file, and total unique values 
in each field index.

Full paths for all directories (including the current working directory)

Today's date (also written to the index file headers) if no date is given.

The full commandline - if there were prompts with non-default replies these 
will be included in the commandline reported. This uses new ACD functions that 
can be used to report in other programs. Any special requests for this 
information in other outputs?

regards,

Peter

> %cat outfile.dbiflat
> ########################################
> # Program: dbiflat
> # Rundate: Thu Jul 28 2005 17:04:58
> # Dbname: EMBL
> # Release: 0.0
> # Date: 28/07/05
> # CurrentDirectory: /homes/pmr/hgmp/test/embl/
> # IndexDirectory: ./
> # IndexDirectoryPath: /homes/pmr/hgmp/test/embl/
> # Maxindex: 0
> # Fields: 6
> #   Field 1: id
> #   Field 2: acc
> #   Field 3: sv
> #   Field 4: des
> #   Field 5: key
> #   Field 6: org
> # Directory: ./
> # DirectoryPath: /homes/pmr/hgmp/test/embl/
> # Filenames: *.dat
> # Exclude: 
> # Files: 10
> #   File 1: ./est.dat
> #   File 2: ./fun.dat
> #   File 3: ./hum1.dat
> #   File 4: ./inv.dat
> #   File 5: ./pln.dat
> #   File 6: ./pro.dat
> #   File 7: ./rod.dat
> #   File 8: ./sts.dat
> #   File 9: ./vrl.dat
> #   File 10: ./vrt.dat
> ########################################
> # Commandline: dbiflat
> #    -fields acnum,seqvn,des,keyword,taxon
> #    -dbname EMBL
> #    -idformat embl
> #    -auto
> ########################################
> 
> processing filename 'est.dat' ... 1 entries
>    acc 1
>     sv 3
>    des 15
>    key 1
>    org 14
> processing filename 'fun.dat' ... 1 entries
>    acc 1
>     sv 3
>    des 8
>    key 1
>    org 9
> processing filename 'hum1.dat' ... 18 entries
>    acc 53
>     sv 54
>    des 200
>    key 43
>    org 252
> processing filename 'inv.dat' ... 3 entries
>    acc 3
>     sv 9
>    des 20
>    key 3
>    org 33
> processing filename 'pln.dat' ... 3 entries
>    acc 7
>     sv 9
>    des 19
>    key 6
>    org 54
> processing filename 'pro.dat' ... 9 entries
>    acc 13
>     sv 27
>    des 77
>    key 28
>    org 54
> processing filename 'rod.dat' ... 3 entries
>    acc 3
>     sv 9
>    des 28
>    key 1
>    org 45
> processing filename 'sts.dat' ... 1 entries
>    acc 1
>     sv 3
>    des 12
>    key 7
>    org 14
> processing filename 'vrl.dat' ... 1 entries
>    acc 2
>     sv 3
>    des 10
>    key 1
>    org 5
> processing filename 'vrt.dat' ... 4 entries
>    acc 4
>     sv 12
>    des 33
>    key 5
>    org 55
> 
> Index acc maxlen 8 items 84
> Index sv maxlen 10 items 90
> Index des maxlen 19 items 215
> Index key maxlen 44 items 81
> Index org maxlen 27 items 116
> 
> Total 10 files 44 entries





More information about the EMBOSS mailing list