[Bioperl-l] How to remove redundancy ?

michael michael@galton.ucl.ac.uk
Sat, 16 Nov 2002 07:59:29 +0000 (GMT)


> On Fri, 15 Nov 2002, Marc Logghe wrote:
>
> > I think he means a non-redundant data set on the sequence level,
> > like nr and nt of genbank.

> > Jason Stajich wrote:
>
> If indeed you want a nonredundant sequence db just use nrdb:
> http://blast.wustl.edu/pub/nrdb/
>
> This won't require you to rely on the sequences being in genbank.


	Apparently the NCBI DNA nonredundant sequence db isn't

<http://www.ncbi.nlm.nih.gov/blast/html/blastcgihelp.html#nucleotide_databases>

nr
All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1
or 2 HTGS sequences). No longer "non-redundant".



~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Michael John Lush PhD		 	 Tel:44-20-7679-5027
Nomenclature Bioinformatics Support 	 Fax:44-20-7387-3496
HUGO Gene Nomenclature Committee Email:  nome@galton.ucl.ac.uk
The Galton Laboratory
University College London, UK
URL: http://www.gene.ucl.ac.uk/nomenclature/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~