[Bioperl-l] Now very OT: removing duplicate fasta records
Sam Griffiths-Jones
sgj@sanger.ac.uk
Wed, 18 Dec 2002 16:09:46 +0000 (GMT)
On Wed, 18 Dec 2002, Ewan Birney wrote:
> Much to the horror of the person who left this for me I replaced the
> whole thing in Perl and moved from a directory of UNIX files to a
> directory of RCS files in UNIX (I hadn't met relational databases
> then and I thought those were only for "professional" software
> engineers, and I stayed clear of them. Stupid in retrospect). It did
> provide marginally better data recovery at the very least. (I don't
> think Erik has ever forgiven me for not using his beloved gawk
> 5-liners). Looking back on it, I am not that proud of what I wrote,
> but it did work,
And still does!
> and it is still used in areas of Pfam today (a scary thought),
> though Kevin might have gutted most of my code by now.
>
pfam [scripts]103% find . -name '*.p[lm]' | xargs grep -i ewan
<big snips>
./pfamrcs/pfkill.pl: die("Badly - could not remove $family from
current! Please talk to ewan asap as database is in a bad state\n");
./pfamrcs/pfupdate.pl: print ("PFUPDATE: Problem - we could not copy
the files from the current set. <sigh>. Please talk to ewan\n");
./pfamrcs/pfmove.pl: print "I am really sorry. Currently this will
exit the program to prevent hanging scripts (try again in a couple of
minutes?). If the lock is not being free'd, talk to ewan\n";
Recognise these? :)
--------------------------------------------------------------------
Sam Griffiths-Jones sgj@sanger.ac.uk
http://www.sanger.ac.uk/Users/sgj +44 (0)1223 834244
Wisdom #2179: You can't hide a piece of broccoli in a glass of
milk.
--------------------------------------------------------------------