[Bioperl-l] New to BioPerl
Jason Stajich
jason@cgt.mc.duke.edu
Wed, 27 Feb 2002 10:27:17 -0500 (EST)
use Bio::SearchIO and read the bptutorial on how to extract the
information you need -- all the tools you need to do this are in the
Bio::SearchIO parser system and Bio::Search objects.
I have a script that extracts tissue information for an EST
blast see scripts/est_tissue_query.pl in the bioperl repository. That
script is trying to do a lot so it might be a little confusing.
it's run like
% perl scripts/est_tissue_query.pl -r genbank -p 0.00001 -f blast -b
MYFILE
# you can add -c cache if you want to build a temporary cache so multiple
This script has been fixed some just now to use Bio::SearchIO so you
probably want the latest CVS version which you can see how to get from
http://cvs.bioperl.org
I'd suggest that you spend a little time learning bioperl by running
through the tutorial and building a simle script that prints out the list
of hits from a blast report using our objects. Then I suspect the path
will be more obvious. Lots of good discussions of this type on our
mailing list archives.
-jason
On Wed, 27 Feb 2002 holford.5@osu.edu wrote:
> Does a Perl script exist for such a job or would I need to develop one?
>
> I am in the process of learning Perl and still very wet behind the ears. I
> am a C/C++ programmer and I have been asked to develop a software program
> to automate a process. This process deals with parsing blast
> output. Since my background in not in Biology I will not attempt to
> paraphrase the request. So here it is in its entirety.
>
> "Do you know of a program that would allow or facilitate in silico
> > Northern (gene expression) analysis? What I would like to do is to take
> > a specific gene sequence and blast it against the soybean EST database
> > and then extract the hits below a certain E value into *categories*
> > (e.g., "roots", "inoculated", etc) based on the cDNA library
> > information for the particular EST in the text that accompanies the
> > blast output. It could also be done based on the cDNA library #
> > associated with the EST hits. This allows a quick determination of
> > whether a particular gene, for instance, is root specific in its
> > expression, etc. It would be extremely valuable to us in deciding what
> > genes to focus on (from a gene family, for instance) for expression
> > analysis with real life Northerns.
> > I think that such a program would be very valuable for alot of us!"
>
> Thanking you in advance,
>
> Ian
>
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l@bioperl.org
> http://bioperl.org/mailman/listinfo/bioperl-l
>
--
Jason Stajich
Duke University
jason@cgt.mc.duke.edu