[Bioperl-l] bl2seq

Paulo Almeida palmeida at igc.gulbenkian.pt
Tue Aug 23 04:41:09 EDT 2005


Hi Usha,

Perhaps something like this:

my @seqs;
open IN, "yourfile";
while (<IN>) {
chomp();  #get rid of newline character
my @line = split(/\s+/);
push @seqs, $line[3] if $line[3] =~ /^A-Z+$/;
}

foreach my $seq (@seqs) {
#Do whatever you are doing successfully for a single sequence
}

I'm not sure about the syntax. because I haven't been using Perl, but that's 
the general idea.

-Paulo

On Tuesday 23 August 2005 02:32, Usha Rani Reddi wrote:
> Hi,
> I am trying to compare two hundred thousand probes(each one of them) to
> another genome. Format of the file containing probes is like this
> SEQ_ID PROBE_ID POSITION PROBE_SEQUENCE
> NC_004116 1 1 AATTAACATTGTTGATTTTATTCTTCAACATC
> NC_004116 3 13 TGATTTTATTCTTCAACATCTGTGGAAAACTT
> NC_004116 5 25 TCAACATCTGTGGAAAACTTTATTTTTTTATG
> NC_004116 7 37 GAAAACTTTATTTTTTTATGGTACAATATAAC
> NC_004116 9 49 TTTTTATGGTACAATATAACAATAATTATCCA
> NC_004116 11 61 AATATAACAATAATTATCCACAAGACAATAAG
> NC_004116 13 73 ATTATCCACAAGACAATAAGGAAGAAGCTATG
> NC_004116 15 85 ACAATAAGGAAGAAGCTATGACGGAAAACGAA
> What I am trying to do is compare PROBE_SEQUENCE to fasta file of
> Streptococcus agalactiae. I am trying to loop through the probes but not
> sure how to proceed. My program is working fine for single sequence. One
> more thing is I am not interested in matches, I want to display only
> mismatches. I am new to Bioperl, some one please help me with this.
> Thanks
> Usha
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/bioperl-l

-- 
Paulo Almeida
Tel: +351 21 4464635, Fax: +351 21 4407970
Instituto Gulbenkian de Ciência
Rua da Quinta Grande, 6
P-2780-156 Oeiras
Portugal
http://www.igc.gulbenkian.pt



More information about the Bioperl-l mailing list