[BioRuby-cvs] bioruby/sample any2fasta.rb,NONE,1.1
Pjotr Prins
pjotr at pub.open-bio.org
Fri Feb 17 14:59:29 UTC 2006
Update of /home/repository/bioruby/bioruby/sample
In directory pub.open-bio.org:/tmp/cvs-serv30194/sample
Added Files:
any2fasta.rb
Log Message:
Fixed tutorial on updated sequence handling (throw exception instead of
return nil value)
Added a FASTA output example that allows for regex filtering like
any2fasta.rb "/GAT[CT]/i" *.seq > reduced.fasta
--- NEW FILE: any2fasta.rb ---
#!/usr/bin/env ruby
#
# any2fasta.rb - convert input file into FASTA format using a regex
# filter
#
# Copyright (C) 2006 Pjotr Prins <p at bioruby.org>
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation; either version 2 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# $Id: any2fasta.rb,v 1.1 2006/02/17 14:59:27 pjotr Exp $
#
require 'bio/io/flatfile'
include Bio
usage = <<USAGE
Usage: any2fasta.rb [regex] infiles
Examples:
Output all sequences containing GATC or GATT ignoring case:
any2fasta.rb "/GAT[CT]/i" *.seq > reduced.fasta
USAGE
if ARGV.size == 0
print usage
exit 1
end
# ---- Valid regular expression - if it is not a file
regex = ARGV[0]
if regex=~/^\// and !File.exist?(regex)
ARGV.shift
else
regex = nil
end
ARGV.each do | fn |
ff = Bio::FlatFile.auto(fn)
ff.each_entry do |entry|
if regex != nil
next if eval("entry.seq !~ #{regex}")
end
print entry.seq.to_fasta(entry.definition,70)
end
end
More information about the bioruby-cvs
mailing list