[Bioperl-l] BinarySearch.pm
Fields, Christopher J
cjfields at illinois.edu
Tue Nov 29 22:30:58 UTC 2011
Peter,
Can you send a test file that fails? I added a few tests using an example file with a ';' in the ID line, but everything passes with our other EMBL example files. I'm also looking into adding a method that returns secondary IDs for a specific type ('ACC', for instance) so we can test the duplicate-record issue for accessions. Both changes pass the tests as is, though, so I have committed them in the meantime.
chris
On Nov 29, 2011, at 7:39 AM, Peter Rice wrote:
> While trying to use index files built by bioflat_index.pl in EMBOSS, I ran into two problems.
>
> Both appear to be in the Bio/Flat/BinarySearch.pm source file.
>
> EMBL ID lines are failing to drop the ';' from the ID. Updating the regular expression to make sure the ';' is not picked up seems to work:
>
> if ($format =~ /embl/i) {
>     return ('ID',
>             "^ID (\\S+[^; ])",
>             "^ID (\\S+[^; ])",
>             {
>                 ACC     => q/^AC (\S+);/,
>                 VERSION => q/^SV\s+(\S+)/
>             });
> }
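
A minimal sketch of how the pattern quoted above behaves on an EMBL-style ID line. The sample line, the use of `\s+` after `ID` (list archives often collapse runs of spaces), and the plain `(\S+)` capture used as a "before" case are assumptions for illustration. The `[^\s;]+` variant is shown only as one alternative way to exclude the ';', not as the change that was committed.

```perl
use strict;
use warnings;

# Hypothetical ID line in the EMBL layout where the ID is followed by ';'.
my $line = 'ID   X56734; SV 1; linear; mRNA; STD; PLN; 1859 BP.';

# Return the first capture group of $pattern in $text, or undef on no match.
sub capture_id {
    my ($pattern, $text) = @_;
    return $text =~ $pattern ? $1 : undef;
}

# Assumed "before" pattern: \S+ is greedy, so it swallows the trailing ';'.
my $greedy = capture_id(qr/^ID\s+(\S+)/, $line);

# Pattern from the message: the final [^; ] forces a backtrack off the ';'.
my $proposed = capture_id(qr/^ID\s+(\S+[^; ])/, $line);

# Alternative (illustrative only): never consume whitespace or ';' at all.
my $explicit = capture_id(qr/^ID\s+([^\s;]+)/, $line);

print "greedy:   $greedy\n";
print "proposed: $proposed\n";
print "explicit: $explicit\n";
```

One difference worth noting: `\S+[^; ]` needs at least two characters to match, so a one-character ID would not be captured by it, whereas `[^\s;]+` would still match.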
>
> The ACC secondary index has every record duplicated.
> The following line appears twice in the write_secondary_indices source code. Is that intentional?
>
> print $fh sprintf("%-${length}s",$record);
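
To illustrate why a repeated print would double the index, here is a small sketch that writes fixed-width records to an in-memory filehandle. The record strings, the width, and the `write_records` helper are made up for the example; this is not the actual write_secondary_indices code. It only shows that `sprintf("%-${length}s", ...)` left-justifies and space-pads each record to `$length` characters, so every extra print adds one more full-width record.

```perl
use strict;
use warnings;

# Write every record $times times, each padded to $length characters,
# and return the resulting buffer. (Hypothetical helper for illustration.)
sub write_records {
    my ($length, $times, @records) = @_;
    my $buffer = '';
    open(my $fh, '>', \$buffer) or die "Cannot open in-memory handle: $!";
    for my $record (@records) {
        # $times == 2 mimics the print statement appearing twice.
        print $fh sprintf("%-${length}s", $record) for 1 .. $times;
    }
    close($fh);
    return $buffer;
}

my @records = ("X56734\t0\t100", "X56735\t0\t200");  # hypothetical entries
my $length  = 20;                                    # assumed record width

my $once  = write_records($length, 1, @records);
my $twice = write_records($length, 2, @records);

printf "single print: %d records\n", length($once)  / $length;
printf "double print: %d records\n", length($twice) / $length;
```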
>
> regards,
>
> Peter Rice
> EMBOSS Team
> _______________________________________________
> Bioperl-l mailing list
> Bioperl-l at lists.open-bio.org
> http://lists.open-bio.org/mailman/listinfo/bioperl-l