[emboss-dev] USA syntax and `%' character in sequence file names

Peter Rice pmr at ebi.ac.uk
Mon Nov 3 11:19:44 UTC 2008


Nicolas Joly wrote:
> On Thu, Oct 23, 2008 at 10:47:54PM +0100, ajb at ebi.ac.uk wrote:
>> Hi Nicolas,
>>
>> What it does, given a USA like:
>>
>>     foo%10
>>
>> is to seek 10 bytes into file foo and try to start
>> reading a sequence from there. It does not, however, currently check that
>> what appears after the '%' is a valid number. I believe invalid numbers
>> are equivalent to an offset of 0.
>>
>> I suspect it might have been intended as a useful debugging tool for
>> the programmer rather than as something for the biologist.
>> If we leave it as an option we ought to mention it the documentation
>> in some form though.
> 
> Thanks, Alan. Personally, i would get rid of it. But if you plan to
> keep it, please check for valid numbers before using it.

We do need it - for saving USAs when reading files.

For example, sequence file formats where the ID is not unique or has to be 
generated. Also potentially useful together with the offsets stored by the 
database indexing systems and for future use with other data types.

Yes, we will fix it to check that the number is valid... and add to the 
documentation.

regards,

Peter



More information about the emboss-dev mailing list